That's already kind of how tools like Opencode Claude Code and Copilot work now. Copilot doesn't do it unless you remind the model is has the power to use a subagent, but yeah it works very well for that exact purpose.
Some people use it to save tokens or time for work. Like use opus for planning, then fire off some gpt or sonnet, or smaller models for other tasks. I often use the free models to do things like read and summarize code changes then report back to the main model. Copilot currently doesn't charge for submodel usage, but im sure they will in time.
It's also possible in Opencode to mix in your own local models too, although I've been struggling with it for a while.
I just setup some automated PR tasks using opencode. I tired using my self hosted models, but they couldn't fight their way out of a wet paper bag unfortunatly. I spent way too much time playing with it for now.
