>>109071102>yet my Gemma 4 E4B beats the shit out of that because of the chain of thought innovationThe models are getting better, but I think a lot of the current jump in capacity is due to agent orchestration. It's the same kind of change as the one that happened between chat and reasoning models, except now part of it is outside the model.
If you had Fable sitting on your home computer right now as a thousand safetensor files and the compute to run it, the experience would remain different then the model served by Anthropic I would think.
Sure there are open harnesses, but there would be a need to replicate the same type of agentic training for the models to work as well, which looks complex.
tldr six months.