we need a lot more innovation on open source *small* foundation models, as they will be what enable a lot of use cases, not your huge 1T parameter chatbot
quotinginteresting, new small qwen 3.5 models with no instruction tuning
nevent1q…mjcu

quotinginteresting, new small qwen 3.5 models with no instruction tuning
nevent1q…mjcu