Wondering if I'm being unreasonable, I'm running a test, taking turns trying each model within one chat, with the same prompt: "I'll upload a file - don't do anything with it until I tell you more." I'm uploading a different txt file each time.
Llama 3.3, OpenAI GPT-OSS, Gemma, and Qwen3-VL passed the test fine. Qwen 3 Coder and even Kimi K2 showed considerable restraint. DeepSeek R1 thought for 44 seconds and responded with a long answer, but didn't write code.
So I can try uploading files with the chat set to a model that obeys more easily, then switching to an overly eager one when it's time to build.
