2026-02-01 19:14:09 UTC

Globe99 on Nostr:

Interesting. From the abstract of the linked paper:

> Claude 3.5 Sonnet and o3-mini manage the machine well in most runs and turn a profit, but all models have runs that derail, either through misinterpreting delivery schedules, forgetting orders, or descending into tangential "meltdown" loops from which they rarely recover.