This open-source model just became a major challenger to Claude
This is just unbelievable.
Imagine having a model that’s:
- More than 5 times cheaper than Claude Opus — yet just as intelligent?
- Blazing fast — up to 15x faster than Claude Opus for heavy prompts and massive context?
- Open-source with open weights to top it all off?
Yeah it’s definitely no surprise how so many people have been going absolutely wild over the few days over the new MiniMax M3.
Look how closely it matches Claude’s and GPT’s capabilities! It’s even better in certain key areas:

I still can’t wrap my head around the absolutely wild feats of endurance and intelligent it displayed during testing — and has been performing ever since then.
This just truly completely shattered the limits of what we thought was possible with open-source models — or any type of model really.
1. Not the same “1 million token context”…
Don’t let AI companies fool you with their loud claims of 1 million token context.
Not all 1 million token contexts are remotely the same — not even close.
Yes they can all process massive codebases, research archives, books, or long-running projects.
But they vastly differ in the quality of processing all that massive data, how much they can actually make sense of, how much they comprehend the interconnectedness of all the data.
And speed. A big, big one.
Most AI models become painfully slow and expensive when dealing with that much information.
M3 uses an innovative new approach called MiniMax Sparse Attention (MSA) that allows it to focus on the important parts instead of wasting resources on everything at once.
This allows M3 to:
- Read huge amounts of information more than 9x faster
- Generate responses more than 15x faster
- Use a fraction of the computing power normally required
The giant context window is no longer just for fancy — it actually becomes a beast in the real world.
And those models that still use slower methods of parsing large token contexts will get absolutely left in the dust.
2. Insane long-horizon ability
MiniMax M3 shocked every with insane feats of autonomy on tasks spanning several hours.
The 12-hour research paper loop: M3 was given a complex AI research paper and spent 12 hours straight reproducing the core experiments. It managed its own context window, parsed charts, wrote code, handled 18 GitHub commits, and generated 23 experimental figures completely unaided.
Self-training models: In another test, M3 was given raw base models and spent 12 hours handling data synthesis, training, and evaluation loops on them entirely on its own.
The 24-hour CUDA optimization: It was assigned to optimize an FP8 GEMM kernel (a notoriously painful low-level GPU operation). Over 24 hours, it autonomously went through 147 benchmark submissions and nearly 2,000 tool calls, boosting hardware peak utilization from a lousy 7.6% to a highly optimized 71.3% — a 9.4x speedup with zero human help.
This is the kind of work you’d normally assign to a high-level engineer or researcher, yet here comes MiniMax M3 doing it all on its own, unbelievable.
3. Punching way above its weight
Despite competing against much larger companies with way bigger resources at their disposal, M3 is posting amazing benchmark results.
- SWE-Bench Pro: 59.0%, edging past GPT-5.5 and Gemini 3.1 Pro
- BrowseComp: 83.5, outperforming Claude Opus 4.7’s 79.3
MiniMax is competing right at the cutting edge, particularly in software engineering and autonomous research tasks.
4. Ultra-competitive pricing
It will undoubtedly be one of the biggest selling points.
All that intelligence and speed and innovation, for such a ridiculously low price.
M3 launched at roughly:
- $0.60 per million input tokens
- $2.40 per million output tokens
That makes it significantly cheaper than many of the most advanced AI models available today.
Just compare to the latest Claude Opus pricing:
- $5.00 per million input tokens
- $25.00 per million output tokens
This will be huge for startups and smaller teams.
Many AI projects fail not because the technology isn’t good enough, but because running them becomes too expensive. Lower costs mean more companies can build products that were previously out of reach.
5. Open weights available to everyone
MiniMax isn’t keeping everything locked behind a proprietary wall.
And open-source availability means:
- Greater transparency
- Community-driven improvements
- Easier experimentation
- More deployment flexibility
- Reduced vendor lock-in
Open models used to trail the very best closed systems — not anymore.
M3 is part of the new wave of the models that completely disrupts the notion of open-source models being inherently weaker.
Why M3 matters
The biggest takeaway from M3 isn’t that it’s smarter than every other model.
It’s that MiniMax is attacking some of the biggest problems in AI at the same time:
- Cost
- Speed
- Long-term autonomy
- Multimodal understanding
- Accessibility through open source
If these early results hold up in real-world use, M3 could become one of the most important AI launches of the year — not because it’s the biggest model, but because it makes powerful AI far more practical for everyone.









































