Zhipu AI releases GLM-4.6 with open weights and frontier-tier benchmarks
Zhipu AI released GLM-4.6 with open weights under Apache 2.0, frontier-tier reasoning benchmark scores, and a single-H100 deployment story. The most credible open-weight challenger this quarter.
Zhipu AI released GLM-4.6 on May 11, 2026, under an Apache 2.0 license. Benchmark scores land in frontier territory; the deployment story is the genuinely new part.
What’s notable
Frontier-tier reasoning benchmarks. GLM-4.6 scores within striking distance of GPT-5 and Claude Opus 4.6 on the major public reasoning evals. We have not independently validated these scores yet — follow-up scorecard pending.
Single-H100 deployable. With appropriate quantization, GLM-4.6 runs on a single H100. That changes the deployment math for self-hosters who were previously running smaller open-weight models on accessible hardware.
Apache 2.0. Full open weights, permissive commercial use. No “open-but-not-really” caveat clauses.
Where it matters
For compliance-restricted teams (data residency, air-gapped environments, regulated industries), GLM-4.6 is the first open-weight model that lands “good enough” for serious production work. For everyone else, the hosted API economics of Opus or GPT-5 still beat self-hosting on raw cost.
What to watch
The China-origin question will affect adoption in some markets. Hosted GLM-4.6 endpoints from US/EU providers (Together, Fireworks, others) typically appear within weeks of release — that’s the gating event for mainstream Western adoption.
- Zhipu AI release notes github.com
- HuggingFace model card huggingface.co