All changelog entries
Improvement0.11.2ModelsContext

GLM-5.1 now handles 128K context

June 18, 2026

GLM-5.1 now advertises and serves a 128K context window — up from the previous 32K cap.

  • Large codebases, long documents, and extended chats no longer hit the old limit.
  • Requests are automatically routed to a backend that can handle the full window; very large prompts simply skip any route that can't.

No action needed — your existing calls just get more headroom.