GLM-5.1
GLM-5.1 is the latest iteration of Z.ai’s GLM series, a frontier-level, agent-oriented AI model optimized for coding, reasoning, and long-horizon workflows. It builds on the GLM-5 architecture, which uses a Mixture-of-Experts (MoE) design to deliver high performance while keeping inference costs low, and is part of a broader push toward open-weight, developer-accessible models. A core focus of GLM-5.1 is agentic behavior: it can plan, execute, and iterate across multi-step tasks rather than only responding to single prompts. It is designed for complex workflows such as debugging code, navigating repositories, and executing chained operations with sustained context. Compared to earlier models, GLM-5.1 is more reliable in long interactions, maintaining coherence across extended sessions and reducing breakdowns in multi-step reasoning.
Sarvam 105B
Sarvam-105B is the flagship large language model in Sarvam’s open-source model family, designed to deliver high-performance reasoning, multilingual understanding, and agent-based execution within a single scalable system. It is a Mixture-of-Experts (MoE) model with approximately 105 billion total parameters, of which only a fraction are activated per token, so it stays computationally efficient while remaining capable on complex tasks. The model is optimized for advanced reasoning, coding, mathematics, and agentic workflows, making it suitable for tasks that require multi-step problem solving and structured outputs rather than simple conversational responses. Sarvam-105B supports long-context processing of up to roughly 128K tokens, enabling it to handle large documents, extended conversations, and deep analytical queries without losing coherence.
Command A+
Command A+ is Cohere’s fastest and most powerful language model yet, an open-source enterprise workhorse built for complex reasoning, multimodal and multilingual agentic tasks, and efficient private deployment. It is a sparse mixture-of-experts model with 218B total parameters and 25B active parameters, designed for high-performance agentic workflows with minimal compute overhead. Command A+ unifies capabilities from across the Command family into one scalable model that handles text, images, reasoning, and tool use, with a 128K input context, a 64K maximum generation length, and coverage of 48 languages. It is optimized for reasoning, agentic workflows, RAG, multilingual work, and multimodal document processing, and it works with vLLM and Transformers. Compared with earlier Command A models, it improves enterprise workload performance across multimodal understanding, retrieval, long-horizon tasks, complex reasoning, coding, translation, and document understanding.
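To make the sparse mixture-of-experts figures above concrete, the short Python sketch below works out what share of the model's parameters is active per token, using only the 218B total and 25B active counts quoted in the description. The function name `active_fraction` is a hypothetical helper introduced for illustration; it is not part of any Cohere tooling.

```python
def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Share of a sparse MoE model's parameters used per token.

    Both arguments are in billions of parameters. This is a hypothetical
    helper for illustration only, not part of any vendor API.
    """
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    if not 0 < active_params_b <= total_params_b:
        raise ValueError("active_params_b must be in (0, total_params_b]")
    return active_params_b / total_params_b


# Figures quoted in the description: 218B total, 25B active.
share = active_fraction(218, 25)
print(f"{share:.1%}")  # prints 11.5%
```

In other words, roughly one parameter in nine takes part in processing each token, which is where the "minimal compute overhead" of a sparse design comes from: per-token compute tracks the active count rather than the total.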
Command A Translate
Command A Translate is Cohere’s enterprise-grade machine translation model, built to deliver secure, high-quality translation across 23 business-relevant languages. Built on a 111-billion-parameter architecture with an 8K-input / 8K-output context window, it achieves industry-leading performance that surpasses models like GPT-5, DeepSeek-V3, DeepL Pro, and Google Translate across a broad suite of benchmarks. The model supports private deployments for sensitive workflows, giving enterprises full control over their data, and introduces a “Deep Translation” workflow, an agentic, multi-step refinement process that iteratively improves translation quality for complex use cases. External validation from RWS Group confirms its strength on challenging translation tasks. The model’s weights are also available for research via Hugging Face under a CC-BY-NC license, allowing customization, fine-tuning, and private deployment.
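The description characterizes “Deep Translation” only as an agentic, multi-step refinement process, without giving details. The Python sketch below shows one generic shape such a loop could take: draft, critique, revise, repeat. It is an assumption-laden illustration, not Cohere's actual workflow; `refine_translation` and the three callables (`draft_fn`, `critique_fn`, `revise_fn`) are hypothetical names that, in practice, would each wrap a call to a translation model.

```python
from typing import Callable, List


def refine_translation(
    source: str,
    draft_fn: Callable[[str], str],
    critique_fn: Callable[[str, str], List[str]],
    revise_fn: Callable[[str, str, List[str]], str],
    max_rounds: int = 3,
) -> str:
    """Generic draft -> critique -> revise loop (illustrative only).

    All three callables are hypothetical stand-ins for model calls:
      draft_fn(source) -> first translation
      critique_fn(source, translation) -> list of issues (empty if none)
      revise_fn(source, translation, issues) -> improved translation
    """
    translation = draft_fn(source)
    for _ in range(max_rounds):
        issues = critique_fn(source, translation)
        if not issues:
            # Nothing left to fix, so stop early.
            break
        translation = revise_fn(source, translation, issues)
    return translation
```

Capping the loop with `max_rounds` keeps cost bounded when the critique step never reports a clean result, which matters when every round is a separate model call.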