GPT-OSS-120B is a powerful open-weight language model from OpenAI, optimized for reasoning, tool use, and agentic tasks. With 117B total parameters and 5.1B active parameters per token, it is designed to fit on a single H100 GPU using native MXFP4 quantization. The model supports fine-tuning, chain-of-thought reasoning, and structured outputs, making it well suited to complex workflows. It operates in OpenAI’s Harmony response format and can be deployed via Transformers, vLLM, Ollama, LM Studio, and PyTorch. Developers can set the reasoning level (low, medium, or high) to trade response speed against reasoning depth. Released under the Apache 2.0 license, it can be used in both commercial and research applications. The model supports function calling, web browsing, and code execution, streamlining intelligent agent development.
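A minimal Transformers sketch of local inference. It assumes the Hugging Face checkpoint id `openai/gpt-oss-120b`, a recent Transformers release with MXFP4 support, and the Harmony convention of setting reasoning depth through a `Reasoning: high` hint in the system message; adjust these to your setup.

```python
from transformers import pipeline

# Assumption: the model is published on the Hugging Face Hub under this id.
model_id = "openai/gpt-oss-120b"

pipe = pipeline(
    "text-generation",
    model=model_id,
    torch_dtype="auto",   # let Transformers pick the quantized/native dtype
    device_map="auto",    # spread weights across available GPU memory
)

messages = [
    # Assumption: reasoning level is set via a Harmony-style system hint.
    {"role": "system", "content": "Reasoning: high"},
    {"role": "user", "content": "Explain MXFP4 quantization in two sentences."},
]

outputs = pipe(messages, max_new_tokens=256)
# The pipeline returns the conversation with the assistant reply appended last.
print(outputs[0]["generated_text"][-1])
```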
Features
- 117B parameters, 5.1B active (MoE)
- Harmony-format compatible for chat and agents
- Apache 2.0 license for free commercial use
- Chain-of-thought reasoning with adjustable depth
- Native support for tool use: browsing, code, functions
- Fine-tuning support on a single H100 node
- Deployable via Transformers, vLLM, Ollama, and more (see the serving sketch after this list)
- Efficient inference using MXFP4 quantization
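For server-style deployment, vLLM and Ollama expose an OpenAI-compatible endpoint that the standard `openai` Python client can call. The sketch below assumes a local server already running (for example via `vllm serve openai/gpt-oss-120b`); the `localhost:8000` URL and placeholder API key are assumptions about a default local setup, not fixed values.

```python
from openai import OpenAI

# Assumption: an OpenAI-compatible server (e.g. vLLM) is listening on port 8000;
# local servers typically ignore the API key, so a placeholder is fine.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="openai/gpt-oss-120b",
    messages=[
        {"role": "system", "content": "Reasoning: low"},  # favor speed over depth
        {"role": "user", "content": "Summarize the Apache 2.0 license in two sentences."},
    ],
    max_tokens=200,
)

print(response.choices[0].message.content)
```

The same client code works against any of the listed runtimes as long as they expose the OpenAI-compatible chat completions API; only `base_url` and the model name need to change.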