Page 8 | reasoning models free download

Chameleon LLM

Codes for "Chameleon: Plug-and-Play Compositional Reasoning

Discover Chameleon, our cutting-edge compositional reasoning framework designed to enhance large language models (LLMs) and overcome their inherent limitations, such as outdated information and lack of precise reasoning. By integrating various tools such as vision models, web search engines, Python functions, and rule-based modules, Chameleon delivers more accurate, up-to-date, and precise responses, making it a game-changer in the natural language processing landscape. ...

Downloads: 0 This Week

Last Update: 2023-08-25

See Project

PRM800K

800,000 step-level correctness labels on LLM solutions to MATH problem

PRM800K is a process supervision dataset accompanying the paper Let’s Verify Step by Step, providing 800,000 step-level correctness labels on model-generated solutions to problems from the MATH dataset. The repository releases the raw labels and the labeler instructions used in two project phases, enabling researchers to study how human raters graded intermediate reasoning. Data are stored as newline-delimited JSONL files tracked with Git LFS, where each line is a full solution sample that...

Downloads: 2 This Week

Last Update: 1 day ago

See Project

CodeContests

Large dataset of coding contests designed for AI and ML model training

CodeContests, developed by Google DeepMind, is a large-scale competitive programming dataset designed for training and evaluating machine learning models on code generation and problem solving. This dataset played a central role in the development of AlphaCode, DeepMind’s model for solving programming problems at a human-competitive level, as published in Science. CodeContests aggregates problems and human-written solutions from multiple programming competition platforms, including AtCoder,...

Downloads: 1 This Week

Last Update: 7 hours ago

See Project

$Grade School Math$

Grade School Math

8.5K high quality grade school math problems

The grade-school-math repository (sometimes called GSM8K) is a curated dataset of 8,500 high-quality grade school math word problems intended for evaluating mathematical reasoning capabilities of language models. It is structured into 7,500 training problems and 1,000 test problems. These aren’t trivial exercises — many require multi-step reasoning, combining arithmetic operations, and handling intermediate steps (e.g. “If she sold half as many in May… how many in total?”). The problems are written by human authors (not automatically generated) to ensure linguistic variety and realism. ...

Downloads: 0 This Week

Last Update: 2025-10-03

See Project

ELI5

A library for debugging/inspecting machine learning classifiers

ELI5 is a Python library designed to help developers interpret, debug, and explain the predictions of machine learning models. The project focuses on improving model transparency by providing tools that visualize feature importance and prediction reasoning. It supports several popular machine learning frameworks including scikit-learn, XGBoost, LightGBM, CatBoost, and Keras. The library allows users to inspect model weights, analyze decision trees, and compute permutation feature importance for black-box models.

Downloads: 0 This Week

Last Update: 2026-03-15

See Project

Video Nonlocal Net

Non-local Neural Networks for Video Classification

...Non-local blocks compute attention-like responses across all positions in space-time, allowing a feature at one frame and location to aggregate information from distant frames and regions. This formulation improves action recognition and spatiotemporal reasoning, especially for classes requiring context beyond short temporal windows. The repo provides training recipes and models for standard datasets, as well as ablations that show how many non-local blocks to insert and at which stages. Efficient implementations keep memory and compute manageable so the blocks can be added without rewriting the entire backbone. ...

Downloads: 0 This Week

Last Update: 2025-10-07

See Project

Mistral Small 4

Model that fuses instruct, reasoning and agentic skills

The Mistral Small 4 collection is a set of open-weight large language models developed by Mistral AI that aim to unify multiple capabilities, including instruction following, reasoning, and coding, within a single efficient architecture. These models are part of the broader Mistral Small family, which is designed to deliver strong performance across a wide range of everyday AI tasks while maintaining relatively low latency and efficient deployment requirements. ...

Downloads: 0 This Week

Last Update: 2026-03-17

See Project

Nemotron 3 Nano

LL model providing reasoning and conversational capabilities

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 is a mid-sized open large language model created by NVIDIA to provide strong reasoning and conversational capabilities while maintaining efficient deployment requirements. The model contains roughly 30 billion parameters and is designed to balance performance and computational efficiency, making it suitable for developers building AI applications that cannot run extremely large models. It is trained from scratch and built using a hybrid architecture that integrates Transformer attention layers with Mamba-style sequence modeling components inside a Mixture-of-Experts framework. ...

Downloads: 0 This Week

Last Update: 2026-03-13

See Project

Nemotron 3 Super

Open language model developed by NVIDIA as part of Nemotron-3 family

NVIDIA-Nemotron-3-Super-120B-A12B-FP8 is a large-scale open language model developed by NVIDIA as part of the Nemotron-3 family of generative AI systems designed for advanced reasoning, conversational interaction, and agent-based workflows. The model contains approximately 120 billion parameters, but employs a Mixture-of-Experts architecture that activates only a smaller subset of parameters during inference, improving computational efficiency while maintaining high capability. Its...

Downloads: 0 This Week

Last Update: 2026-03-13

See Project

Leanstral

Open-source code agent designed for Lean 4

...The model is built to understand and generate Lean 4 code, which is used to express complex mathematical constructs as well as formal software specifications. By focusing on theorem proving and formal reasoning, Leanstral represents a specialized direction within large language models, targeting domains that require strict correctness and logical rigor rather than general conversational tasks. It leverages modern large-scale architectures, likely incorporating mixture-of-experts techniques, to balance efficiency and capability while handling structured symbolic reasoning tasks. ...

Downloads: 0 This Week

Last Update: 2026-03-17

See Project

DeepSeek-V3.2-Speciale

High-compute ultra-reasoning model surpassing model surpassing GPT-5

DeepSeek-V3.2-Speciale is the high-compute, ultra-reasoning variant of DeepSeek-V3.2, designed specifically to push the boundaries of mathematical, logical, and algorithmic intelligence. It builds on the DeepSeek Sparse Attention (DSA) framework, delivering dramatically improved long-context efficiency while preserving full model quality. Unlike the standard version, Speciale is tuned exclusively for deep reasoning and therefore does not support tool-calling, focusing its full capacity on...

Downloads: 0 This Week

Last Update: 2025-12-01

See Project

DeepSeek-V3.2

High-efficiency reasoning and agentic intelligence model

DeepSeek-V3.2 is a cutting-edge large language model developed by DeepSeek-AI, focused on achieving high reasoning accuracy and computational efficiency for agentic tasks. It introduces DeepSeek Sparse Attention (DSA), a new attention mechanism that dramatically reduces computational overhead while maintaining strong long-context performance. Built with a scalable reinforcement learning framework, it reaches near-GPT-5 levels of reasoning and outperforms comparable models like DeepSeek-V3.1 and Gemini-3.0-Pro in advanced benchmarks. ...

Downloads: 0 This Week

Last Update: 2025-12-01

See Project

gpt-oss-20b

OpenAI’s compact 20B open model for fast, agentic, and local use

GPT-OSS-20B is OpenAI’s smaller, open-weight language model optimized for low-latency, agentic tasks, and local deployment. With 21B total parameters and 3.6B active parameters (MoE), it fits within 16GB of memory thanks to native MXFP4 quantization. Designed for high-performance reasoning, it supports Harmony response format, function calling, web browsing, and code execution. Like its larger sibling (gpt-oss-120b), it offers adjustable reasoning depth and full chain-of-thought visibility...

Downloads: 0 This Week

Last Update: 2025-08-05

See Project

Search Results for "reasoning models" - Page 8

Showing 188 open source projects for "reasoning models"

Chameleon LLM

PRM800K

CodeContests

Grade School Math

ELI5

Video Nonlocal Net

Mistral Small 4

Nemotron 3 Nano

Nemotron 3 Super

Leanstral

DeepSeek-V3.2-Speciale

DeepSeek-V3.2

gpt-oss-20b

Search Results for "reasoning models" - Page 8

Showing 188 open source projects for "reasoning models"

Chameleon LLM

PRM800K

CodeContests

Grade School Math

ELI5

Video Nonlocal Net

Mistral Small 4

Nemotron 3 Nano

Nemotron 3 Super

Leanstral

DeepSeek-V3.2-Speciale

DeepSeek-V3.2

gpt-oss-20b

Related Categories