weight scale software free download

MiniMax-M1

Open-weight, large-scale hybrid-attention reasoning model

MiniMax-M1 is presented as the world’s first open-weight, large-scale hybrid-attention reasoning model, designed to push the frontier of long-context, tool-using, and deeply “thinking” language models. It is built on the MiniMax-Text-01 foundation and keeps the same massive parameter budget, but reworks the attention and training setup for better reasoning and test-time compute scaling.

Downloads: 0 This Week

Last Update: 2025-12-01

RTP-LLM

Alibaba's high-performance LLM inference engine for diverse apps

RTP-LLM is an open-source large language model inference acceleration engine developed by Alibaba to provide high-performance serving infrastructure for modern LLM deployments. The system focuses on improving throughput, latency, and resource utilization when running large models in production environments. It achieves this by implementing optimized GPU kernels, batching strategies, and memory management techniques tailored for transformer inference workloads. The framework is designed for...

Downloads: 0 This Week

Last Update: 2026-03-09

UCCL

UCCL is an efficient communication library for GPUs

UCCL is a high-performance GPU communication library designed to support distributed machine learning workloads and large-scale AI systems. The library focuses on enabling efficient data transfer and collective communication between GPUs during training and inference processes. It supports a variety of communication patterns including collective operations such as all-reduce as well as peer-to-peer transfers that are commonly used in modern machine learning architectures.

Downloads: 0 This Week

Last Update: 2026-05-10

GLM-5.1

GLM-5: From Vibe Coding to Agentic Engineering

GLM-5.1 is a next-generation large language model developed by Z.ai for advanced coding, reasoning, and long-horizon agentic engineering tasks. Built as the successor to GLM-5, the model significantly improves performance in software engineering benchmarks, repository generation, and real-world terminal-based workflows. GLM-5.1 is designed to remain effective over extended problem-solving sessions, allowing it to iteratively refine strategies, analyze failures, and sustain productivity across hundreds of reasoning cycles and tool calls. The model leverages large-scale pretraining, reinforcement learning infrastructure, and sparse attention mechanisms to improve efficiency while maintaining strong long-context understanding. ...

Downloads: 157 This Week

Last Update: 2026-06-19

MarkPDFDown

A high-quality PDF-to-Markdown tool based on a large language model

MarkPDFdown is an open-source document processing tool designed to convert PDF files into structured Markdown output that can be easily used for documentation, content pipelines, and AI processing workflows. The project focuses on extracting text, formatting, and structural information from complex PDF documents and transforming that information into clean Markdown that preserves the original hierarchy of headings, paragraphs, tables, and lists. By producing Markdown rather than raw text,...

Downloads: 2 This Week

Last Update: 2026-03-06

NVIDIA Generative AI Examples

Generative AI reference workflows

...Many of the examples show how to deploy AI services using containerized environments, GPU acceleration, and microservices that can scale across modern infrastructure. Developers can explore sample chatbot applications, document question-answering systems, and knowledge-base pipelines that illustrate how generative AI can interact with external data sources.

Downloads: 0 This Week

Last Update: 2026-03-05

Qwen2.5-Coder

Qwen2.5-Coder is the code version of Qwen2.5, the large language model

Qwen2.5-Coder, developed by QwenLM, is an advanced open-source code generation model designed for developers seeking powerful and diverse coding capabilities. It includes multiple model sizes—ranging from 0.5B to 32B parameters—providing solutions for a wide array of coding needs. The model supports over 92 programming languages and offers exceptional performance in generating code, debugging, and mathematical problem-solving. Qwen2.5-Coder, with its long context length of 128K tokens, is...

1 Review

Downloads: 18 This Week

Last Update: 2025-03-04

Alpa

Training and serving large-scale neural networks

Alpa is a system for training and serving large-scale neural networks. Scaling neural networks to hundreds of billions of parameters has enabled dramatic breakthroughs such as GPT-3, but training and serving these large-scale neural networks require complicated distributed system techniques. Alpa aims to automate large-scale distributed training and serving with just a few lines of code.

Downloads: 1 This Week

Last Update: 2023-03-23

DeepSeek-V4-Pro

Flagship MoE model for advanced reasoning, coding, and agents

DeepSeek-V4-Pro is a flagship open-weight Mixture-of-Experts language model designed for high-performance reasoning, coding, and agent-based workflows at scale. It features approximately 1.6 trillion total parameters with around 49B activated during inference, enabling strong efficiency while maintaining frontier-level capability. The model supports an ultra-long context window of up to 1 million tokens, making it highly suitable for long-document reasoning, large codebases, and complex multi-step tasks. ...

Downloads: 0 This Week

Last Update: 2026-04-24

Laguna M.1

Flagship Poolside model for agentic coding and software engineering

...Laguna M.1 was designed to compete with leading frontier coding models on benchmarks such as SWE-Bench, Terminal-Bench, and other agentic engineering evaluations. It supports reasoning, tool calling, and long-context workflows, making it suitable for autonomous coding agents, software maintenance, debugging, and large-scale development projects.

Downloads: 0 This Week

Last Update: 2026-06-19

Qwable-v1

Agentic coding model combining Opus reasoning and Fable tools

...When configured as an agent, it can emit structured tool-use XML for file editing, shell commands, codebase navigation, and workflow automation. Qwable-v1 is designed specifically for software engineering, code editing, debugging, and autonomous coding workflows.

Downloads: 0 This Week

Last Update: 2026-06-23

MiMo-V2.5-Pro

Flagship MoE model for long-context agents and complex coding

...It also integrates multi-token prediction modules that accelerate inference and improve reinforcement learning efficiency. The model was trained on around 27 trillion tokens with FP8 mixed precision and refined through supervised fine-tuning, large-scale agentic reinforcement learning, and distillation.

Downloads: 0 This Week

Last Update: 2026-05-04

Search Results for "weight scale software"

Showing 12 open source projects for "weight scale software"
