Showing 21 open source projects for "weight scale software"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 1
    MiniMax-M1

    MiniMax-M1

    Open-weight, large-scale hybrid-attention reasoning model

    MiniMax-M1 is presented as the world’s first open-weight, large-scale hybrid-attention reasoning model, designed to push the frontier of long-context, tool-using, and deeply “thinking” language models. It is built on the MiniMax-Text-01 foundation and keeps the same massive parameter budget, but reworks the attention and training setup for better reasoning and test-time compute scaling.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    CogVideo

    CogVideo

    Text and image to video generation: CogVideoX and CogVideo

    CogVideo is an open-source family of advanced video generation models that can create videos from text, images, or existing video inputs. Built on large-scale Transformer and diffusion architectures, it enables multimodal generation across text-to-video, image-to-video, and video continuation tasks. The latest CogVideoX models offer higher resolution outputs, longer video durations, and improved controllability through prompt engineering. The project includes tools for inference,...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 3
    CO3D (Common Objects in 3D)

    CO3D (Common Objects in 3D)

    Tooling for the Common Objects In 3D dataset

    CO3Dv2 (Common Objects in 3D, version 2) is a large-scale 3D computer vision dataset and toolkit from Facebook Research designed for training and evaluating category-level 3D reconstruction methods using real-world data. It builds upon the original CO3Dv1 dataset, expanding both scale and quality—featuring 2× more sequences and 4× more frames, with improved image fidelity, more accurate segmentation masks, and enhanced annotations for object-centric 3D reconstruction. CO3Dv2 enables research...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    GLM-5.1

    GLM-5.1

    GLM-5: From Vibe Coding to Agentic Engineering

    GLM-5.1 is a next-generation large language model developed by Z.ai for advanced coding, reasoning, and long-horizon agentic engineering tasks. Built as the successor to GLM-5, the model significantly improves performance in software engineering benchmarks, repository generation, and real-world terminal-based workflows. GLM-5.1 is designed to remain effective over extended problem-solving sessions, allowing it to iteratively refine strategies, analyze failures, and sustain productivity across hundreds of reasoning cycles and tool calls. The model leverages large-scale pretraining, reinforcement learning infrastructure, and sparse attention mechanisms to improve efficiency while maintaining strong long-context understanding. ...
    Downloads: 157 This Week
    Last Update:
    See Project
  • Cut Data Warehouse Costs by 54% Icon
    Cut Data Warehouse Costs by 54%

    Easily migrate from Snowflake, Redshift, or Databricks with free tools.

    BigQuery delivers 54% lower TCO with exabyte scale and flexible pricing. Free migration tools handle the SQL translation automatically.
    Try Free
  • 5
    Qwen3.6

    Qwen3.6

    Qwen3.6 is the large language model series developed by Qwen team

    ...One of its defining goals is to enhance “agentic coding,” enabling the model to reason across entire codebases, handle multi-step development tasks, and assist with complex software engineering workflows. The architecture incorporates modern techniques such as mixture-of-experts and hybrid attention mechanisms, allowing it to scale efficiently while maintaining strong performance.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    Qwen2.5-Coder

    Qwen2.5-Coder

    Qwen2.5-Coder is the code version of Qwen2.5, the large language model

    Qwen2.5-Coder, developed by QwenLM, is an advanced open-source code generation model designed for developers seeking powerful and diverse coding capabilities. It includes multiple model sizes—ranging from 0.5B to 32B parameters—providing solutions for a wide array of coding needs. The model supports over 92 programming languages and offers exceptional performance in generating code, debugging, and mathematical problem-solving. Qwen2.5-Coder, with its long context length of 128K tokens, is...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 7
    Leanstral

    Leanstral

    Open-source code agent designed for Lean 4

    Leanstral is an open-weight large language model developed by Mistral AI and specifically designed as a code agent for the Lean 4 proof assistant, enabling advanced interaction with formal mathematics and program verification systems. The model is built to understand and generate Lean 4 code, which is used to express complex mathematical constructs as well as formal software specifications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Mistral Small 4

    Mistral Small 4

    Model that fuses instruct, reasoning and agentic skills

    The Mistral Small 4 collection is a set of open-weight large language models developed by Mistral AI that aim to unify multiple capabilities, including instruction following, reasoning, and coding, within a single efficient architecture. These models are part of the broader Mistral Small family, which is designed to deliver strong performance across a wide range of everyday AI tasks while maintaining relatively low latency and efficient deployment requirements.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Ornith-1.0

    Ornith-1.0

    Open reasoning model for agentic coding and tool workflows

    Ornith-1.0 is a large open-source reasoning model from DeepReinforce, built for agentic coding, tool use, and complex software engineering workflows. It is part of the Ornith 1.0 family, which includes dense and MoE models post-trained on Gemma 4 and Qwen 3.5. The model focuses on coding-agent performance across benchmarks such as Terminal-Bench, SWE-Bench, NL2Repo, OpenClaw, and ClawEval. Its training uses a self-improving reinforcement learning framework that optimizes not only solution...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    LongCat-2.0

    LongCat-2.0

    Trillion-parameter MoE model for coding and million-token reasoning

    LongCat-2.0 is Meituan’s flagship open-weight Mixture-of-Experts language model designed for frontier-scale coding, reasoning, and autonomous agent workflows. It features 1.6 trillion total parameters with approximately 48 billion activated per token, combining high capability with efficient sparse inference. The model was pretrained on more than 35 trillion tokens and trained entirely on a large-scale cluster of domestically developed AI accelerators, demonstrating stable frontier-scale training without rollback events. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Mistral Large 3 675B Base 2512

    Mistral Large 3 675B Base 2512

    Frontier-scale 675B multimodal base model for custom AI training

    Mistral Large 3 675B Base 2512 is the foundational, pre-trained version of the Mistral Large 3 family, built as a frontier-scale multimodal Mixture-of-Experts model with 41B active parameters and a total size of 675B. It is trained from scratch using 3000 H200 GPUs, making it one of the most advanced and compute-intensive open-weight models available. As the base version, it is not fine-tuned for instruction following or reasoning, making it ideal for teams planning their own domain-specific finetuning or custom training pipelines. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DeepSeek-V4-Pro

    DeepSeek-V4-Pro

    Flagship MoE model for advanced reasoning, coding, and agents

    DeepSeek-V4-Pro is a flagship open-weight Mixture-of-Experts language model designed for high-performance reasoning, coding, and agent-based workflows at scale. It features approximately 1.6 trillion total parameters with around 49B activated during inference, enabling strong efficiency while maintaining frontier-level capability. The model supports an ultra-long context window of up to 1 million tokens, making it highly suitable for long-document reasoning, large codebases, and complex multi-step tasks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MiniMax-M2.7

    MiniMax-M2.7

    Self-evolving AI model for agents, coding, and complex workflows

    MiniMax-M2.7 is a large-scale open-weight language model designed for advanced agent-based workflows, professional software engineering, and complex productivity tasks. With 229B parameters, it introduces a self-evolution framework in which the model actively improves its own capabilities by updating memory, generating skills, and iterating through reinforcement learning experiments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Laguna XS.2

    Laguna XS.2

    Open agentic coding model optimized for local deployment

    Laguna XS.2 is Poolside’s first open-weight Mixture-of-Experts model designed specifically for agentic coding and long-horizon software engineering tasks. The model contains 33B total parameters with only 3B activated per token, allowing it to deliver strong coding performance while remaining efficient enough to run locally on modern consumer hardware. It uses a hybrid attention architecture that combines Sliding Window Attention and global attention layers, reducing memory requirements and improving inference speed. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Laguna M.1

    Laguna M.1

    Flagship Poolside model for agentic coding and software engineering

    ...Laguna M.1 was designed to compete with leading frontier coding models on benchmarks such as SWE-Bench, Terminal-Bench, and other agentic engineering evaluations. It supports reasoning, tool calling, and long-context workflows, making it suitable for autonomous coding agents, software maintenance, debugging, and large-scale development projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Devstral 2

    Devstral 2

    Agentic 123B coding model optimized for large-scale engineering

    Devstral 2 is a large-scale agentic language model purpose-built for software engineering tasks, excelling at codebase exploration, multi-file editing, and tool-driven automation. With 123B parameters and FP8 instruct tuning, it delivers strong instruction following for chat-based workflows, coding assistants, and autonomous developer agents. The model demonstrates outstanding performance on SWE-bench, validating its effectiveness in real-world engineering scenarios.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Hy3 preview

    Hy3 preview

    Efficient MoE model for reasoning, coding, and AI agent workflows

    Hy3 preview is Tencent Hunyuan’s latest open-weight Mixture-of-Experts language model, designed for advanced reasoning, coding, instruction following, and autonomous agent workflows. It is the first model built on Tencent’s rebuilt training infrastructure and introduces significant improvements in context learning, software engineering, and tool-based task execution. The model features 295B total parameters with only 21B activated during inference, plus a dedicated 3.8B Multi-Token Prediction (MTP) layer that accelerates generation through speculative decoding. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Qwable-v1

    Qwable-v1

    Agentic coding model combining Opus reasoning and Fable tools

    ...When configured as an agent, it can emit structured tool-use XML for file editing, shell commands, codebase navigation, and workflow automation. Qwable-v1 is designed specifically for software engineering, code editing, debugging, and autonomous coding workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Qwen3.6-27B

    Qwen3.6-27B

    Dense multimodal Qwen model for coding, agents, and long context

    Qwen3.6-27B is an open-weight multimodal model built to deliver strong real-world coding, agent, and long-context performance in a dense 27B-parameter architecture. It combines a causal language model with a vision encoder and supports text, image, and video inputs, making it suitable for both software workflows and broader multimodal tasks. The model emphasizes stability and practical developer utility, with major improvements in agentic coding, frontend generation, and repository-level reasoning. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Kimi K2.6

    Kimi K2.6

    Multimodal agent model for coding, orchestration, and autonomy

    Kimi K2.6 is an open-source native multimodal agentic model built for advanced autonomous execution, long-horizon coding, and large-scale task orchestration. It is designed to handle complex end-to-end software workflows across multiple languages and domains, including front-end development, DevOps, performance optimization, and coding-driven design. Beyond coding, it can transform prompts and visual inputs into production-ready interfaces and lightweight full-stack outputs with structured layouts, interactivity, and polished visual detail. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MiMo-V2.5-Pro

    MiMo-V2.5-Pro

    Flagship MoE model for long-context agents and complex coding

    ...It also integrates multi-token prediction modules that accelerate inference and improve reinforcement learning efficiency. Trained on around 27 trillion tokens with FP8 mixed precision and refined through supervised fine-tuning, large-scale agentic reinforcement learning, and distillation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo