model-builder free download

Langflow

Low-code app builder for RAG and multi-agent AI applications

Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.

Downloads: 21 This Week

Last Update: 2026-06-09

See Project

Code World Model (CWM)

Research code artifacts for Code World Model (CWM)

CWM (Code World Model) is a 32-billion-parameter open-weights language model. It is developed by Meta for enhancing code generation and reasoning about programs. It is explicitly trained on execution traces, action-observation trajectories, and agentic interactions in controlled environments. It has been developed to better capture how code, actions, and state interact over time.

Downloads: 0 This Week

Last Update: 2025-09-26

See Project

llama.cpp

Port of Facebook's LLaMA model in C/C++

The llama.cpp project enables the inference of Meta's LLaMA model (and other models) in pure C/C++ without requiring a Python runtime. It is designed for efficient and fast model execution, offering easy integration for applications needing LLM-based capabilities. The repository focuses on providing a highly optimized and portable implementation for running large language models directly within C/C++ environments.

1 Review

Downloads: 342 This Week

Last Update: 11 hours ago

See Project

Floneum

Instant, controllable, local pre-trained AI models in Rust

Floneum is an open-source platform for building AI-powered workflows using large language models through a visual and extensible interface. The system allows users to design complex AI pipelines using a drag-and-drop workflow builder rather than writing extensive code. It focuses on enabling developers and researchers to create language model applications that combine different tools, data sources, and AI capabilities into automated workflows. Floneum supports a plugin architecture that allows external components to extend the platform while maintaining isolation and security. ...

Downloads: 0 This Week

Last Update: 2026-03-06

See Project

DeepSeek R1

Open-source, high-performance AI model with advanced reasoning

DeepSeek-R1 is an open-source large language model developed by DeepSeek, designed to excel in complex reasoning tasks across domains such as mathematics, coding, and language. DeepSeek R1 offers unrestricted access for both commercial and academic use. The model employs a Mixture of Experts (MoE) architecture, comprising 671 billion total parameters with 37 billion active parameters per token, and supports a context length of up to 128,000 tokens.

1 Review

Downloads: 110 This Week

Last Update: 2025-07-09

See Project

Kimi K2

Kimi K2 is the large language model series developed by Moonshot AI

...The model family includes variants like a foundational base model that researchers can fine-tune for specific use cases and an instruct-optimized variant primed for general-purpose chat and agent-style interactions, offering flexibility for both experimentation and deployment. With its high-dimensional attention mechanisms and expert routing, Kimi-K2 excels across benchmarks in live coding, math reasoning, and problem solving.

Downloads: 23 This Week

Last Update: 2026-01-27

See Project

Qwen3-Coder

Qwen3-Coder is the code version of Qwen3

Qwen3-Coder is the latest and most powerful agentic code model developed by the Qwen team at Alibaba Cloud. Its flagship version, Qwen3-Coder-480B-A35B-Instruct, features a massive 480 billion-parameter Mixture-of-Experts architecture with 35 billion active parameters, delivering top-tier performance on coding and agentic tasks. This model sets new state-of-the-art benchmarks among open models for agentic coding, browser-use, and tool-use, matching performance comparable to leading models like Claude Sonnet. ...

1 Review

Downloads: 13 This Week

Last Update: 2026-03-24

See Project

llama.cpp

LLM inference in C/C++

llama.cpp is a high-performance C and C++ project for running large language models locally and in the cloud with minimal setup. It is built around efficient inference, broad hardware support, and the GGUF model format. The project supports many model families and has become a major foundation for local AI tools, model serving, and embedded inference workflows. It provides command-line tools, a server mode with an OpenAI-compatible API style, model conversion utilities, and extensive backend acceleration options. llama.cpp runs on CPUs and GPUs, with support for Apple silicon, x86, RISC-V, CUDA, HIP, Vulkan, SYCL, Metal, and hybrid CPU-GPU execution. ...

Downloads: 6 This Week

Last Update: 4 hours ago

See Project

GLM-5

From Vibe Coding to Agentic Engineering

GLM-5 is a next-generation open-source large language model (LLM) developed by the Z .ai team under the zai-org organization that pushes the boundaries of reasoning, coding, and long-horizon agentic intelligence. Building on earlier GLM series models, GLM-5 dramatically scales the parameter count (to roughly 744 billion) and expands pre-training data to significantly improve performance on complex tasks such as multi-step reasoning, software engineering workflows, and agent orchestration compared to its predecessors like GLM-4.5. ...

Downloads: 81 This Week

Last Update: 2026-05-15

See Project

ChatWiki

ChatWiki WeChat official account's AI knowledge base workflow agent

ChatWiki is an open-source AI knowledge base and workflow automation platform designed to help organizations build intelligent question-answering systems using large language models and retrieval-augmented generation techniques. The system enables companies to transform internal documents and data into searchable knowledge bases that can power AI assistants capable of answering domain-specific questions. It provides a complete pipeline for ingesting documents, preprocessing and segmenting...

Downloads: 5 This Week

Last Update: 4 days ago

See Project

DeepSeek-V3

Powerful AI language model (MoE) optimized for efficiency/performance

DeepSeek-V3 is a robust Mixture-of-Experts (MoE) language model developed by DeepSeek, featuring a total of 671 billion parameters, with 37 billion activated per token. It employs Multi-head Latent Attention (MLA) and the DeepSeekMoE architecture to enhance computational efficiency. The model introduces an auxiliary-loss-free load balancing strategy and a multi-token prediction training objective to boost performance.

1 Review

Downloads: 56 This Week

Last Update: 2025-07-09

See Project

Heretic

Fully automatic censorship removal for language models

...Beyond simple decensoring, Heretic includes research-oriented options for analyzing model internals and interpretability data.

Downloads: 11 This Week

Last Update: 2 days ago

See Project

MiniMax-01

Large-language-model & vision-language-model based on Linear Attention

...MiniMax-VL-01 extends this core by adding a 303M-parameter Vision Transformer and a two-layer MLP projector in a ViT–MLP–LLM framework, allowing the model to process images at dynamic resolutions up to 2016×2016.

Downloads: 4 This Week

Last Update: 2025-12-01

See Project

CodeGeeX

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

CodeGeeX is a large-scale multilingual code generation model with 13 billion parameters, trained on 850B tokens across more than 20 programming languages. Developed with MindSpore and later made PyTorch-compatible, it is capable of multilingual code generation, cross-lingual code translation, code completion, summarization, and explanation. It has been benchmarked on HumanEval-X, a multilingual program synthesis benchmark introduced alongside the model, and achieves state-of-the-art performance compared to other open models like InCoder and CodeGen. ...

Downloads: 22 This Week

Last Update: 2 days ago

See Project

Mosec

A high-performance ML model serving framework, offers dynamic batching

Mosec is a high-performance and flexible model-serving framework for building ML model-enabled backend and microservices. It bridges the gap between any machine learning models you just trained and the efficient online service API.

Downloads: 5 This Week

Last Update: 2026-04-15

See Project

LLaMA Models

Utilities intended for use with Llama models

This repository serves as the central hub for the Llama foundation model family, consolidating model cards, licenses and use policies, and utilities that support inference and fine-tuning across releases. It ties together other stack components (like safety tooling and developer SDKs) and provides canonical references for model variants and their intended usage. The project’s issues and releases reflect an actively used coordination point for the ecosystem, where guidance, utilities, and compatibility notes are published. ...

Downloads: 6 This Week

Last Update: 2025-10-08

See Project

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training

MedicalGPT training medical GPT model with ChatGPT training pipeline, implementation of Pretraining, Supervised Finetuning, Reward Modeling and Reinforcement Learning. MedicalGPT trains large medical models, including secondary pre-training, supervised fine-tuning, reward modeling, and reinforcement learning training.

Downloads: 0 This Week

Last Update: 2026-04-20

See Project

Langfuse

Open source LLM engineering platform: LLM Observability, metrics, etc.

Langfuse is a logging and analytics tool for large language model (LLM) applications, providing insights into usage, performance, and troubleshooting.

Downloads: 3 This Week

Last Update: 12 hours ago

See Project

Parallax

Parallax is a distributed model serving framework

...The platform also supports model sharding and pipeline parallelism, allowing very large models to run across distributed resources.

Downloads: 3 This Week

Last Update: 2026-03-09

See Project

H2O LLM Studio

Framework and no-code GUI for fine-tuning LLMs

...You can also use H2O LLM Studio with the command line interface (CLI) and specify the configuration file that contains all the experiment parameters. To finetune using H2O LLM Studio with CLI, activate the pipenv environment by running make shell. With H2O LLM Studio, training your large language model is easy and intuitive. First, upload your dataset and then start training your model. Start by creating an experiment. You can then monitor and manage your experiment, compare experiments, or push the model to Hugging Face to share it with the community.

Downloads: 3 This Week

Last Update: 2026-05-30

See Project

EmoLLM

Pre & Post-training & Dataset & Evaluation & Depoly & RAG

...The project is designed to help users through mental health conversations and has been fine-tuned from existing instruction-following LLMs rather than built as a base model from scratch. Its repository includes multiple model variants and training configurations spanning several underlying model families, including InternLM, Qwen, DeepSeek, Mixtral, LLaMA, and others, which shows that the initiative is structured as a broad ecosystem rather than a single release. The project also covers more than just model weights, with material for datasets, fine-tuning, evaluation, deployment, demos, RAG, and related subprojects such as its psychological digital assistant work.

Downloads: 1 This Week

Last Update: 2026-03-06

See Project

LLaMA 3

The official Meta Llama 3 GitHub site

This repository is the former home for Llama 3 model artifacts and getting-started code, covering pre-trained and instruction-tuned variants across multiple parameter sizes. It introduced the public packaging of weights, licenses, and quickstart examples that helped developers fine-tune or run the models locally and on common serving stacks. As the Llama stack evolved, Meta consolidated repositories and marked this one deprecated, pointing users to newer, centralized hubs for models, utilities, and docs. ...

Downloads: 23 This Week

Last Update: 2025-10-08

See Project

Ollama

Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.

Ollama is an open-source platform that enables developers to run large language models locally on their own machines. It simplifies working with modern AI models by providing a unified interface to download, manage, and interact with them. Users can run models like Llama, Gemma, Qwen, and others directly from the command line or through APIs. Ollama also integrates with popular developer tools and AI agents, allowing seamless workflows across coding environments and applications. It supports...

Downloads: 1,053 This Week

Last Update: 5 days ago

See Project

uqlm

Uncertainty Quantification for Language Models, is a Python package

...UQLM also supports ensemble strategies and model-as-judge approaches for evaluating responses. By combining multiple uncertainty metrics, the system provides more reliable indicators of when language model outputs may be unreliable.

Downloads: 3 This Week

Last Update: 2026-06-08

See Project

wllama

WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

...The framework provides both high-level APIs for common tasks such as text generation and embeddings, as well as low-level APIs that expose tokenization, sampling controls, and model state management.

Downloads: 2 This Week

Last Update: 24 hours ago

See Project

Search Results for "model-builder"

Showing 414 open source projects for "model-builder"

Langflow

Code World Model (CWM)

llama.cpp

Floneum

DeepSeek R1

Kimi K2

Qwen3-Coder

llama.cpp

GLM-5

ChatWiki

DeepSeek-V3

Heretic

MiniMax-01

CodeGeeX

Mosec

LLaMA Models

MedicalGPT

Langfuse

Parallax

H2O LLM Studio

EmoLLM

LLaMA 3

Ollama

uqlm

wllama

Search Results for "model-builder"

Showing 414 open source projects for "model-builder"

Langflow

Code World Model (CWM)

llama.cpp

Floneum

DeepSeek R1

Kimi K2

Qwen3-Coder

llama.cpp

GLM-5

ChatWiki

DeepSeek-V3

Heretic

MiniMax-01

CodeGeeX

Mosec

LLaMA Models

MedicalGPT

Langfuse

Parallax

H2O LLM Studio

EmoLLM

LLaMA 3

Ollama

uqlm

wllama

Related Searches

Related Categories