llm free download - SourceForge

Showing 27 open source projects for "llm"

View related business solutions

Frameworks Clear Filters & Widen Search

$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

BricksLLM

Enterprise-grade API gateway that helps you monitor and impose cost

BricksLLM is an open-source framework for building and managing production-ready LLM (Large Language Model) applications. It provides tooling for prompt engineering, memory management, observability, and chaining, all in one unified developer experience. BricksLLM is designed to reduce boilerplate and increase the maintainability of LLM-based workflows.

Downloads: 3 This Week

Last Update: 2025-06-11
See Project
2

LlamaIndexTS

Data framework for your LLM applications

LlamaIndexTS is a data framework designed for Large Language Model (LLM) applications, focusing on server-side solutions to manage and process data efficiently.

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
3

NNCF

Neural Network Compression Framework for enhanced OpenVINO

NNCF (Neural Network Compression Framework) is an optimization toolkit for deep learning models, designed to apply quantization, pruning, and other techniques to improve inference efficiency.

Downloads: 2 This Week

Last Update: 2026-04-08
See Project
4

TradingAgents

Chinese Financial Trading Framework Based on Multi-Agent LLM

TradingAgents-CN is a Chinese-enhanced, multi-agent LLM framework aimed at building financial analysis and trading-oriented workflows, with an emphasis on collaboration between specialized agents rather than a single monolithic prompt. It organizes market-related tasks into roles and stages so different agents can contribute research, reasoning, aggregation, and decision support in a structured pipeline.

Downloads: 8 This Week

Last Update: 3 days ago
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
5

SentenceTransformers

Multilingual sentence & image embeddings with BERT

SentenceTransformers is a Python framework for state-of-the-art sentence, text and image embeddings. The initial work is described in our paper Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. You can use this framework to compute sentence / text embeddings for more than 100 languages. These embeddings can then be compared e.g. with cosine-similarity to find sentences with a similar meaning. This can be useful for semantic textual similar, semantic search, or paraphrase...

Downloads: 9 This Week

Last Update: 3 days ago
See Project
6

Superduper

Superduper: Integrate AI models and machine learning workflows

Superduper is a Python-based framework for building end-2-end AI-data workflows and applications on your own data, integrating with major databases. It supports the latest technologies and techniques, including LLMs, vector-search, RAG, and multimodality as well as classical AI and ML paradigms. Developers may leverage Superduper by building compositional and declarative objects that out-source the details of deployment, orchestration versioning, and more to the Superduper engine. This...

Downloads: 3 This Week

Last Update: 2025-08-26
See Project
7

MNN

MNN is a blazing fast, lightweight deep learning framework

MNN is a highly efficient and lightweight deep learning framework. It supports inference and training of deep learning models, and has industry leading performance for inference and training on-device. At present, MNN has been integrated in more than 20 apps of Alibaba Inc, such as Taobao, Tmall, Youku, Dingtalk, Xianyu and etc., covering more than 70 usage scenarios such as live broadcast, short video capture, search recommendation, product searching by image, interactive marketing, equity...

Downloads: 14 This Week

Last Update: 2026-04-07
See Project
8

Ray

A unified framework for scalable computing

Modern workloads like deep learning and hyperparameter tuning are compute-intensive and require distributed or parallel execution. Ray makes it effortless to parallelize single machine code — go from a single CPU to multi-core, multi-GPU or multi-node with minimal code changes. Accelerate your PyTorch and Tensorflow workload with a more resource-efficient and flexible distributed execution framework powered by Ray. Accelerate your hyperparameter search workloads with Ray Tune. Find the best...

Downloads: 4 This Week

Last Update: 7 days ago
See Project
9

Unsloth-MLX

Bringing the Unsloth experience to Mac users via Apple's MLX framework

Unsloth-MLX offers developers the power of Unsloth’s efficient large language model fine-tuning experience on Apple Silicon Macs by wrapping Apple’s native MLX framework with an API fully compatible with Unsloth workflows. This project removes traditional barriers that prevent Mac users from prototyping and experimenting with LLM training locally by allowing the same code used in cloud GPU environments to run on M-series hardware, improving workflow continuity and reducing iteration costs. It supports loading and training Hugging Face models with fine-tuning strategies like SFT, DPO, ORPO, and GRPO and even handles exporting models to formats like GGUF for downstream use, although some limitations apply with quantized models. ...

Downloads: 1 This Week

Last Update: 1 day ago
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
10

MegEngine

Easy-to-use deep learning framework with 3 key features

MegEngine is a fast, scalable and easy-to-use deep learning framework with 3 key features. You can represent quantization/dynamic shape/image pre-processing and even derivation in one model. After training, just put everything into your model and inference it on any platform at ease. Speed and precision problems won't bother you anymore due to the same core inside. In training, GPU memory usage could go down to one-third at the cost of only one additional line, which enables the DTR...

Downloads: 3 This Week

Last Update: 2024-04-30
See Project
11

Markdown Site

An open-source publishing framework built for AI agents and developers

...By leveraging Convex for backend real-time data management and Netlify for static hosting, markdown-site enables rich publishing features like SEO optimization, full-text search, analytics dashboards, and AI-indexed content tailored to LLM workflows.

Downloads: 0 This Week

Last Update: 2026-03-21
See Project
12

BentoML

Unified Model Serving Framework

BentoML simplifies ML model deployment and serves your models at a production scale. Support multiple ML frameworks natively: Tensorflow, PyTorch, XGBoost, Scikit-Learn and many more! Define custom serving pipeline with pre-processing, post-processing and ensemble models. Standard .bento format for packaging code, models and dependencies for easy versioning and deployment. Integrate with any training pipeline or ML experimentation platform. Parallelize compute-intense model inference...

Downloads: 0 This Week

Last Update: 2026-04-02
See Project
13

Seldon Core

An MLOps framework to package, deploy, monitor and manage models

The de facto standard open-source platform for rapidly deploying machine learning models on Kubernetes. Seldon Core, our open-source framework, makes it easier and faster to deploy your machine learning models and experiments at scale on Kubernetes. Seldon Core serves models built in any open-source or commercial model building framework. You can make use of powerful Kubernetes features like custom resource definitions to manage model graphs. And then connect your continuous integration and...

Downloads: 1 This Week

Last Update: 2026-01-23
See Project
14

TensorZero

TensorZero is an open-source stack for industrial-grade LLM apps

tensorzero is a lightweight C++ library designed for tensor operations and numerical computing. It offers a minimal and readable implementation of core tensor functionality, making it ideal for educational purposes, lightweight applications, or those wanting to understand how tensor libraries work under the hood. With no external dependencies, tensorzero is easy to integrate into C++ projects needing basic multi-dimensional array support.

Downloads: 4 This Week

Last Update: 2026-04-02
See Project
15

Fabric

Fabric is an open-source framework for augmenting humans using AI

...Patterns can be customized with variables, chained into pipelines, and applied to entire directories, which helps scale editorial or analytical tasks. A growing catalog of community patterns serves as a knowledge base for effective prompt engineering in practical contexts. In short, Fabric makes LLM work predictable and scriptable, so teams can share methods rather than one-off prompts.

Downloads: 12 This Week

Last Update: 19 hours ago
See Project
16

FastMCP Framework

A TypeScript framework for building MCP servers

...Because it’s built in TypeScript and designed with modern deployment targets (Node.js, Cloudflare Workers, etc), it aims to support production-ready MCP servers with strong type safety and extensibility. Developers can use it to build server back-ends that expose data or capabilities to LLM applications in a standardized way, reducing the custom boilerplate required for integrating with AI systems. It includes tooling for development (examples, CLI) and is open-source, inviting contributions and extensions.

Downloads: 9 This Week

Last Update: 5 days ago
See Project
17

Backtrack Sampler

An easy-to-understand framework for LLM samplers

Backtrack Sampler is a framework designed for experimenting with custom sampling strategies for language models (LLMs), enabling the ability to rewind and revise generated tokens. It allows developers to create and test their own token generation strategies by providing a base structure for manipulating logits and probabilities, making it a flexible tool for those interested in fine-tuning the behavior of LLMs.

Downloads: 0 This Week

Last Update: 2026-01-07
See Project
18

OpenAI Agents JS

A lightweight, powerful framework for multi-agent workflows

openai-agents-js is the JavaScript/TypeScript version of the OpenAI Agents SDK, designed to give developers a lightweight but powerful framework for building agentic workflows, voice agents, and tool-augmented LLM systems in JS/TS environments. The SDK is provider-agnostic, meaning while it’s deeply compatible with OpenAI’s APIs, you can also adapt it to use other backends or extension layers. At its core, it introduces primitives like Agents, Tools, Guardrails, and Handoffs, letting you define structured multi-agent systems in a modular way. ...

Downloads: 8 This Week

Last Update: 2026-04-06
See Project
19

OWL

Optimized Workforce Learning for General Multi-Agent Assistance

OWL (Optimized Workforce Learning) is a sophisticated open-source framework built on the CAMEL-AI ecosystem for orchestrating teams of AI agents to collaboratively solve complex, real-world tasks with dynamic planning and automation capabilities. Unlike single-agent systems, it treats task completion as a collaborative workforce where agents take on specialized roles (planning, execution, analysis) and coordinate via a modular multi-agent architecture that supports flexible teamwork across...

Downloads: 0 This Week

Last Update: 2026-04-10
See Project
20

PydanticAI

Agent Framework / shim to use Pydantic with LLMs

When I first found FastAPI, I got it immediately. I was excited to find something so innovative and ergonomic built on Pydantic. Virtually every Agent Framework and LLM library in Python uses Pydantic, but when we began to use LLMs in Pydantic Logfire, I couldn't find anything that gave me the same feeling. PydanticAI is a Python Agent Framework designed to make it less painful to build production-grade applications with Generative AI. Built by the team behind Pydantic (the validation layer of the OpenAI SDK, the Anthropic SDK, LangChain, LlamaIndex, AutoGPT, Transformers, CrewAI, Instructor, and many more).

Downloads: 0 This Week

Last Update: 22 hours ago
See Project
21

towhee

Framework that is dedicated to making neural data processing

Towhee is an open-source machine-learning pipeline that helps you encode your unstructured data into embeddings. You can use our Python API to build a prototype of your pipeline and use Towhee to automatically optimize it for production-ready environments. From images to text to 3D molecular structures, Towhee supports data transformation for nearly 20 different unstructured data modalities. We provide end-to-end pipeline optimizations, covering everything from data decoding/encoding, to...

Downloads: 1 This Week

Last Update: 2023-12-05
See Project
22

MMDeploy

OpenMMLab Model Deployment Framework

MMDeploy is an open-source deep learning model deployment toolset. It is a part of the OpenMMLab project. Models can be exported and run in several backends, and more will be compatible. All kinds of modules in the SDK can be extended, such as Transform for image processing, Net for Neural Network inference, Module for postprocessing and so on. Install and build your target backend. ONNX Runtime is a cross-platform inference and training accelerator compatible with many popular ML/DNN...

Downloads: 0 This Week

Last Update: 2023-12-25
See Project
23

pipeless

A computer vision framework to create and deploy apps in minutes

Pipeless is an open-source computer vision framework to create and deploy applications without the complexity of building and maintaining multimedia pipelines. It ships everything you need to create and deploy efficient computer vision applications that work in real-time in just minutes. Pipeless is inspired by modern serverless technologies. It provides the development experience of serverless frameworks applied to computer vision. You provide some functions that are executed for new...

Downloads: 16 This Week

Last Update: 2024-02-23
See Project
24

LangChain Apps on Production with Jina

Langchain Apps on Production with Jina & FastAPI

...And if you prefer, you can also deploy your LangChain apps on your own infrastructure to ensure data privacy. With long chain-serve, you can craft REST/WebSocket APIs, spin up LLM-powered conversational Slack bots, or wrap your LangChain apps into FastAPI packages on the cloud or on-premises.

Downloads: 0 This Week

Last Update: 2023-08-25
See Project
25

KotlinDL

High-level Deep Learning Framework written in Kotlin

KotlinDL is a high-level Deep Learning API written in Kotlin and inspired by Keras. Under the hood, it uses TensorFlow Java API and ONNX Runtime API for Java. KotlinDL offers simple APIs for training deep learning models from scratch, importing existing Keras and ONNX models for inference, and leveraging transfer learning for tailoring existing pre-trained models to your tasks. This project aims to make Deep Learning easier for JVM and Android developers and simplify deploying deep learning...

Downloads: 3 This Week

Last Update: 2024-01-29
See Project