Search Results for "deploy"

26 projects for "deploy" with 2 filters applied: Large Language Models (LLM), BSD

1

LlamaDeploy

Deploy your agentic workflows to production

...Developers can define workflows that involve multiple steps such as data retrieval, reasoning, tool invocation, and response generation, then deploy them using the framework’s infrastructure tools. The design emphasizes scalability, modularity, and fault-tolerant execution so that agent systems can run reliably in production environments.

Downloads: 0 This Week

Last Update: 2026-04-06
See Project
2

Beelzebub

A secure low code honeypot framework

...Honeypots are systems intentionally exposed to attackers in order to capture malicious behavior, and Beelzebub enhances this concept by incorporating artificial intelligence and virtualization techniques. The platform allows organizations and researchers to deploy decoy services that mimic real infrastructure while recording attacker interactions. By using AI models to simulate realistic system behavior, the honeypot becomes harder for attackers to identify, increasing the likelihood that malicious activity can be observed and analyzed. The framework is designed with a low-code configuration approach so security teams can easily deploy honeypots for multiple services and ports.

Downloads: 0 This Week

Last Update: 6 days ago
See Project
3

LangServe

Helps developers deploy LangChain runnables and chains as a REST API

...The framework is built on top of FastAPI and uses Pydantic for request validation and structured data handling. It also includes client libraries that allow developers to interact with deployed chains from Python or JavaScript applications. LangServe is commonly used to deploy AI applications such as chatbots, document analysis pipelines, and agent-based systems that require scalable access through APIs.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
4

Chitu

High-performance inference framework for large language models

Chitu is a high-performance inference engine designed to deploy and run large language models efficiently in production environments. The framework focuses on improving efficiency, flexibility, and scalability for organizations that need to run LLM inference workloads across different hardware platforms. It supports heterogeneous computing environments, including CPUs, GPUs, and various specialized AI accelerators, allowing models to run across a wide range of infrastructure configurations. ...

Downloads: 2 This Week

Last Update: 2026-05-21
See Project
5

browserable

Open source and self-hostable browser automation library for AI agents

...Built primarily in JavaScript, the framework offers both a developer-friendly SDK and a REST API that allow integration with AI applications and automation pipelines. It is designed to be self-hostable, which means developers can deploy and run it on their own infrastructure without relying on third-party services. The platform enables the creation of browser-based agents capable of performing complex online workflows such as data collection, research tasks, and automated interactions with web platforms.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
6

AI Engineering Transition Path

Research papers and blogs to transition to AI Engineering

...The project organizes resources that cover fundamental topics required to understand modern AI systems, including transformers, vector embeddings, tokenization, infrastructure design, and mixture-of-experts architectures. Instead of presenting isolated tutorials, the repository provides a structured pathway that guides engineers through the technical knowledge needed to build and deploy large language model systems. The materials include curated research papers, blog posts, and code examples that explain both theoretical foundations and practical implementation strategies. By consolidating these resources into a single repository, the project helps developers navigate the rapidly expanding AI ecosystem without needing to search through scattered materials.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
7

LangBot

Production-grade platform for building agentic IM bots

LangBot is an open source platform designed to build and deploy AI-powered chatbots across multiple instant messaging ecosystems. The system allows developers to integrate large language models into messaging platforms so that bots can perform tasks, answer questions, and automate workflows directly within everyday communication tools. It supports numerous messaging services including Discord, Slack, Telegram, WeChat, and other enterprise communication systems, making it a flexible solution for both personal projects and organizational deployments. ...

Downloads: 0 This Week

Last Update: 2026-05-12
See Project
8

LLMChat

Unified interface for AI chat, Agentic workflows and more

LLMChat is an open-source AI chat platform designed to provide a unified interface for interacting with multiple large language model providers while emphasizing privacy and advanced research capabilities. The system is built as a modern monorepo using technologies such as Next.js and TypeScript, enabling developers to deploy a full-featured web-based chatbot environment. One of its primary goals is to support sophisticated research workflows that combine conversational AI with information retrieval and reasoning tools. The platform includes specialized interaction modes such as deep research analysis and enhanced search capabilities that help users explore complex topics more effectively. ...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
9

LLM-Pruner

On the Structural Pruning of Large Language Models

LLM-Pruner is an open-source framework designed to compress large language models through structured pruning techniques while maintaining their general capabilities. Large language models often require enormous computational resources, making them expensive to deploy and inefficient for many practical applications. LLM-Pruner addresses this issue by identifying and removing non-essential components within transformer architectures, such as redundant attention heads or feed-forward structures. The framework relies on gradient-based analysis to determine which parameters contribute least to model performance, enabling targeted structural pruning rather than simple weight removal. ...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
10

TaxHacker

Self-hosted AI accounting app. LLM analyzer for receipts

...It integrates large language models to analyze these documents, extract relevant financial information, and categorize expenses or income based on configurable rules. Users can deploy the application on their own infrastructure, ensuring that financial data remains private and under their control rather than being processed by external services. The software provides tools for tracking income streams, monitoring expenses, and organizing financial records in a structured format. Because the system supports customizable prompts and categories, users can adapt the AI analysis to match their accounting workflows or tax requirements.

Downloads: 0 This Week

Last Update: 2026-04-03
See Project
11

SmythOS

Cloud-native runtime for agentic AI

...It provides a foundational infrastructure layer that functions similarly to an operating system for agentic AI systems, managing resources such as language models, storage, vector databases, and caching through a unified interface. Developers can use the runtime to create, deploy, and orchestrate intelligent agents across local machines, cloud environments, or hybrid infrastructures without rewriting their application logic. The platform includes a software development kit and command-line interface that allow developers to define agent workflows, manage execution environments, and automate deployment processes. ...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
12

Generative AI Use Cases (GenU)

Application implementation with business use cases

...These examples cover tasks such as document analysis, conversational assistants, content generation, and knowledge retrieval systems. The repository is intended to serve as both a learning resource and a starting point for developers who want to deploy generative AI solutions using AWS infrastructure.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
13

Agent Development Kit (ADK) for Java

An open-source, code-first Java toolkit

Google’s Agent Development Kit for Java is an open-source toolkit that helps developers design, evaluate, and deploy advanced AI agents using the Java programming language. The framework follows a code-first approach that treats agent development as a structured software engineering task rather than a collection of prompt scripts. It provides abstractions and tools that allow developers to create agents capable of executing complex workflows, calling tools, and interacting with external services. ...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
14

Paddler

Open-source LLM load balancer and serving platform for hosting LLMs

Paddler is an open-source LLM infrastructure platform designed to deploy, manage, and scale large language models on private infrastructure. The system acts as a specialized load balancer and serving layer for language models, enabling organizations to run inference workloads without relying on external API providers. It supports running models locally through engines such as llama.cpp while distributing requests across multiple compute nodes to improve performance and reliability. ...

Downloads: 0 This Week

Last Update: 2026-04-30
See Project
15

MaxText

A simple, performant and scalable Jax LLM

...The framework focuses on simplicity while still supporting advanced techniques such as model sharding, distributed computation, and high-throughput training pipelines. MaxText includes ready-to-use configurations and reproducible training examples that help developers understand how to deploy large-scale AI workloads with modern machine learning infrastructure.

Downloads: 0 This Week

Last Update: 2026-05-08
See Project
16

Lagent

A lightweight framework for building LLM-based agents

Lagent is a lightweight open-source framework designed to help developers build autonomous agents powered by large language models. The framework provides tools and abstractions that allow language models to interact with external tools, execute tasks, and perform multi-step reasoning processes. Instead of using LLMs only for text generation, Lagent enables developers to transform models into agents capable of performing actions such as retrieving data, executing code, or interacting with...

Downloads: 0 This Week

Last Update: 2026-05-13
See Project
17

Agent Chat UI

Web app for interacting with any LangGraph agent (PY & TS) via a chat

...Once connected, the interface enables real-time conversations where messages are sent to the agent and responses are streamed back to the chat interface. The project is designed to serve as a flexible frontend for agent-based AI systems, allowing developers to test and deploy conversational interfaces quickly. It also integrates with tools such as LangSmith for monitoring and debugging agent interactions during development.

Downloads: 0 This Week

Last Update: 2026-05-14
See Project
18

II Agent

A new open-source framework to build and deploy intelligent agents

II-Agent is an open-source intelligent assistant framework designed to automate complex workflows across multiple domains using large language models and external tools. The platform allows users to interact with multiple AI models within a single environment while connecting those models to external services and knowledge sources. Through a unified interface, users can switch between models, access specialized tools, and execute tasks that require information retrieval, code execution, or...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
19

FastDeploy

High-performance Inference and Deployment Toolkit for LLMs and VLMs

...Developed within the PaddlePaddle ecosystem, the toolkit focuses on providing high-performance deployment capabilities for modern AI models including large language models and vision-language systems. The platform enables developers to deploy trained models quickly using optimized inference pipelines that support GPUs, specialized AI accelerators, and other hardware architectures. FastDeploy includes advanced acceleration technologies such as speculative decoding, multi-token prediction, and efficient KV cache management to improve throughput and latency during inference. ...

Downloads: 0 This Week

Last Update: 2026-04-08
See Project
20

AWS GenAI LLM Chatbot

A modular and comprehensive solution to deploy a Multi-LLM

AWS GenAI LLM Chatbot is an enterprise-ready reference solution for deploying a secure, feature-rich generative AI chatbot on AWS with retrieval-augmented generation capabilities. The project is built as a modular blueprint that helps organizations stand up a production-oriented chat experience rather than a simple demo, combining model access, knowledge retrieval, storage, security, and user interface components into one deployable system. It supports multiple model providers and endpoints,...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
21

Farfalle

AI search engine - self-host with local or cloud LLMs

...The project integrates large language models with multiple search APIs so that the system can gather information from external sources and synthesize responses into concise answers. It can run either with local language models or with cloud-based providers, allowing developers to deploy it privately or integrate with hosted AI services. The architecture separates the frontend and backend, using modern web technologies such as Next.js and FastAPI to deliver an interactive interface and scalable server logic. Farfalle also includes an agent-based search workflow that plans queries and executes multiple search steps to produce more accurate results than traditional keyword searches. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
22

NVIDIA Generative AI Examples

Generative AI reference workflows

...The repository includes examples covering topics such as retrieval-augmented generation pipelines, agent-based workflows, and multimodal AI applications that combine text, vision, and data processing. Many of the examples show how to deploy AI services using containerized environments, GPU acceleration, and microservices that can scale across modern infrastructure. Developers can explore sample chatbot applications, document question-answering systems, and knowledge-base pipelines that illustrate how generative AI can interact with external data sources.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
23

Chinese-LLaMA-Alpaca 2

Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project

This project is developed based on the commercially available large model Llama-2 released by Meta. It is the second phase of the Chinese LLaMA&Alpaca large model project. The Chinese LLaMA-2 base model and the Alpaca-2 instruction fine-tuning large model are open-sourced. These models expand and optimize the Chinese vocabulary on the basis of the original Llama-2, use large-scale Chinese data for incremental pre-training, and further improve the basic semantics and command understanding of...

Downloads: 0 This Week

Last Update: 2024-01-23
See Project
24

YAYI

Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM

...In addition to producing coherent responses, the system is designed to handle tasks such as summarization, translation, question answering, and text classification. The repository provides model checkpoints, training resources, and inference tools that allow developers to deploy the model in their own applications. By releasing both the model and supporting infrastructure, the project encourages experimentation and research in multilingual AI systems.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
25

gpu_poor

Calculate token/s & GPU memory requirement for any LLM

...The tool also provides a detailed breakdown of where GPU memory is allocated, including model weights, KV cache, activations, and other runtime overhead. This information allows developers to evaluate trade-offs between different quantization methods such as GGML, bitsandbytes, and QLoRA before attempting to deploy a model. gpu_poor is particularly useful for researchers and hobbyists.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
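
The memory breakdown described for gpu_poor can be illustrated with a back-of-the-envelope estimate. The sketch below is not gpu_poor's code or its actual formula. It is a minimal approximation that assumes standard multi-head attention (no grouped-query attention), a KV cache stored in 16-bit precision, and a flat overhead fraction standing in for activations and runtime overhead. The function name `estimate_memory_gib` and all of its parameters are hypothetical.

```python
def estimate_memory_gib(
    n_params,
    bytes_per_param,
    n_layers,
    hidden_size,
    context_len,
    batch_size=1,
    kv_bytes=2,
    overhead_fraction=0.1,
):
    """Rough GPU memory estimate in GiB (illustrative assumption, not gpu_poor's method)."""
    gib = 2**30

    # Model weights: parameter count times storage size per parameter
    # (2 bytes for fp16, roughly 0.5 bytes for 4-bit quantization).
    weights = n_params * bytes_per_param

    # KV cache: one key and one value tensor per layer,
    # each of shape batch_size x context_len x hidden_size.
    kv_cache = 2 * n_layers * batch_size * context_len * hidden_size * kv_bytes

    # Activations and runtime overhead, approximated as a flat fraction (assumption).
    overhead = overhead_fraction * (weights + kv_cache)

    return {
        "weights": weights / gib,
        "kv_cache": kv_cache / gib,
        "overhead": overhead / gib,
        "total": (weights + kv_cache + overhead) / gib,
    }


if __name__ == "__main__":
    # Hypothetical 7B-parameter model: 32 layers, hidden size 4096, 4096-token context.
    fp16 = estimate_memory_gib(7_000_000_000, 2, 32, 4096, 4096)
    int4 = estimate_memory_gib(7_000_000_000, 0.5, 32, 4096, 4096)
    for name, result in (("fp16", fp16), ("4-bit", int4)):
        print(name, {k: round(v, 2) for k, v in result.items()})
```

Comparing the two results shows why quantization mainly shrinks the weights term while the KV cache term stays the same for a given context length and batch size.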

© 2026 Slashdot Media. All Rights Reserved.