Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Large Language Models (LLM)
Search Results

Search Results for "optimization" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Linux 44
Windows 44
Mac 43
More...
BSD 32
ChromeOS 32

Category

Artificial Intelligence 45

License

OSI-Approved Open Source 41

Programming Language

Python 45

Showing 45 open source projects for "optimization"

View related business solutions

Large Language Models (LLM) Python Clear Filters & Widen Search

Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
1

LLMs-Zero-to-Hero

From nobody to big model (LLM) hero

LLMs-Zero-to-Hero is an open-source educational project designed to guide learners through the complete process of understanding and building large language models from the ground up. The repository presents a structured learning pathway that begins with fundamental concepts in machine learning and progresses toward advanced topics such as model pre-training, fine-tuning, and deployment. Rather than relying entirely on existing frameworks, the project encourages readers to implement...

Downloads: 0 This Week

Last Update: 2026-05-04
See Project
2

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

LMOps is a research initiative and open-source toolkit focused on the development and operational management of AI applications built with large language models and generative AI systems. The project explores the technologies and methodologies required to move foundation models from research environments into production-grade AI products. It includes experimental tools and frameworks that help developers optimize prompts, design workflows for generative models, and manage the lifecycle of...

Downloads: 0 This Week

Last Update: 2026-04-30
See Project
3

llm.c

LLM training in simple, raw C/CUDA

llm.c is a minimalist, systems-level implementation of a small transformer-based language model in C that prioritizes clarity and educational value. By stripping away heavy frameworks, it exposes the core math and memory flows of embeddings, attention, and feed-forward layers. The code illustrates how to wire forward passes, losses, and simple training or inference loops with direct control over arrays and buffers. Its compact design makes it easy to trace execution, profile hotspots, and...

Downloads: 0 This Week

Last Update: 2025-10-15
See Project
4

Mosec

A high-performance ML model serving framework, offers dynamic batching

Mosec is a high-performance and flexible model-serving framework for building ML model-enabled backend and microservices. It bridges the gap between any machine learning models you just trained and the efficient online service API.

Downloads: 0 This Week

Last Update: 2026-04-15
See Project
Error to trace to log to deploy. One click. No SSH.
Catch the cause before the pager goes off.

AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.

Free 30 days.
5

Ludwig AI

Low-code framework for building custom LLMs, neural networks

...Automatic batch size selection, distributed training (DDP, DeepSpeed), parameter efficient fine-tuning (PEFT), 4-bit quantization (QLoRA), and larger-than-memory datasets. Retain full control of your models down to the activation functions. Support for hyperparameter optimization, explainability, and rich metric visualizations. Experiment with different model architectures, tasks, features, and modalities with just a few parameter changes in the config. Think building blocks for deep learning.

Downloads: 0 This Week

Last Update: 2026-05-29
See Project
6

AgentEvolver

Towards Efficient Self-Evolving Agent System

AgentEvolver is an open-source research framework for building self-evolving AI agents powered by large language models. The system focuses on improving the efficiency and scalability of training autonomous agents by allowing them to generate tasks, explore environments, and refine strategies without heavy reliance on manually curated datasets. Its architecture combines reinforcement learning with LLM-driven reasoning mechanisms to guide exploration and learning. The framework introduces...

Downloads: 0 This Week

Last Update: 2026-03-28
See Project
7

OmAgent

Build multimodal language agents for fast prototype and production

...The framework provides abstractions and infrastructure for building AI agents that operate on text, images, video, and audio while maintaining a relatively simple interface for developers. Instead of forcing developers to implement complex orchestration logic manually, the system manages task scheduling, worker coordination, and node optimization behind the scenes. Its architecture uses a graph-based workflow engine where tasks are represented as nodes in a directed workflow, enabling modular composition of complex reasoning pipelines. The framework also includes support for various reasoning strategies commonly used in language agents, such as chain-of-thought prompting, self-consistency reasoning, and ReAct-style decision loops.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
8

Torch Pruning

DepGraph: Towards Any Structural Pruning

Torch-Pruning is an open-source toolkit designed to optimize deep neural networks by performing structural pruning directly within PyTorch models. The library focuses on reducing the size and computational cost of neural networks by removing redundant parameters and channels while maintaining model performance. It introduces a graph-based algorithm called DepGraph that automatically identifies dependencies between layers, allowing parameters to be pruned safely across complex architectures....

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
9

Chitu

High-performance inference framework for large language models

Chitu is a high-performance inference engine designed to deploy and run large language models efficiently in production environments. The framework focuses on improving efficiency, flexibility, and scalability for organizations that need to run LLM inference workloads across different hardware platforms. It supports heterogeneous computing environments, including CPUs, GPUs, and various specialized AI accelerators, allowing models to run across a wide range of infrastructure configurations....

Downloads: 0 This Week

Last Update: 2026-06-04
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
10

The Alignment Handbook

Robust recipes to align language models with human and AI preferences

The Alignment Handbook is an open-source resource created to provide practical guidance for aligning large language models with human preferences and safety requirements. The project focuses on the post-training stage of model development, where models are refined after pre-training to behave more helpfully, safely, and reliably in real-world applications. It provides detailed training recipes that explain how to perform tasks such as supervised fine-tuning, preference modeling, and...

Downloads: 0 This Week

Last Update: 2026-03-08
See Project
11

Agents 2.0

An Open-source Framework for Data-centric Language Agents

Agents is an open-source framework designed to build and train autonomous language agents through a data-centric and learning-oriented architecture. The project introduces a concept known as agent symbolic learning, which treats an agent pipeline similarly to a neural network computational graph. In this framework, each node in the pipeline represents a step in the reasoning or action process, while prompts and tools act as adjustable parameters analogous to neural network weights. During...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
12

Intel LLM Library for PyTorch

Accelerate local LLM inference and finetuning

Intel LLM Library for PyTorch is an open-source acceleration library developed to optimize large language model inference and fine-tuning on Intel hardware platforms. Built as an extension of the PyTorch ecosystem, the library enables developers to run modern transformer models efficiently on Intel CPUs, GPUs, and specialized AI accelerators. The framework provides hardware-aware optimizations and low-precision computation techniques that significantly improve the performance of large...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
13

Ring

Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI

...Its design emphasizes reasoning, efficiency, and modular expert activation. In its “flash” variant (Ring-flash-2.0), it optimizes inference by activating only a subset of experts. It applies reinforcement learning/reasoning optimization techniques. Its architectures and training approaches are tuned to enable efficient and capable reasoning performance. Reasoning-optimized model with reinforcement learning enhancements. Efficient architecture and memory design for large-scale reasoning. If you are located in mainland China, we also provide the model on ModelScope.cn to speed up the download process.

Downloads: 0 This Week

Last Update: 2025-09-30
See Project
14

BISHENG

BISHENG is an open LLM devops platform for next generation apps

BISHENG is an open LLM application DevOps platform, focusing on enterprise scenarios. It has been used by a large number of industry-leading organizations and Fortune 500 companies. "Bi Sheng" was the inventor of movable type printing, which played a vital role in promoting the transmission of human knowledge. We hope that BISHENG can also provide strong support for the widespread implementation of intelligent applications. Everyone is welcome to participate.

Downloads: 0 This Week

Last Update: 2026-05-11
See Project
15

PKU Beaver

Constrained Value Alignment via Safe Reinforcement Learning

...These annotations include categories such as harmful language, unethical behavior, privacy violations, and other sensitive topics. By incorporating constraint-based optimization methods, Safe-RLHF trains models that balance reward objectives with safety requirements, ensuring that harmful outputs are penalized during training.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
16

RAG-Retrieval

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding

RAG-Retrieval is an open-source framework for building and training retrieval systems used in retrieval-augmented generation pipelines. Retrieval-augmented generation combines large language models with external knowledge retrieval to improve factual accuracy and domain-specific reasoning. This repository provides end-to-end infrastructure for training retrieval models, performing inference, and distilling embedding models for improved performance. It includes implementations of modern...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
17

Firefly LLM

A large model training tool that supports training large models

Firefly is an open-source framework designed to simplify the training and fine-tuning of large language models through a unified and configurable workflow. The project provides a comprehensive environment where developers can perform tasks such as model pre-training, instruction tuning, and preference optimization using widely adopted machine learning techniques. Its architecture supports both full-parameter training and parameter-efficient strategies like LoRA and QLoRA, making it suitable for environments with limited computational resources. Firefly is compatible with a wide range of popular open-source models including LLaMA, Qwen, Baichuan, InternLM, and Mistral, enabling developers to experiment with different architectures using a consistent training pipeline. ...

Downloads: 0 This Week

Last Update: 2026-03-08
See Project
18

Mixtral offloading

Run Mixtral-8x7B models in Colab or consumer desktops

Mixtral-Offloading is an open-source project designed to enable efficient inference of large Mixture-of-Experts language models such as Mixtral-8x7B on hardware with limited GPU memory. The project implements techniques that allow model components to be dynamically moved between CPU memory and GPU memory during inference, significantly reducing the amount of GPU VRAM required to run the model. This approach takes advantage of the sparse activation properties of mixture-of-experts...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
19

EvaDB

Database system for building simpler and faster AI-powered application

Over the last decade, AI models have radically changed the world of natural language processing and computer vision. They are accurate on various tasks ranging from question answering to object tracking in videos. To use an AI model, the user needs to program against multiple low-level libraries, like PyTorch, Hugging Face, Open AI, etc. This tedious process often leads to a complex AI app that glues together these libraries to accomplish the given task. This programming complexity prevents...

Downloads: 0 This Week

Last Update: 2023-11-19
See Project
20

Chinese Llama 2 7B

The first Chinese LLaMA2 model in the open source community

...In addition to the model weights, the repository also includes supervised fine-tuning datasets and training resources that help developers build chat-optimized versions of the model. The project follows the input format used by the LLaMA-2 chat architecture, ensuring compatibility with existing optimization techniques and tools built for the LLaMA-2 ecosystem. By releasing both the model and associated datasets, the project allows researchers and developers to experiment with Chinese language models in a fully open environment.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project

Previous
1
You're on page 2
Next

Related Searches

ai

train ai

python ai

Related Categories

Artificial Intelligence

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise