Search Results for "compression" - Page 2



Showing 58 open source projects for "compression"

Filters: Artificial Intelligence, Linux

1

TurboQuant PyTorch

From-scratch PyTorch implementation of Google's TurboQuant

TurboQuant PyTorch is a specialized deep learning optimization framework designed to accelerate neural network inference and training through advanced quantization techniques within the PyTorch ecosystem. The project focuses on reducing the computational and memory footprint of models by converting floating-point representations into lower-precision formats while preserving performance. It provides tools for experimenting with different quantization strategies, enabling developers to balance...

Downloads: 1 This Week

Last Update: 2026-04-23
See Project
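
As a rough illustration of what "converting floating-point representations into lower-precision formats" means in general, the sketch below applies plain symmetric int8 quantization to a short list of weights. It is not the TurboQuant algorithm and not this project's API; the function names `quantize_int8` and `dequantize_int8` are hypothetical and chosen only for this example.

```python
def quantize_int8(values):
    """Map floats to integer codes in [-127, 127] using one shared scale.

    Generic symmetric quantization, shown for illustration only.
    """
    max_abs = max((abs(v) for v in values), default=0.0)
    # One scale for the whole list; fall back to 1.0 for all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int8(codes, scale):
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]


weights = [0.5, -1.27, 0.031, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)

# The rounding error per value is bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, scale, max_error)
```

Each weight is stored as a small integer code plus one shared float scale, which is where the memory saving comes from; the cost is a rounding error of at most half a step per value.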
2

Advanced + Agentic RAG Cookbooks

Advanced RAG cookbooks for building accurate LLM applications

...Athina AI’s RAG Cookbooks cover the full RAG pipeline, including indexing, retrieval, augmentation, and generation, while also addressing evaluation to measure accuracy and relevance. The collection includes multiple approaches such as hybrid search, contextual compression, and agent-based retrieval strategies, allowing users to experiment and compare methods. It is designed to reduce development time by offering practical examples and references to research papers, making it useful for both learning and production use. Overall, it serves as a hands-on resource for improving LLM outputs using external data sources.

Downloads: 7 This Week

Last Update: 4 days ago
See Project
3

Edgee

AI gateway with token compression for Claude Code, Codex, and more

Edgee is an edge-native execution platform designed to run AI-driven logic and data processing directly at the network edge, reducing latency and improving responsiveness for modern applications. It enables developers to deploy functions and workflows closer to users, allowing real-time processing without relying heavily on centralized cloud infrastructure. The platform is built to support event-driven architectures, where actions are triggered by incoming requests, user behavior, or...

Downloads: 3 This Week

Last Update: 2026-06-08
See Project
4

Lossless Claw

LCM (Lossless Context Management) plugin for OpenClaw

...The system stores every interaction in a persistent database and incrementally summarizes older content into a hierarchical directed acyclic graph, allowing efficient compression without discarding information. This structure enables agents to dynamically reconstruct detailed context by expanding summaries when needed, effectively simulating perfect long-term memory.

Downloads: 2 This Week

Last Update: 6 hours ago
See Project
5

Torch Pruning

DepGraph: Towards Any Structural Pruning

...Torch-Pruning physically removes parameters rather than masking them, which results in smaller and faster models during both training and inference. The toolkit supports a wide variety of architectures used in computer vision and large language models, making it a flexible solution for model compression tasks.

Downloads: 2 This Week

Last Update: 2026-03-05
See Project
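
The difference between masking and physically removing parameters can be sketched with two tiny dense layers stored as nested lists. This is a toy illustration under simplifying assumptions (no biases, ReLU activation), not the Torch-Pruning API; `mask_neurons`, `prune_neurons`, `forward`, and `count_params` are hypothetical helpers written for this example.

```python
def mask_neurons(w1, drop):
    """Masking: zero the rows of dropped hidden neurons, keep the shape."""
    return [[0.0] * len(row) if i in drop else list(row)
            for i, row in enumerate(w1)]


def prune_neurons(w1, w2, drop):
    """Structural pruning: delete the dropped rows of w1 AND the matching
    input columns of the next layer w2, so the shapes stay consistent."""
    keep = [i for i in range(len(w1)) if i not in drop]
    new_w1 = [list(w1[i]) for i in keep]
    new_w2 = [[row[i] for i in keep] for row in w2]
    return new_w1, new_w2


def forward(w1, w2, x):
    """Two linear layers with a ReLU in between (no biases)."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in w1]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]


def count_params(matrix):
    return sum(len(row) for row in matrix)


w1 = [[1.0, 2.0], [0.5, -1.0], [3.0, 0.0]]   # 3 hidden neurons, 2 inputs
w2 = [[1.0, 2.0, 0.5]]                       # 1 output, 3 hidden inputs
x = [1.0, 1.0]
drop = {1}                                   # remove hidden neuron 1

masked_w1 = mask_neurons(w1, drop)
pruned_w1, pruned_w2 = prune_neurons(w1, w2, drop)

masked_out = forward(masked_w1, w2, x)
pruned_out = forward(pruned_w1, pruned_w2, x)
masked_size = count_params(masked_w1) + count_params(w2)
pruned_size = count_params(pruned_w1) + count_params(pruned_w2)
print(masked_out, pruned_out, masked_size, pruned_size)
```

Both variants compute the same output, but only the pruned one stores and multiplies fewer parameters. The need to edit the next layer's columns along with the pruned rows is the kind of inter-layer dependency a structural pruning tool has to track.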
6

NVIDIA Model Optimizer

A unified library of SOTA model optimization techniques

Model Optimizer is a unified library that provides state-of-the-art techniques for compressing and optimizing deep learning models to improve inference efficiency and deployment performance. It brings together multiple optimization strategies such as quantization, pruning, distillation, and speculative decoding into a single cohesive framework. The library is designed to reduce model size and computational requirements while maintaining accuracy, making it particularly valuable for deploying...

Downloads: 0 This Week

Last Update: 2026-05-13
See Project
7

LLM-Pruner

On the Structural Pruning of Large Language Models

LLM-Pruner is an open-source framework designed to compress large language models through structured pruning techniques while maintaining their general capabilities. Large language models often require enormous computational resources, making them expensive to deploy and inefficient for many practical applications. LLM-Pruner addresses this issue by identifying and removing non-essential components within transformer architectures, such as redundant attention heads or feed-forward...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
8

Advanced RAG Techniques

Advanced techniques for RAG systems

Advanced RAG Techniques is a comprehensive collection of tutorials and implementations focused on advanced Retrieval-Augmented Generation (RAG) systems. It is designed to help practitioners move beyond basic RAG setups and explore techniques that improve retrieval quality, context construction, and answer robustness. The repository organizes techniques into categories such as foundational RAG, query enhancement, context enrichment, and advanced retrieval, making it easier to navigate...

Downloads: 0 This Week

Last Update: 2026-04-15
See Project
9

DLRM

An implementation of a deep learning recommendation model (DLRM)

...The implementation is optimized for performance at scale, supporting multi-GPU and multi-node execution, quantization, embedding partitioning, and pipelined I/O to feed huge embeddings efficiently. It includes data loaders for standard benchmarks (like Criteo), training scripts, evaluation tools, and capabilities like mixed precision, gradient compression, and memory fusion to maximize throughput.

Downloads: 0 This Week

Last Update: 2026-01-12
See Project
10

OAGI Python SDK

Python SDK for the Computer Use model Lux, developed by OpenAGI

OAGI Python SDK is a Python client library for the Lux computer-use model that turns Lux into a programmable automation layer for operating human-facing software via vision and actions. It exposes the OAGI API in an ergonomic way, letting you trigger Lux in three main modes: Tasker for precise scripted sequences, Actor for fast one-shot tasks, and Thinker for open-ended, multi-step objectives. The SDK is designed around “computer use” as a paradigm, where the AI actually navigates...

Downloads: 1 This Week

Last Update: 2026-02-22
See Project
11

ERNIE

The official repository for ERNIE 4.5 and ERNIEKit

ERNIE is an open-source large-model toolkit and model family from the PaddlePaddle ecosystem that focuses on training, fine-tuning, compression, and practical application of ERNIE large language models. The repository positions ERNIEKit as an industrial-grade development toolkit, emphasizing end-to-end workflows that span high-performance pre-training, supervised fine-tuning, and alignment. It supports both full-parameter training and parameter-efficient approaches so teams can choose between maximum quality and lower-cost adaptation depending on their constraints. ...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
12

InfiAgent

Build your own Cowork, AI Scientist and other SoTA Agents

...Designed as a “Multi-Level Agent” (MLA) system, it externalizes persistent state to the file system so that agents can operate over unlimited runtime without the need for token-intensive context compression, enabling workflows such as research paper drafting, experiments, coding, and document generation to run reliably. The framework uses a serial multi-agent hierarchy where specialized agents coordinate in tree-structured paths for clear task delegation and minimal tool conflicts, while batch file operations and persistent workspaces ensure reproducibility and traceability. ...

Downloads: 0 This Week

Last Update: 2026-03-30
See Project
13

muse

AI agent memory system—pure Markdown, zero dependencies, fully local

...Supports Claude Code, OpenClaw, Cursor, Windsurf, Gemini CLI, and Codex via one-command install. Built-in MCP Server for programmatic access. 56 skills, auto memory capture, semantic compression, role-based governance, multi-project management. Pure Markdown, no database, no cloud. MIT open source.

Downloads: 4 This Week

Last Update: 2026-03-16
See Project
14

realwatermark

A Python application to add watermarks (text or image) to PDF files

A Python application that adds text or image watermarks to PDF files by converting the pages to images and back to PDF, with options for OCR and compression.

Downloads: 0 This Week

Last Update: 2025-01-27
See Project
15

FastChat

Open platform for training, serving, and evaluating language models

...This requires 8-bit compression to be enabled and the bitsandbytes package to be installed, which is only available on Linux operating systems.

Downloads: 0 This Week

Last Update: 2024-02-11
See Project
16

ADAMS

ADAMS is a workflow engine for building complex knowledge workflows.

ADAMS is a flexible workflow engine aimed at quickly building and maintaining data-driven, reactive workflows that integrate easily into business processes. Instead of placing operators on a canvas and manually connecting them, a tree structure and flow control operators determine how data is processed (sequentially or in parallel). This allows rapid development and easy maintenance of large workflows, with hundreds or thousands of operators. Operators include machine learning (WEKA, MOA, MEKA)...

Downloads: 10 This Week

Last Update: 2024-03-21
See Project
17

CLIP-as-service

Embed images and sentences into fixed-length vectors

...No learning curve, minimalist design on client and server. Intuitive and consistent API for image and sentence embedding. Async client support. Easily switch between gRPC, HTTP, WebSocket protocols with TLS and compression. Smooth integration with neural search ecosystem including Jina and DocArray. Build cross-modal and multi-modal solutions in no time.

Downloads: 0 This Week

Last Update: 2023-12-20
See Project
18

Neural Network Intelligence

AutoML toolkit to automate the machine learning lifecycle

Neural Network Intelligence (NNI) is an open source AutoML toolkit that automates the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyperparameter tuning. The lightweight but powerful tool manages automated machine learning (AutoML) experiments, dispatching and running the trial jobs generated by tuning algorithms to search for the best neural architecture and/or hyperparameters in different training environments such as Local Machine, Remote Servers, OpenPAI, Kubeflow, FrameworkController on K8S (AKS etc.) ...

Downloads: 2 This Week

Last Update: 2023-09-13
See Project
19

FedLab

A flexible Federated Learning Framework based on PyTorch

A Python-based framework for federated learning simulation, emphasizing modularity, communication efficiency, and algorithmic flexibility. Supports both server- and client-side customization for research and development purposes.

Downloads: 1 This Week

Last Update: 2025-07-15
See Project
20

Minkowski Engine

Auto-diff neural network library for high-dimensional sparse tensors

...To run the examples, please install the package and run the command in the package root directory. Compressing a neural network to speed up inference and minimize memory footprint has been studied widely. One popular technique for model compression is pruning the weights in convnets, also known as sparse convolutional networks. Such parameter-space sparsity used for model compression compresses networks that operate on dense tensors, and all intermediate activations of these networks are also dense tensors.

Downloads: 0 This Week

Last Update: 2022-08-11
See Project
21

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD

This repository provides pre-trained encoder-decoder models and their related optimization techniques developed by Alibaba's MinD (Machine IntelligeNce of Damo) Lab. Pre-trained models for natural language understanding (NLU). We extend BERT to a new model, StructBERT, by incorporating language structures into pre-training. Specifically, we pre-train StructBERT with two auxiliary tasks to make the most of the sequential order of words and sentences, which leverage language structures at the...

Downloads: 0 This Week

Last Update: 2022-08-17
See Project
22

TNN

Uniform deep learning inference framework for mobile

TNN is a high-performance, lightweight neural network inference framework open sourced by Tencent Youtu Lab. Its advantages include cross-platform support, high performance, model compression, and code tailoring. Building on the original Rapidnet and ncnn frameworks, TNN further strengthens support and performance optimization for mobile devices. It also draws on the high performance and good scalability of the industry's mainstream open source frameworks, and extends support to X86 and NV GPUs. ...

Downloads: 0 This Week

Last Update: 2022-08-03
See Project
23

TextBrewer

A PyTorch-based knowledge distillation toolkit

TextBrewer is a PyTorch-based model distillation toolkit for natural language processing. It includes various distillation techniques from both the NLP and CV fields and provides an easy-to-use distillation framework that lets users quickly experiment with state-of-the-art distillation methods, compressing a model with a relatively small sacrifice in performance while increasing inference speed and reducing memory usage.

Downloads: 0 This Week

Last Update: 2025-01-22
See Project
24

wav2letter++

Facebook AI Research's automatic speech recognition toolkit

...This repository includes recipes to reproduce the following research papers as well as pre-trained models. All results reproduction must use Flashlight <= 0.3.2 for exact reproducibility. At least one of LZMA, BZip2, or Z is required for LM compression with KenLM. It is highly recommended to build KenLM with position-independent code (-fPIC) enabled, to enable Python compatibility. After installing, run export KENLM_ROOT_DIR=... so that wav2letter++ can find it. This is needed because KenLM doesn't support a make install step. wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read from the pipelines. ...

Downloads: 0 This Week

Last Update: 2022-05-27
See Project
25

exchange-core

Ultra-fast matching engine written in Java based on LMAX Disruptor

...Cancel operation takes ~0.7µs, placing new order ~1.0µs. Disk journaling and journal replay support, state snapshots (serialization) and restore operations, LZ4 compression. Lock-free and contention-free order matching and risk control algorithms. Matching engine and risk control operations are atomic and deterministic.

Downloads: 0 This Week

Last Update: 2022-04-15
See Project




© 2026 Slashdot Media. All Rights Reserved.
