Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "cpu memory usage" - Page 5

x

Sort By:

Relevance

Clear All Filters

OS

Windows 141
Linux 140
Mac 134
More...
BSD 56
ChromeOS 56
Mobile Operating Systems 13

Category

Artificial Intelligence 156
Software Development 15
System 7
Multimedia 3
Business 2
Database 2
Scientific/Engineering 2
Communications 1
Education 1
Formats and Protocols 1
Games 1
Mobile 1
Productivity 1
Social sciences 1

License

OSI-Approved Open Source 134
Other License 2
Creative Commons Attribution License 1
GNU Free Documentation License 1

Translations

English 8
Chinese (Simplified) 2
Japanese 2
Spanish 2
More...
Arabic 1
Dutch 1
French 1
German 1
Italian 1
Korean 1
Portuguese 1
Russian 1
Turkish 1

Programming Language

Python 88
C++ 24
JavaScript 14
TypeScript 8
More...
Rust 7
C 5
Unix Shell 5
C# 4
Go 4
Java 3
Dart 1
Kotlin 1
Lua 1
PL/SQL 1
Swift 1

Status

Production/Stable 6
Beta 2

Showing 156 open source projects for "cpu memory usage"

View related business solutions

Artificial Intelligence Clear Filters & Widen Search

Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
1

AIMET

AIMET is a library that provides advanced quantization and compression

...Quantized inference is significantly faster than floating point inference. For example, models that we’ve run on the Qualcomm® Hexagon™ DSP rather than on the Qualcomm® Kryo™ CPU have resulted in a 5x to 15x speedup. Plus, an 8-bit model also has a 4x smaller memory footprint relative to a 32-bit model. However, often when quantizing a machine learning model (e.g., from 32-bit floating point to an 8-bit fixed point value), the model accuracy is sacrificed.

Downloads: 1 This Week

Last Update: 2026-05-20
See Project
2

tsai

Time series Timeseries Deep Learning Machine Learning Pytorch fastai

...If you require any of the dependencies that is not installed, tsai will ask you to install it when necessary) We've also added a new PredictionDynamics callback that will display the predictions during training. This is the type of output you would get in a classification task. New tutorial notebook on how to train your model with larger-than-memory datasets in less time achieving up to 100% GPU usage! See our new tutorial notebook on how to track your experiments with Weights & Biases

Downloads: 0 This Week

Last Update: 2 days ago
See Project
3

marqo

Tensor search for humans

A tensor-based search and analytics engine that seamlessly integrates with your applications, websites, and workflows. Marqo is a versatile and robust search and analytics engine that can be integrated into any website or application. Due to horizontal scalability, Marqo provides lightning-fast query times, even with millions of documents. Marqo helps you configure deep-learning models like CLIP to pull semantic meaning from images. It can seamlessly handle image-to-image, image-to-text and...

Downloads: 0 This Week

Last Update: 2026-04-02
See Project
4

DocArray

The data structure for multimodal data

...Data science powerhouse: greatly accelerate data scientists’ work on embedding, k-NN matching, querying, visualizing, evaluating via Torch/TensorFlow/ONNX/PaddlePaddle on CPU/GPU. Data in transit: optimized for network communication, ready-to-wire at anytime with fast and compressed serialization in Protobuf, bytes, base64, JSON, CSV, DataFrame. Perfect for streaming and out-of-memory data. One-stop k-NN: Unified and consistent API for mainstream vector databases.

Downloads: 0 This Week

Last Update: 2025-03-21
See Project
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
5

Smile

Statistical machine intelligence and learning engine

...Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM languages. Data scientists and developers can speak the same language now! Smile provides hundreds advanced algorithms with clean interface. Scala API also offers high-level operators that make it easy to build machine learning apps. ...

Downloads: 0 This Week

Last Update: 2026-05-04
See Project
6

Mini Agent

A minimal yet professional single agent demo project

Mini-Agent is a minimal yet production-minded demo project that shows how to build a serious command-line AI agent around the MiniMax-M2 model. It is designed both as a reference implementation and as a usable agent, demonstrating a full execution loop that includes planning, tool calls, and iterative refinement. The project exposes an Anthropic-compatible API interface and fully supports interleaved thinking, letting the agent alternate between reasoning steps and tool invocations during...

Downloads: 0 This Week

Last Update: 2026-02-14
See Project
7

Qwen2.5-Omni

Capable of understanding text, audio, vision, video

Qwen2.5-Omni is an end-to-end multimodal flagship model in the Qwen series by Alibaba Cloud, designed to process multiple modalities (text, images, audio, video) and generate responses both as text and natural speech in streaming real-time. It supports “Thinker-Talker” architecture, and introduces innovations for aligning modalities over time (for example synchronizing video/audio), robust speech generation, and low-VRAM/quantized versions to make usage more accessible. It holds...

Downloads: 0 This Week

Last Update: 2025-09-23
See Project
8

FastChat

Open platform for training, serving, and evaluating language models

FastChat is an open platform for training, serving, and evaluating large language model-based chatbots. If you do not have enough memory, you can enable 8-bit compression by adding --load-8bit to the commands above. This can reduce memory usage by around half with slightly degraded model quality. It is compatible with the CPU, GPU, and Metal backend. Vicuna-13B with 8-bit compression can run on a single NVIDIA 3090/4080/T4/V100(16GB) GPU. In addition to that, you can add --cpu-offloading to commands above to offload weights that don't fit on your GPU onto the CPU memory. ...

Downloads: 0 This Week

Last Update: 2024-02-11
See Project
9

Quarto Solver

Quarto Solver calculates optimal moves for Quarto and Quarto 2x2

You can calculate for every game state in Quarto and Quarto 2x2 an optimal move.

Downloads: 4 This Week

Last Update: 2024-06-01
See Project
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
10

Gemini-Lite

Lightweight desktop client for Google Gemini built with Rust & Tauri.

...Key Features: Ultra-Low Resource Usage: The core backend runs on a tiny footprint (11MB-40MB), saving your system's memory for what matters. Native Performance: Fast startup, smooth rendering, and deep OS integration. Quick Rescue Shortcut: Hit Ctrl + H at any time to instantly reset the window and jump back to the main chat interface.

Downloads: 1 This Week

Last Update: 2 hours ago
See Project
11

Warlock-Studio

AI Suite for upscaling, interpolating & restoring images/videos

v6.0. Warlock-Studio is a Windows application that uses Real-ESRGAN, BSRGAN, IRCNN, GFPGAN, RealESRNet, RealESRAnime and RIFE Artificial Intelligence models to upscale, restore faces, interpolate frames and reduce noise in images and videos. the application supports GPU acceleration (including multi-GPU setups) and offers batch processing for large workloads. It includes drag-and-drop handling for single or multiple files, optional pre-resize functions, and an automatic tiling system...

Downloads: 21 This Week

Last Update: 2026-02-16
See Project
12

KoboldCpp

Run GGUF models easily with a UI or API. One File. Zero Install.

KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds off llama.cpp and adds many additional powerful features.

Downloads: 283 This Week

Last Update: 20 minutes ago
See Project
13

HunyuanVideo-I2V

A Customizable Image-to-Video Model based on HunyuanVideo

HunyuanVideo-I2V is a customizable image-to-video generation framework developed by Tencent, extending the capabilities of HunyuanVideo. It allows for high-quality video creation from still images, using PyTorch and providing pre-trained model weights, inference code, and customizable training options. The system includes a LoRA training code for adding special effects and enhancing video realism, aiming to offer versatile and scalable solutions for generating videos from static image inputs.

1 Review

Downloads: 6 This Week

Last Update: 2025-03-10
See Project
14

Glint Translator

...Features • 3 Translation Modes: Fluent (parallel), Area (overlay), Full Screen (smart detection) • Speaker detection with color-coding • Glint AI custom terminology control • Game-based profile system • Advanced settings with 50+ parameters for fine-tuned control • Share and import custom profiles (.glint) between users • Low CPU/RAM usage, optimized for Windows 10/11 Live Subtitle (Real-Time Voice Translation) Real-time speech-to-text translation for games, movies, and voice chats. Automatically detects audio, converts speech to text, and translates it instantly. Example: They speak German → you see Turkish AI Model Support • Google Gemini: 2.5 Flash, 2.5 Pro • OpenAI: GPT-4o, GPT-4 Turbo

1 Review

Downloads: 45 This Week

Last Update: 2026-05-02
See Project
15

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis

AutoGPTQ is an implementation of GPTQ (Quantized GPT) that optimizes large language models (LLMs) for faster inference by reducing their computational footprint while maintaining accuracy.

Downloads: 0 This Week

Last Update: 2025-01-21
See Project
16

OpenChat for Linux

OpenChat for Linux — a fast, lightweight desktop client for ChatGPT

OpenChat for Linux is a desktop client for ChatGPT / OpenAI Chat designed specifically for Linux. It’s built with Tauri (Rust) for low resource usage and stability, and it uses a “message window” approach (keeps a small active slice of the conversation and loads more as you scroll) so long chats don’t bog down or crash the app. Downloads are available in common Linux formats (AppImage, Debian package, tarball), with additional packaging manifests for Flatpak, Snap, RPM, AUR, and Nix.

Downloads: 22 This Week

Last Update: 2026-03-18
See Project
17

ClawBridge

The OpenClaw Mobile Dashboard.

The OpenClaw Mobile Dashboard. Monitor agent's real-time thoughts, actions, track token costs, and manage tasks from anywhere using your pocket-sized Mission Control.

Downloads: 0 This Week

Last Update: 2026-02-26
See Project
18

Mixtral offloading

Run Mixtral-8x7B models in Colab or consumer desktops

Mixtral-Offloading is an open-source project designed to enable efficient inference of large Mixture-of-Experts language models such as Mixtral-8x7B on hardware with limited GPU memory. The project implements techniques that allow model components to be dynamically moved between CPU memory and GPU memory during inference, significantly reducing the amount of GPU VRAM required to run the model. This approach takes advantage of the sparse activation properties of mixture-of-experts architectures, where only a subset of expert networks are used for each token during generation. ...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
19

OnnxStream

Lightweight inference library for ONNX files, written in C++

...Generally, major machine learning frameworks and libraries are focused on minimizing inference latency and/or maximizing throughput, all of which at the cost of RAM usage. So I decided to write a super small and hackable inference library specifically focused on minimizing memory consumption: OnnxStream. OnnxStream is based on the idea of decoupling the inference engine from the component responsible for providing the model weights, which is a class derived from WeightsProvider. A WeightsProvider specialization can implement any type of loading, caching, and prefetching of the model parameters.

Downloads: 7 This Week

Last Update: 2024-08-14
See Project
20

SuperAGI

A dev-first open source autonomous AI agent framework

...Connect to multiple Vector DBs to enhance your agent’s performance. Each agent is unique, use different models of your choice. Get insights into your agent’s performance and optimize accordingly. Control token usage to manage costs effectively. Enable your agents to learn and adapt by storing their memory. Get notified when agents get stuck in the loop, and provide proactive resolution. Read and store files generated by Agents.

Downloads: 0 This Week

Last Update: 2024-01-12
See Project
21

Firefly LLM

A large model training tool that supports training large models

Firefly is an open-source framework designed to simplify the training and fine-tuning of large language models through a unified and configurable workflow. The project provides a comprehensive environment where developers can perform tasks such as model pre-training, instruction tuning, and preference optimization using widely adopted machine learning techniques. Its architecture supports both full-parameter training and parameter-efficient strategies like LoRA and QLoRA, making it suitable...

Downloads: 0 This Week

Last Update: 2026-03-08
See Project
22

Punica

Serving multiple LoRA finetuned LLM as one

Punica is a system designed to efficiently serve multiple LoRA-fine-tuned large language models within a shared GPU environment. LoRA is a parameter-efficient fine-tuning method that allows developers to adapt large pretrained models to specific tasks by adding lightweight adapter layers rather than retraining the entire model. Punica introduces a serving architecture that allows multiple LoRA adapters to share the same base model during inference, significantly reducing memory consumption...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
23

gpu_poor

Calculate token/s & GPU memory requirement for any LLM

gpu_poor is an open-source tool designed to help developers determine whether their hardware is capable of running a specific large language model and to estimate the performance they can expect from it. The project focuses on calculating GPU memory requirements and predicted inference speed for different models, hardware configurations, and quantization strategies. By analyzing factors such as model size, context length, batch size, and GPU specifications, the system estimates how much VRAM...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
24

Synaptrix ChatGPT Desktop

Fuel your productivity with ChatGPT-Desktop

Fuel your productivity with ChatGPT-Desktop, blazingly fast and supercharged.

Downloads: 0 This Week

Last Update: 2024-07-01
See Project
25

ChatLLM Web

Chat with LLM like Vicuna totally in your browser with WebGPU

...To use this app, you need a browser that supports WebGPU, such as Chrome 113 or Chrome Canary. Chrome versions ≤ 112 are not supported. You will need a GPU with about 6.4GB of memory. If your GPU has less memory, the app will still run, but the response time will be slower. The first time you use the app, you will need to download the model. For the Vicuna-7b model that we are currently using, the download size is about 4GB. After the initial download, the model will be loaded from the browser cache for faster usage.

Downloads: 0 This Week

Last Update: 2023-08-25
See Project

Previous
1
2
3
4
You're on page 5
6
7
Next

Related Searches

koboldcpp

speech

ai

time series analysis and forecasting

unstructured data

smile

chatbot code

quarto

video ai

gguf

Related Categories

Artificial Intelligence

Software Development

System

Multimedia

Business

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise