Showing 373 open source projects for "lightweight linux"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    MusicGPT

    MusicGPT

    Generate music based on natural language prompts using LLMs

    ...The software allows users to run advanced music generation systems directly on their own devices without requiring heavy dependencies such as Python or full machine learning frameworks. Instead, it provides a lightweight environment capable of executing music generation models locally on CPUs or GPUs while maintaining strong performance across operating systems including Windows, macOS, and Linux. Users can describe a musical style, mood, or instrumentation using text prompts, and the system produces original audio samples based on those instructions. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Gemma Chat

    Gemma Chat

    Local AI chat + coding agent for Apple Silicon, powered by Gemma 4

    Gemma Chat is a local-first AI chat and coding assistant designed to run fully on-device, particularly optimized for Apple Silicon machines. It leverages Google’s Gemma family of lightweight language models, which are built on the same underlying technology as Gemini and designed for efficient local inference and reasoning tasks. The project enables users to interact with AI through a chat interface while also supporting code generation and editing workflows. It emphasizes privacy and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Hollama

    Hollama

    A minimal LLM chat app that runs entirely in your browser

    Hollama is a lightweight open-source chat application designed to run entirely within the browser while interacting with large language model servers. The project provides a minimal but powerful user interface for communicating with local or remote LLMs, including servers powered by Ollama or OpenAI-compatible APIs. Because the application runs as a static web interface, it does not require complex backend infrastructure and can be easily deployed or self-hosted. Hollama supports both...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    KnowNote

    KnowNote

    A local-first AI knowledge base & NotebookLM alternative

    KnowNote is a local-first, open-source AI knowledge base and notebook application created as an Electron-based alternative to Google NotebookLM that emphasizes privacy, control, and simplicity. It lets users build an intelligent, searchable knowledge base from uploaded documents such as PDFs, Word files, PowerPoints, and web pages, and then interact with that content using LLM-powered chat, summarization, and reasoning tools. Unlike many NotebookLM alternatives that rely on Docker or cloud...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 5
    Pocket TTS

    Pocket TTS

    A TTS that fits in your CPU (and pocket)

    Pocket TTS is a lightweight text-to-speech project designed to run efficiently on CPUs, targeting developers who want local speech generation without depending on GPUs or hosted web APIs. It is built to feel practical in everyday applications, where installation and usage should be as simple as adding a dependency and calling a function. The project focuses on keeping the runtime footprint manageable while still producing natural-sounding speech, which makes it attractive for offline tools,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Cactus Needle

    Cactus Needle

    26m function call model that runs on incredibly small devices

    Needle is an experimental 26-million-parameter function-calling model designed to run on extremely small devices such as phones, watches, glasses, and low-power personal AI hardware. It is based on a Simple Attention Network architecture and was distilled from a much larger model to focus on fast, compact tool-use behavior. The project provides open weights, training details, dataset generation resources, and a playground for testing the model with custom tools. Needle is optimized for...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Browser Harness

    Browser Harness

    Self-healing browser harness that enables LLMs to complete any task

    Browser Harness is a self-healing browser control system built to give language models direct and flexible access to a real Chrome browser through the Chrome DevTools Protocol. Its main philosophy is minimalism: instead of imposing a rigid framework, it exposes a very thin bridge so the agent can perform browser tasks with almost no abstraction in the way. A defining part of the project is that the agent can write or extend missing helper functions during a task, which is why the repository...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    OneFileLLM

    OneFileLLM

    Specify a github or local repo, github pull request

    OneFileLLM is an open-source project designed to simplify the distribution and execution of large language model applications by packaging them into a single portable file. The concept behind the project is to eliminate the complexity normally associated with deploying AI systems, which often require multiple dependencies, frameworks, and configuration steps. Instead, the entire runtime environment, model interface, and application logic are bundled together into a single executable...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    TreeQuest

    TreeQuest

    A Tree Search Library with Flexible API for LLM Inference-Time Scaling

    TreeQuest, developed by SakanaAI, is a versatile Python library implementing adaptive tree search algorithms—such as AB‑MCTS—for enhancing inference-time performance of large language models (LLMs). It allows developers to define custom state-generation and scoring functions (e.g., via LLMs), and then efficiently explores possible answer trees during runtime. With support for multi-LLM collaboration, checkpointing, and mixed policies, TreeQuest enables smarter, trial‑and‑error question...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 10
    OpenAI Agents (Python)

    OpenAI Agents (Python)

    A lightweight, powerful framework for multi-agent workflows

    openai-agents-python is a library developed by OpenAI to simplify the process of creating and running agents that interact with tools and APIs using OpenAI models. It provides abstractions for tool usage, memory management, and agent workflows, enabling developers to define function-calling agents that reason through multi-step tasks. Ideal for building custom AI workflows, the library supports dynamic tool definitions and contextual memory handling.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    GLM-4

    GLM-4

    GLM-4 series: Open Multilingual Multimodal Chat LMs

    GLM-4 is a family of open models from ZhipuAI that spans base, chat, and reasoning variants at both 32B and 9B scales, with long-context support and practical local-deployment options. The GLM-4-32B-0414 models are trained on ~15T high-quality data (including substantial synthetic reasoning data), then post-trained with preference alignment, rejection sampling, and reinforcement learning to improve instruction following, coding, function calling, and agent-style behaviors. The...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    GPUStack

    GPUStack

    Performance-optimized AI inference on your GPUs

    ...The system aggregates GPU resources from multiple machines into a unified cluster so developers and administrators can run large language models and other AI workloads efficiently across distributed infrastructure. Instead of requiring complex orchestration systems such as Kubernetes, GPUStack provides a lightweight environment that automatically selects appropriate inference engines, configures deployment parameters, and schedules workloads across available GPUs. The platform supports GPUs from a wide range of vendors and can run on laptops, workstations, and servers across operating systems such as macOS, Windows, and Linux. It also enables developers to deploy models from common repositories like Hugging Face and access them through APIs similar to cloud-based AI services.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    zclaw

    zclaw

    Your personal AI assistant at all-in 888KiB

    zclaw is a highly compact personal AI assistant framework designed to run on constrained embedded hardware such as the ESP32. The project focuses on delivering core assistant capabilities within an extremely small footprint, demonstrating how AI-driven automation can operate on microcontrollers. It includes support for GPIO control, scheduled tasks, memory handling, and other embedded automation features that enable real-world device interaction. The architecture is optimized for efficiency,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    smolagents

    smolagents

    Agents write python code to call tools and orchestrate other agents

    This library is the simplest framework out there to build powerful agents. We provide our definition in this page, where you’ll also find tips for when to use them or not (spoilers: you’ll often be better off without agents). smolagents is a lightweight framework for building AI agents using large language models (LLMs). It simplifies the development of AI-driven applications by providing tools to create, train, and deploy language model-based agents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Compute Library

    Compute Library

    The Compute Library is a set of computer vision and machine learning

    The Compute Library is a set of computer vision and machine learning functions optimized for both Arm CPUs and GPUs using SIMD technologies. The library provides superior performance to other open-source alternatives and immediate support for new Arm® technologies e.g. SVE2.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Eris

    Eris

    A NodeJS Discord library

    A Node.js wrapper for interfacing with Discord. You will need NodeJS 10.4+. If you need voice support you will also need Python 2.7 and a C++ compiler. Create a directory for your bot, and change to that directory in your command line. If you want to be more updated (at the expense of stability), you can install the beta builds instead. Eris supports a few optional libraries that could potentially improve bot performance but may require additional dependencies. If you want Eris to use any of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    GLM-4.6V

    GLM-4.6V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    shimmy

    shimmy

    Python-free Rust inference server

    The shimmy project is a lightweight local inference server designed to run large language models with minimal overhead. Written primarily in Rust, the tool provides a small standalone binary that exposes an API compatible with the OpenAI interface, allowing existing applications to interact with local models without significant code changes. This compatibility enables developers to replace remote AI services with locally hosted models while keeping their existing software architecture...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    OmniBox

    OmniBox

    Collect, organize, use, and share, all in OmniBox

    Omnibox (mirror) is a SourceForge mirror of the Omnibox open-source project, which provides a software interface designed to simplify interaction with multiple tools and services through a unified command or search interface. The project focuses on creating a centralized input field where users can enter commands, queries, or shortcuts that trigger actions across different applications or services. Inspired by the omnibox concept used in modern browsers, the system combines search...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    mistral.rs

    mistral.rs

    Fast, flexible LLM inference

    mistral.rs is a fast and flexible LLM inference engine implemented in Rust, designed to run and serve modern language models with an emphasis on performance and practical deployment. It provides multiple entry points for developers, including a CLI for running models locally and an HTTP server that exposes an OpenAI-compatible API surface for easy integration with existing clients. The project includes hardware-aware tooling that can benchmark a system and choose sensible quantization and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Agent Lightning

    Agent Lightning

    The absolute trainer to light up AI agents

    Agent Lightning is an open-source framework developed by Microsoft to train and optimize AI agents using techniques like reinforcement learning (RL), supervised fine-tuning, and automatic prompt optimization, with minimal or zero changes to existing agent code. It’s designed to be compatible with a wide range of agent architectures and frameworks — from LangChain and OpenAI Agent SDKs to AutoGen and custom Python agents — making it broadly applicable across different agent tooling...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    MCP Shrimp Task Manager

    MCP Shrimp Task Manager

    Shrimp Task Manager is a task tool built for AI Agents

    Shrimp Task Manager is an MCP server that converts natural-language requests into structured development tasks with dependencies, status, and style/format rules—built for agents that reason step-by-step. It emphasizes chain-of-thought and reflection loops, allowing an assistant to plan, refine, and re-prioritize work like a human project assistant. The server exposes typed tools so clients can create tasks, link prerequisites, record progress, and enforce writing or coding standards for...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    MobileLLM

    MobileLLM

    MobileLLM Optimizing Sub-billion Parameter Language Models

    MobileLLM is a lightweight large language model (LLM) framework developed by Facebook Research, optimized for on-device deployment where computational and memory efficiency are critical. Introduced in the ICML 2024 paper “MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases”, it focuses on delivering strong reasoning and generalization capabilities in models under one billion parameters. The framework integrates several architectural innovations—SwiGLU...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    runprompt

    runprompt

    Run LLM prompts from your shell

    ...Designed to be cross-platform, RunPrompt works with standard shells on Windows, macOS, and Linux while honoring the user’s preferred environment and configurations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    LLM From Scratch

    LLM From Scratch

    Build and train a GPT-style language model

    LLM From Scratch is a hands-on educational workshop project that teaches developers how to build and train a GPT-style language model entirely from scratch using PyTorch. Instead of relying on high-level abstractions or prebuilt frameworks, the project walks users through implementing every core component manually, including tokenization, transformer architecture, training loops, and autoregressive text generation. The repository is intentionally simplified to focus on conceptual clarity,...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB