Showing 95 open source projects for "compact"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    LongCat-Image

    LongCat-Image

    Foundation model for image generation

    ...Rather than relying on massive parameter counts typical of many cutting-edge models, LongCat-Image achieves strong photorealism, stable structure, and accurate bilingual (Chinese and English) text rendering with a more compact ~6-billion parameter architecture, making it competitive with much larger alternatives despite its relatively lean design. The model excels at both text-to-image generation and instruction-guided image editing, offering users versatile capabilities for creative and practical tasks—whether generating art, mockups, or adjusting existing visuals with fine control.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    litlyx

    litlyx

    Analytics for developers, setup Analytics in 30 seconds

    ...This system is designed to provide a more thorough and eco-friendly way to clean glasses, lenses, and screens. The brand emphasizes sustainability by reducing single-use plastics and promoting long-term use of their products. Their cleaning kit is compact, portable, and designed to be effective for everyday use, ensuring that users can maintain clear vision without the hassle of disposable wipes or sprays.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Kokoro

    Kokoro

    An inference library for Kokoro-82M

    Kokoro is an open-weight text-to-speech model and inference library built around the lightweight Kokoro-82M model. It is designed to generate high-quality speech from text while staying fast, compact, and cost-efficient compared with larger TTS systems. The project is useful for developers who want deployable speech synthesis without depending on a closed platform. It can be installed as a Python package and used in applications, scripts, experiments, or production workflows. Kokoro’s Apache-licensed weights make it flexible for personal, research, and commercial use cases. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    zclaw

    zclaw

    Your personal AI assistant at all-in 888KiB

    zclaw is a highly compact personal AI assistant framework designed to run on constrained embedded hardware such as the ESP32. The project focuses on delivering core assistant capabilities within an extremely small footprint, demonstrating how AI-driven automation can operate on microcontrollers. It includes support for GPIO control, scheduled tasks, memory handling, and other embedded automation features that enable real-world device interaction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    SmallCode

    SmallCode

    AI coding agent optimized for small LLMs. 87% benchmark

    ...Its workflow is built around terminal usage, which makes it suitable for developers who prefer command-line control and local project context. smallcode emphasizes efficient agent behavior, careful tool use, and benchmark-driven improvements for constrained models. Its main value is giving developers a compact coding-agent environment that treats small local models as first-class tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Karpathy Skills

    Karpathy Skills

    A single CLAUDE.md file to improve Claude Code behavior

    Karpathy Skills is a compact guidance project built around a single CLAUDE.md file for improving AI coding agent behavior. It translates observations about common LLM coding failures into practical working rules for Claude Code and compatible workflows. The project focuses on reducing wrong assumptions, hidden confusion, overengineered solutions, and unnecessary edits outside the requested scope.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    LLM From Scratch

    LLM From Scratch

    Build and train a GPT-style language model

    ...Instead of relying on high-level abstractions or prebuilt frameworks, the project walks users through implementing every core component manually, including tokenization, transformer architecture, training loops, and autoregressive text generation. The repository is intentionally simplified to focus on conceptual clarity, using a compact model of roughly 10 million parameters that can train on consumer hardware such as laptops within a relatively short time. Inspired by Andrej Karpathy’s nanoGPT, the project emphasizes learning through direct implementation and experimentation rather than black-box usage. The workshop documentation explains concepts such as self-attention, embeddings, gradient clipping, optimizer scheduling, and decoding strategies in a practical and approachable way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Porcupine

    Porcupine

    On-device wake word detection powered by deep learning

    ...Porcupine is a highly-accurate and lightweight wake word engine. It enables building always-listening voice-enabled applications. It is using deep neural networks trained in real-world environments. Compact and computationally-efficient. It is perfect for IoT. Cross-platform. Arm Cortex-M, STM32, PSoC, Arduino, and i.MX RT. Raspberry Pi, NVIDIA Jetson Nano, and BeagleBone. Android and iOS. Chrome, Safari, Firefox, and Edge. Linux (x86_64), macOS (x86_64, arm64), and Windows (x86_64). Scalable. It can detect multiple always-listening voice commands with no added runtime footprint. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Nano-vLLM

    Nano-vLLM

    A lightweight vLLM implementation built from scratch

    ...The project recreates the core functionality of vLLM in a simplified architecture written in approximately a thousand lines of Python, making it easier for developers and researchers to understand how modern LLM inference systems work. Despite its compact design, nano-vllm incorporates advanced optimization techniques such as prefix caching, tensor parallelism, and CUDA graph execution to achieve high performance during model inference. The engine is intended primarily for educational use, experimentation, and lightweight deployments where a full production-grade inference stack may be unnecessary. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    Cactus Needle

    Cactus Needle

    26m function call model that runs on incredibly small devices

    Needle is an experimental 26-million-parameter function-calling model designed to run on extremely small devices such as phones, watches, glasses, and low-power personal AI hardware. It is based on a Simple Attention Network architecture and was distilled from a much larger model to focus on fast, compact tool-use behavior. The project provides open weights, training details, dataset generation resources, and a playground for testing the model with custom tools. Needle is optimized for single-shot function calling rather than broad conversational ability, so its core use case is selecting the right tool and producing structured arguments. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Ollama RAG Chatbot

    Ollama RAG Chatbot

    Chat with multiple PDFs locally

    ...The main value of the project is its ability to process multiple PDF inputs and turn them into a question-answering workflow centered on document retrieval. With Docker support, script-based setup, optional ngrok exposure, and a clear local run path, it serves as a compact starter project for people who want a hands-on, self-hosted PDF chat system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    ReMe

    ReMe

    Memory Management Kit for Agents

    ReMe is a memory management kit for AI agents that gives them structured, persistent memory capabilities, enabling agents to extract, store, and reuse information across sessions, tasks, and interactions. It is designed to support long-running agent workflows where context matters and working memory alone isn’t enough, helping agents remember user preferences, task histories, and relevant past observations. The toolkit provides APIs to offload large, ephemeral outputs to external storage and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Monty

    Monty

    A minimal, secure Python interpreter written in Rust for use by AI

    ...It prioritizes guardrails like resource limits and restricted capabilities, which is especially useful for agentic workflows that need to execute small pieces of Python for data transforms, validation, or tool-like computations. Because it’s written in Rust, it’s positioned to deliver a compact, portable runtime that can be embedded into larger systems that need dependable isolation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TONL

    TONL

    TONL (Token-Optimized Notation Language)

    TONL is a cutting-edge data platform built around a production-ready serialization format designed to be both compact and powerful, combining human readability with performance features that make it suitable for large-scale applications and AI workflows. It provides a serialization format that significantly reduces token usage compared with traditional JSON, which can result in lower costs and more efficient prompt size utilization in LLM-driven systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    HY-MT

    HY-MT

    Hunyuan Translation Model Version 1.5

    HY-MT (Hunyuan Translation) is a high-quality multilingual machine translation model suite developed to support mutual translation across dozens of languages with strong performance even at smaller model scales. It ships with both an 1.8 B parameter model and a larger 7 B model, the latter optimized not only for direct translation but also for formatted and contextualized output, allowing better handling of terminology and mixed-language content. The project emphasizes both speed and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    nanocode

    nanocode

    Minimal Claude Code alternative. Single Python file, zero dependencies

    nanocode is a minimalist coding agent implementation designed as a compact alternative to Claude Code, packaged in a single Python file with no external dependencies and totaling around 250 lines of code. It implements a full agentic loop where the model can reason, decide when to use tools, execute those tools, and iterate until producing a final answer, making it useful for simple AI-assisted coding workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Magika

    Magika

    Fast and accurate AI powered file content types detection

    Magika is an AI-powered file-type detector that uses a compact deep-learning model to classify binary and textual files with high accuracy and very low latency. The model is engineered to be only a few megabytes and to run quickly even on CPU-only systems, making it practical for desktop apps, servers, and security pipelines. Magika ships as a command-line tool and a library, providing drop-in detection that improves on traditional “magic number” and heuristic approaches, especially for ambiguous or short files. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Solon

    Solon

    Java enterprise application development framework

    Solon is a full-scenario Java enterprise application framework that positions itself as a lean, high-performance alternative to heavy stacks. It advertises large concurrency gains, lower memory use, much faster startup, and dramatically smaller packages while remaining compatible from Java 8 through Java 24. The framework focuses on restrained APIs and an open ecosystem, with modules that cover web, data, cloud, and microservice patterns. Its messaging emphasizes “replaceable Spring”...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    CLI Printing Press

    CLI Printing Press

    Reads official API docs, studies CLI and MCP servers

    ...Instead of only wrapping endpoints, it studies the API, competing tools, useful workflows, authentication behavior, and hidden data opportunities before producing a more opinionated CLI. The generated tools are designed for AI agents first, with SQLite sync, offline search, structured output, compact modes, typed exit codes, and compound insight commands. It supports multiple entry paths, including direct specs, URLs without documentation, and HAR imports from browser developer tools. The project includes validation stages such as scorecards, dogfooding, proof-of-behavior checks, and optional live read-only smoke tests. Overall, CLI Printing Press aims to turn any useful web service into a practical automation surface for both humans and AI agents.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    LEANN

    LEANN

    Local RAG engine for private multimodal knowledge search on devices

    ...LEANN introduces a storage-efficient approximate nearest neighbor index combined with on-the-fly embedding recomputation to avoid storing large embedding vectors. By recomputing embeddings during queries and using compact graph-based indexing structures, LEANN can maintain high search accuracy while minimizing disk usage. It aims to act as a unified personal knowledge layer that connects different types of data such as documents, code, images, and other local files into a searchable context for language models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Gitingest

    Gitingest

    Create prompt-friendly codebase digests from any Git repository URL

    ...It analyzes a repository and produces a consolidated textual representation that includes the file structure and code content in an organized format. This makes it easier to provide meaningful code context when working with AI systems that require compact, readable inputs. Developers can generate these digests from either a local directory or a remote repository by supplying a repository path or URL. The generated output is optimized for prompt usage, helping AI models understand codebases more effectively without requiring manual file aggregation. In addition to producing the code digest, Gitingest also calculates statistics about the extracted content such as repository structure, total size of the extract, and token count. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Machine learning algorithms

    Machine learning algorithms

    Minimal and clean examples of machine learning algorithms

    ...The repository includes implementations of both supervised and unsupervised learning techniques, along with dimensionality reduction and clustering methods. Many of the algorithms are written in a simplified style that prioritizes clarity and educational value over production-level optimization. Because the code is compact and easy to follow, it is often used as a learning resource by developers who want to understand how machine learning algorithms are constructed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Instant Neural Graphics Primitives

    Instant Neural Graphics Primitives

    Instant neural graphics primitives: lightning fast NeRF and more

    ...The system implements several neural graphics primitives including neural radiance fields, signed distance functions, neural images, and neural volumes. These representations are trained using a compact neural network combined with a multiresolution hash encoding that dramatically accelerates both training and rendering processes. The framework is capable of reconstructing detailed 3D scenes from images and generating realistic views of those scenes in real time. Compared with earlier neural radiance field approaches, instant-ngp significantly reduces training time and computational requirements, enabling models to be trained within seconds or minutes on modern GPUs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Chat with LLMs Everywhere

    Chat with LLMs Everywhere

    Run PyTorch LLMs locally on servers, desktop and mobile

    TorchChat is an open-source project from the PyTorch ecosystem designed to demonstrate how large language models can be executed efficiently across different computing environments. The project provides a compact codebase that illustrates how to run conversational AI systems using PyTorch models on laptops, servers, and mobile devices. It is intended primarily as a reference implementation that shows developers how to integrate large language models into applications without requiring a large or complex infrastructure stack. TorchChat supports running models through Python interfaces as well as integrating them directly into native applications written in languages such as C or C++. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Vision Transformer Pytorch

    Vision Transformer Pytorch

    Implementation of Vision Transformer, a simple way to achieve SOTA

    ...It breaks down the model into patch embedding, positional encoding, multi-head self-attention, feed-forward blocks, and a classification head so you can understand each component in isolation. The code is intentionally compact and modular, which makes it easy to tinker with hyperparameters, depth, width, and attention dimensions. Because it stays close to vanilla PyTorch, you can integrate custom datasets and training loops without framework lock-in. It’s widely used as an educational reference for people learning transformers in vision and as a lightweight baseline for research prototypes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo