855 projects for "tiny-core-plus" with 2 filters applied:

  • 1
    tiny-llm

    A course on LLM inference serving on Apple Silicon

    tiny-llm is an educational open-source project designed to teach system engineers how large language model inference and serving systems work by building them from scratch. The project is structured as a guided course that walks developers through the process of implementing the core components required to run a modern language model, including attention mechanisms, token generation, and optimization techniques.
    Downloads: 0 This Week
  • 2
    Core AI Models

    Model export recipes, Python primitives, and Swift runtime utilities

    Core AI Models is Apple’s repository for building and running on-device AI models with Core AI. It provides export recipes that convert supported open-source models into Core AI model files. It also includes Python primitives for authoring custom PyTorch models that are better suited for Apple platform deployment. The Swift package adds runtime utilities that help developers integrate exported models into macOS and iOS apps.
    Downloads: 1 This Week
  • 3
    SGR Agent Core

    Schema-Guided Reasoning (SGR) has agentic system design

    SGR Agent Core is an open-source framework for building intelligent AI research agents based on a methodology known as Schema-Guided Reasoning (SGR). The framework provides a core library that allows developers to design autonomous agents capable of structured reasoning and complex task execution. Instead of relying solely on free-form prompts, the system organizes reasoning processes around schemas that guide how agents analyze problems, gather information, and generate outputs. ...
    Downloads: 0 This Week
  • 4
    indie-hacker-tools-plus

    A selection of technology stacks and tool repositories

    Indie Hacker Tools Plus is an open-source curated repository designed to help independent developers discover practical tools, frameworks, and technology stacks for building and launching software products. The project acts as a structured collection of resources commonly used by indie hackers, including development frameworks, AI tools, deployment services, and marketing utilities.
    Downloads: 1 This Week
  • 5
    TinyClaw

    The original Tiny Claw as your personal autonomous AI companion

    TinyClaw is an open-source autonomous AI companion framework designed to make personal AI agents simpler, cheaper to run, and more accessible to individual users. The project is built from scratch with a deliberately small native core and a modular plugin architecture that allows capabilities to expand without turning the system into a heavy monolith. Its philosophy centers on creating a persistent AI companion that behaves more like a helpful digital partner than a purely configurable assistant. TinyClaw incorporates self-improving memory and smart routing mechanisms intended to reduce large language model costs by tiering queries intelligently. ...
    Downloads: 0 This Week
  • 6
    Cactus Needle

    26M-parameter function-calling model that runs on incredibly small devices

    ...The project provides open weights, training details, dataset generation resources, and a playground for testing the model with custom tools. Needle is optimized for single-shot function calling rather than broad conversational ability, so its core use case is selecting the right tool and producing structured arguments. It can be fine-tuned locally, including on consumer machines, which makes it useful for experimentation with small personalized agents. The project is best suited for researchers and developers exploring tiny AI models, edge inference, and lightweight tool-calling systems.
    Downloads: 1 This Week
  • 7
    COCOON

    Confidential Compute Open Network, Decentralized AI Inference on TON

    COCOON (Confidential Compute Open Network) is a decentralized AI inference network built on TON by the developers of Telegram. GPU owners contribute compute and run model inference inside confidential-computing environments, so that prompts and outputs stay hidden from the hardware operator, while applications pay for inference through the TON blockchain. ...
    Downloads: 0 This Week
  • 8
    Learn Claude Code

    Bash is all you need: write a Claude Code-style agent in only 16 lines of code

    Learn Claude Code is an educational repository that teaches how modern AI coding agents work by walking learners through a sequence of progressively more complex agent implementations, starting with a minimal Bash-based agent and culminating in agents with explicit planning, subagents, and skills. It emphasizes a hands-on learning path where each version (from v0 to v4) adds conceptual building blocks like the core agent loop, todo planning, task decomposition, and domain knowledge skills, illuminating the patterns behind what makes a true AI agent tick. The goal is to demystify agent architectures like Claude Code by having learners build simplified versions themselves and observe how tools, memory management, planning constraints, and context isolation contribute to reliable agent behavior. ...
    Downloads: 66 This Week
  • 9
    LTX-Video

    Official repository for LTX-Video

    ...The toolkit is built with both real-time and offline workflows in mind, enabling applications from consumer editing to professional content creation and batch processing. Internally optimized for multi-core processors and hardware acceleration where available, LTX-Video makes it feasible to work with high-resolution content and complex timelines without sacrificing responsiveness.
    Downloads: 21 This Week
  • 10
    Hindsight

    Hindsight: Agent Memory That Learns

    ...The system operates through three core mechanisms—retain, recall, and reflect—which respectively handle storing information, retrieving relevant context, and generating new insights based on accumulated experience.
    Downloads: 5 This Week
  • 11
    OpenWork

    An open-source alternative to Claude Cowork, powered by opencode

    OpenWork is a framework for building decentralized collaborative work environments powered by AI and human contributions. At its core, the project enables contributors to define tasks, workflows, and goals that can be split, shared, and recombined across distributed nodes while agents and humans cooperate to advance progress. It offers structured templates for work items, decision logic for task allocation, and consensus mechanisms that let groups verify and validate results toward shared objectives. ...
    Downloads: 271 This Week
  • 12
    whisper.cpp

    Port of OpenAI's Whisper model in C/C++

    whisper.cpp is a lightweight C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model, designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The project's quick-start command downloads the base.en model, converted to the custom ggml format, and runs inference on all .wav samples in the samples folder....
    Downloads: 510 This Week
  • 13
    bitnet.cpp

    Official inference framework for 1-bit LLMs

    bitnet.cpp is the official open-source inference framework and ecosystem designed to enable ultra-efficient execution of 1-bit large language models (LLMs), which quantize most model parameters to ternary values (-1, 0, +1) while maintaining competitive performance with full-precision counterparts. At its core is bitnet.cpp, a highly optimized C++ backend that supports fast, low-memory inference on both CPUs and GPUs, enabling models such as BitNet b1.58 to run without requiring enormous compute infrastructure. The project’s focus on extreme quantization dramatically reduces memory footprint and energy consumption compared with traditional 16-bit or 32-bit LLMs, making it practical to deploy advanced language understanding and generation models on everyday machines. ...
    Downloads: 3 This Week
  • 14
    tt-metal

    TT-NN operator library, and TT-Metalium low level kernel programming

    tt-metal, also referred to in its documentation as TT-Metalium, is Tenstorrent’s low-level software development kit for programming applications on Tenstorrent AI accelerators. The project is designed for developers who need direct access to the company’s Tensix processor architecture, exposing a programming model that is closer to hardware control than high-level inference frameworks. Instead of following a traditional GPU model centered on massive thread parallelism, the platform is built...
    Downloads: 2 This Week
  • 15
    AI Engineering from Scratch

    Learn it. Build it. Ship it for others

    ...The project is structured into more than 20 phases and hundreds of lessons, covering topics that range from foundational mathematics to advanced systems such as large language models, retrieval pipelines, and multi-agent architectures. Each lesson emphasizes hands-on implementation, requiring learners to write core components such as backpropagation, tokenizers, and attention mechanisms themselves before using higher-level tools. The curriculum spans multiple programming languages, including Python, TypeScript, Rust, and Julia, which broadens the learner’s exposure to different ecosystems and performance considerations. It also focuses on producing tangible outputs such as prompts, agents, and reusable systems, allowing learners to build a real portfolio while studying.
    Downloads: 5 This Week
  • 16
    llama.cpp

    Port of Facebook's LLaMA model in C/C++

    The llama.cpp project enables the inference of Meta's LLaMA model (and other models) in pure C/C++ without requiring a Python runtime. It is designed for efficient and fast model execution, offering easy integration for applications needing LLM-based capabilities. The repository focuses on providing a highly optimized and portable implementation for running large language models directly within C/C++ environments.
    Downloads: 162 This Week
  • 17
    GPT4All

    Run Local LLMs on Any Device. Open-source

    GPT4All is an open-source project that allows users to run large language models (LLMs) locally on their desktops or laptops, eliminating the need for API calls or GPUs. The software provides a simple, user-friendly application that can be downloaded and run on various platforms, including Windows, macOS, and Ubuntu, without requiring specialized hardware. It integrates with the llama.cpp implementation and supports multiple LLMs, allowing users to interact with AI models privately. This...
    Downloads: 131 This Week
  • 18
    VibeTensor

    Our first fully AI generated deep learning system

    VibeTensor is a groundbreaking open-source research system software stack for deep learning that was uniquely generated almost entirely by AI coding agents under guided human supervision, demonstrating a new frontier in AI-assisted software engineering. It implements a PyTorch-style eager tensor library with a modern C++20 core that supports both CPU and CUDA backends, giving it the ability to manage tensors, automatic differentiation (autograd), and complex computation flows similar to mainstream frameworks. What makes VibeTensor remarkable is that every major component, from core libraries and dispatch systems to CUDA runtime support, caching allocators, and language bindings, was created and validated by coding agents using automated builds and tests rather than manual line-by-line human coding. ...
    Downloads: 0 This Week
  • 19
    NeuralNote

    Audio Plugin for Audio to MIDI transcription using deep learning

    NeuralNote is an open-source audio software tool designed to convert recorded audio into MIDI data using modern machine learning techniques. The software functions as an audio plugin that can be used inside digital audio workstations as well as a standalone application for music production and analysis. Its main purpose is to perform audio-to-MIDI transcription, allowing musicians to record a performance and automatically transform it into editable MIDI notes. NeuralNote supports polyphonic...
    Downloads: 126 This Week
  • 20
    Universal Tool Calling Protocol (UTCP)

    Official Python implementation of UTCP, an open standard

    ...UTCP is an open, modern standard designed to let AI agents call any tool or API directly—over HTTP, CLI, WebSocket, gRPC, and more—without the overhead of extra wrapper layers or middleware. It leverages a modular, plugin-based architecture built around Pydantic models and separates the core functionality into a lightweight client and extensible protocol plugins, enabling secure, scalable, and low-latency direct tool calls. A pluggable architecture allows developers to easily add new communication protocols, tool storage mechanisms, and search strategies without modifying the core library.
    Downloads: 0 This Week
  • 21
    Hands-On Large Language Models

    Official code repo for the O'Reilly Book

    Hands-On-Large-Language-Models is the official GitHub code repository accompanying the practical technical book Hands-On Large Language Models authored by Jay Alammar and Maarten Grootendorst, providing a comprehensive collection of example notebooks, code labs, and supporting materials that illustrate the core concepts and real-world applications of large language models. The repository is structured into chapters that align with the educational progression of the book — covering everything from foundational topics like tokens, embeddings, and transformer architecture to advanced techniques such as prompt engineering, semantic search, retrieval-augmented generation (RAG), multimodal LLMs, and fine-tuning. ...
    Downloads: 45 This Week
  • 22
    COLMAP

    Structure-from-Motion and Multi-View Stereo

    COLMAP is a general-purpose Structure-from-Motion (SfM) and Multi-View Stereo (MVS) pipeline with a graphical and command-line interface. It offers a wide range of features for the reconstruction of ordered and unordered image collections. The software is licensed under the new BSD license.
    Downloads: 31 This Week
  • 23
    TorchCode

    Practice implementing softmax, attention, GPT-2 and more

    TorchCode is an interactive learning and practice platform designed to help developers master PyTorch by implementing core machine learning operations and architectures from scratch. It is structured similarly to competitive programming platforms like LeetCode but focuses specifically on tensor operations and deep learning concepts. The platform provides a collection of curated problems that cover fundamental topics such as activation functions, normalization layers, attention mechanisms, and full transformer architectures. ...
    Downloads: 0 This Week
  • 24
    Writing AI Conference Papers

    Writing AI Conference Papers: A Handbook for Beginners

    WritingAIPaper is an open-source guide designed to help beginners understand and navigate the process of writing and publishing academic papers in the field of artificial intelligence. The project provides structured guidance on how to transform research ideas into complete manuscripts, covering topics such as defining the core contribution, organizing the paper structure, and refining technical details. It emphasizes practical advice and common pitfalls, making it especially useful for students or early-career researchers who may struggle with academic writing conventions. The guide breaks down the process into manageable stages, from identifying novelty and contributions to drafting, revising, and preparing submissions for conferences. ...
    Downloads: 0 This Week
  • 25
    kagent

    Kubernetes native framework for building AI agents

    Kagent is a Kubernetes-native framework for building, deploying, and operating AI agents as first-class cloud-native workloads. It models core agent concepts declaratively using Kubernetes custom resources, so teams can manage agents similarly to other platform components via YAML, controllers, and standard cluster workflows. In kagent’s design, an “Agent” represents a system prompt plus a set of tools and other agents, along with an LLM configuration, making the agent definition portable and repeatable across environments. ...
    Downloads: 2 This Week
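
Note on entry 13 (bitnet.cpp): the description mentions quantizing parameters to ternary values (-1, 0, +1). The sketch below shows one common way to do that, scaling by the mean absolute weight and then rounding and clipping. It is only an illustration: the function names `ternary_quantize` and `dequantize` are hypothetical, and nothing here is taken from the bitnet.cpp code base or claims to match its kernels.

```python
def ternary_quantize(weights, eps=1e-8):
    """Map floats to {-1, 0, +1} using an absmean scale (illustrative only)."""
    # Scale is the mean absolute value of the weights; eps avoids division by zero.
    scale = sum(abs(w) for w in weights) / len(weights) + eps
    # Round each scaled weight to the nearest integer, then clip to [-1, 1].
    quantized = [max(-1, min(1, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Approximate the original weights from ternary values and the scale."""
    return [q * scale for q in quantized]


weights = [0.9, -0.05, 0.4, -1.2, 0.0, 0.3]
q, s = ternary_quantize(weights)
approx = dequantize(q, s)
print(q, round(s, 3))
```

Each weight then needs only about 1.58 bits (log2 of 3 states) plus one shared scale per group, which is where the memory savings described in the entry come from.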