Showing 216 open source projects for "token"

  • 1
    VideoRAG

    VideoRAG: Chat with Your Videos

    ...When a user query is received, VideoRAG locates semantically relevant moments in the video using the embedding index, retrieves associated clips or transcripts, and feeds them to a generative model to produce accurate, grounded answers or summaries. This approach allows it to handle videos of arbitrary length without requiring the entire content to be passed into the model at once, overcoming token limits and enabling detailed, context-aware interaction.
    Downloads: 1 This Week
  • 2
    tiny-llm

    A course on LLM inference serving on Apple Silicon

    ...The project is structured as a guided course that walks developers through the process of implementing the core components required to run a modern language model, including attention mechanisms, token generation, and optimization techniques. Rather than relying on high-level machine learning frameworks, the codebase uses mostly low-level array and matrix manipulation APIs so that developers can understand exactly how model inference works internally. The project demonstrates how to load and run models such as Qwen-style architectures while progressively implementing performance improvements like KV caching, request batching, and optimized attention mechanisms. ...
    Downloads: 5 This Week
  • 3
    Generative AI Swift

    This SDK is now deprecated, use the unified Firebase SDK

    deprecated-generative-ai-swift is a Swift client and example scaffold for building generative AI apps using the Gemini models. Although marked “deprecated”, the repo demonstrates how to integrate Gemini inference into iOS and macOS apps via Swift APIs, providing boilerplate for prompt dispatching, streaming responses, UI integration, and error handling. It includes a sample app that showcases a chat interface, where users send messages and receive responses streamed in real time, with UI...
    Downloads: 5 This Week
  • 4
    Koog

    Koog is the official Kotlin framework for building AI agents

    ...It features a pure Kotlin implementation, seamless Model Context Protocol (MCP) integration for enhanced model management, vector embeddings for semantic search, and a flexible system for creating and extending tools that access external systems and APIs. Ready‑to‑use components address common AI engineering challenges, while intelligent history compression optimizes token usage and preserves context. A powerful streaming API enables real‑time response processing and parallel tool calls. Persistent memory allows agents to retain knowledge across sessions and between agents, and comprehensive tracing facilities provide detailed debugging and monitoring.
    Downloads: 5 This Week
  • 5
    Cleanlab

    The standard data-centric AI package for data quality and ML

    cleanlab helps you clean data and labels by automatically detecting issues in an ML dataset. To facilitate machine learning with messy, real-world data, this data-centric AI package uses your existing models to estimate dataset problems that can be fixed to train even better models. cleanlab cleans your data's labels via state-of-the-art confident learning algorithms, published in this paper and blog. See some of the datasets cleaned with cleanlab at labelerrors.com. This package helps you...
    Downloads: 5 This Week
  • 6
    Tokenizers

    Fast State-of-the-Art Tokenizers optimized for Research and Production

    ...Designed for both research and production. Full alignment tracking: even with destructive normalization, it’s always possible to get the part of the original sentence that corresponds to any token. Handles all the pre-processing: truncation, padding, and adding the special tokens your model needs.
    Downloads: 2 This Week
  • 7
    BotKube

    An app that helps you monitor your Kubernetes cluster

    BotKube is a messaging bot for monitoring and debugging Kubernetes clusters. It's built and maintained by InfraCloud. BotKube integrates with messaging platforms such as Slack, Mattermost, and Microsoft Teams to help you monitor your Kubernetes cluster(s), debug critical deployments, and get recommendations for standard practices by running checks on Kubernetes resources. BotKube watches Kubernetes resources and sends a notification to the channel if any event occurs for...
    Downloads: 5 This Week
  • 8
    Code2Prompt

    Convert codebases into structured prompts optimized for LLM analysis

    code2prompt is an open source command line tool designed to convert an entire codebase into a structured prompt that can be easily used with large language models. It analyzes a project directory, gathers relevant source files, and formats them into a single prompt that includes the source tree and code content. This approach helps developers quickly provide full project context to AI models without manually copying files or assembling prompts. code2prompt is built in Rust and focuses on...
    Downloads: 3 This Week
  • 9
    Claude Code Video Vision

    Give Claude the ability to watch and understand videos

    ...The system dynamically adapts how much data it extracts based on the user’s query, adjusting frame rate, resolution, and time windows to optimize both performance and token efficiency. It supports multiple backends for audio processing, including local and cloud-based options, enabling flexible deployment depending on privacy or performance requirements.
    Downloads: 4 This Week
  • 10
    Model Context Protocol (MCP) Servers

    Model Context Protocol Servers

    The Model Context Protocol (MCP) is an open protocol that enables seamless integration between LLM applications and external data sources and tools. Whether you’re building an AI-powered IDE, enhancing a chat interface, or creating custom AI workflows, MCP provides a standardized way to connect LLMs with the context they need.
    Downloads: 0 This Week
  • 11
    Qwen Code

    Qwen Code is a coding agent that lives in the digital world

    Qwen Code is a command-line AI workflow tool designed to enhance developer productivity by leveraging the power of Qwen3-Coder models. Adapted from the Google Gemini CLI, it features an enhanced parser optimized specifically for Qwen-Coder models, enabling deep code understanding and manipulation. The tool supports querying and editing large codebases beyond traditional context limits, making it ideal for modern, complex projects. Qwen Code automates various development workflows, including...
    Downloads: 9 This Week
  • 12
    LightLLM

    LightLLM is a Python-based LLM (Large Language Model) inference

    ...Built primarily in Python, the project integrates optimization techniques and ideas from several leading open-source implementations, including FasterTransformer, vLLM, and FlashAttention, to accelerate token generation and reduce latency. LightLLM is designed to handle large-scale model workloads in production environments, supporting efficient batching and GPU utilization for fast inference across multiple requests. Its architecture allows models to be deployed with minimal overhead while maintaining compatibility with popular transformer-based model families such as LLaMA and GPT-style architectures.
    Downloads: 2 This Week
  • 13
    SafeClaw

    Chat with it via text and voice

    ...It emphasizes privacy and predictability by using traditional programming, rule-based intent parsing, and established machine learning tools rather than large language models, so there are no per-token API costs and its behavior is deterministic. The assistant offers features such as voice control using fully local speech-to-text (Whisper) and text-to-speech (Piper) capabilities, news aggregation with extractive summarization, and smart home or Bluetooth device control. SafeClaw supports multiple channels, including CLI and Telegram, and avoids prompt injection risk because it doesn’t rely on LLMs for core operations.
    Downloads: 2 This Week
  • 14
    Gemini CLI

    Open source AI agent CLI tool to bring Gemini into your terminal

    Gemini CLI is an open‑source AI agent that brings the capabilities of Google’s Gemini 2.5 Pro large‑language model directly into your terminal, enabling tasks ranging from coding and debugging to content creation and research via natural‑language prompts, with support for multimodal outputs like image and video generation. Gemini CLI integrates with external tools and MCP servers, enabling media generation and enhanced workflow automation. It also includes a built-in Google Search tool to...
    Downloads: 11 This Week
  • 15
    TokenSpeed

    TokenSpeed is a speed-of-light LLM inference engine

    ...TokenSpeed is useful for developers building local or server-side LLM infrastructure for agents, coding systems, and high-volume AI applications. Its main value is providing an inference layer optimized for fast token generation under practical agent workloads.
    Downloads: 1 This Week
  • 16
    Transformer Debugger

    Tool for exploring and debugging transformer model behaviors

    ...TDB allows users to intervene directly in the forward pass of a model and observe how such interventions change predictions, making it possible to answer questions like why a token was selected or why an attention head focused on a certain input. It automatically identifies and explains the most influential components, highlights activation patterns, and maps relationships across circuits within the model. The tool includes both a React-based neuron viewer for exploring model components and a backend activation server for running inferences and serving data.
    Downloads: 1 This Week
  • 17
    GPT Researcher

    LLM-based autonomous agent that does comprehensive online research

    Say Hello to GPT Researcher, your AI agent for rapid insights and comprehensive research. GPT Researcher is the leading autonomous agent that takes care of everything from accurate source gathering to organization of research results.
    Downloads: 1 This Week
  • 18
    InvokeAI

    InvokeAI is a leading creative engine for Stable Diffusion models

    InvokeAI is an implementation of Stable Diffusion, the open source text-to-image and image-to-image generator. It provides a streamlined process with various new features and options to aid the image generation process. It runs on Windows, Mac and Linux machines, and runs on GPU cards with as little as 4 GB of RAM. InvokeAI is a leading creative engine built to empower professionals and enthusiasts alike. Generate and create stunning visual media using the latest AI-driven technologies....
    Downloads: 10 This Week
  • 19
    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of...
    Downloads: 4 This Week
  • 20
    API-for-Open-LLM

    OpenAI-style API for open large language models

    API-for-Open-LLM is a lightweight API server designed for deploying and serving open large language models (LLMs), offering a simple way to integrate LLMs into applications.
    Downloads: 0 This Week
  • 21
    LangDB AI Gateway

    Govern, secure, and optimize your AI traffic

    AI Gateway is a high-performance, open-source API gateway optimized for managing and monitoring LLM traffic at scale. Developed by the LangDB team, AI Gateway acts as an intermediary between clients and backend LLMs, providing advanced features like caching, rate limiting, prompt management, and observability. It helps teams secure and optimize their LLM deployments, whether using local models or external APIs like OpenAI or Anthropic. With native support for multi-tenant environments and...
    Downloads: 0 This Week
  • 22
    DeepSeekMath-V2

    Towards self-verifiable mathematical reasoning

    DeepSeekMath-V2 is a large-scale open-source AI model designed specifically for advanced mathematical reasoning, theorem proving, and rigorous proof verification. It’s built by DeepSeek as a successor to their earlier math-specialist models. Unlike general-purpose LLMs that might generate plausible-looking math but sometimes hallucinate or mishandle rigorous logic, Math-V2 is engineered to not only generate solutions but also self-verify them, meaning it examines the derivations, checks...
    Downloads: 4 This Week
  • 23
    deepclaude

    Use Claude Code's agent loop with DeepSeek V4 Pro, OpenRouter & more

    ...It preserves the full Claude Code experience—including file editing, terminal execution, and multi-step agent workflows—while dramatically reducing operational costs. By swapping out the underlying model instead of the interface, deepclaude delivers the same familiar UX with significantly cheaper token pricing. The platform supports seamless backend switching in real time, allowing users to choose between cost efficiency and higher reasoning power when needed. It also includes built-in cost tracking and benchmarking tools to help developers monitor usage and optimize performance. Designed for flexibility and efficiency, deepclaude is ideal for developers who want powerful AI coding agents without the premium price tag.
    Downloads: 2 This Week
  • 24
    KIS Open API

    Korea Investment & Securities Open API GitHub

    The open-trading-api repository from Korea Investment & Securities provides sample code and developer resources for interacting with the KIS Developers Open Trading API, which enables programmatic access to financial market data and automated trading functionality. The project is designed primarily for Python developers and AI automation environments that want to build investment applications, algorithmic trading systems, or financial analytics tools using the brokerage’s infrastructure. It...
    Downloads: 0 This Week
  • 25
    repo2txt

    Web-based tool converts GitHub repository contents

    repo2txt is an open-source developer tool that converts the contents of a code repository into a single structured text file that can be easily consumed by large language models. The tool is designed to address the challenge of analyzing entire codebases with AI assistants, where code is normally distributed across many files and directories. By collecting repository contents and formatting them into a single text document, repo2txt allows developers to feed complete projects into AI systems...
    Downloads: 0 This Week