Showing 216 open source projects for "token"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    Ralph AI Agent

    Ralph AI Agent

    AI agent loop that runs repeatedly until all PRD items are complete

    ...It provides a reactive loop where agents can repeatedly assess the current context, reason about the next best action using large language models, and execute actions across integrated tools and services. The runtime emphasizes safety boundaries by sandboxing operations, enforcing time and token limits, and isolating execution layers to prevent unpredictable side effects. Ralph also includes a built-in plugin system that lets developers attach custom tools, environment connectors, or monitoring hooks without modifying core logic. Designed for extensibility, the framework supports multi-model providers so agents can switch between models or fall back based on task needs. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    InfiAgent

    InfiAgent

    Build your own Cowork, AI Scientist and other SoTA Agents

    ...Designed as a “Multi-Level Agent” (MLA) system, it externalizes persistent state to the file system so that agents can operate over unlimited runtime without the need for token-intensive context compression, enabling workflows such as research paper drafting, experiments, coding, and document generation to run reliably. The framework uses a serial multi-agent hierarchy where specialized agents coordinate in tree-structured paths for clear task delegation and minimal tool conflicts, while batch file operations and persistent workspaces ensure reproducibility and traceability. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    HunyuanImage-3.0

    HunyuanImage-3.0

    A Powerful Native Multimodal Model for Image Generation

    ...It unifies multimodal understanding and generation in a single autoregressive framework, combining text and image modalities seamlessly rather than relying on separate image-only diffusion components. It uses a Mixture-of-Experts (MoE) architecture with many expert subnetworks to scale efficiently, deploying only a subset of experts per token, which allows large parameter counts without linear inference cost explosion. The model is intended to be competitive with closed-source image generation systems, aiming for high fidelity, prompt adherence, fine detail, and even “world knowledge” reasoning (i.e. leveraging context, semantics, or common sense in generation). The GitHub repo includes code, scripts, model loading instructions, inference utilities, prompt handling, and integration with standard ML tooling (e.g. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    DeepSeek-Reasonix

    DeepSeek-Reasonix

    DeepSeek-native AI coding agent for your terminal

    DeepSeek Reasonix is a DeepSeek-native AI coding agent designed for terminal-based software development. It is built around prefix-cache stability, which helps reduce token costs during long sessions and allows users to leave the agent running across extended workflows. Reasonix includes a coding mode with filesystem and shell tools, a lighter chat mode, one-shot task execution, health checks, session utilities, and project-scoped memory. It supports reviewed SEARCH/REPLACE edits, plan mode, MCP servers, web search, hooks, skills, semantic indexing, transcript replay, event logs, and cost or cache tracking. ...
    Downloads: 45 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    Vectorize MCP Server

    Vectorize MCP Server

    Official Vectorize MCP Server

    The Vectorize MCP Server is a Model Context Protocol server that integrates with Vectorize, offering advanced vector retrieval and text extraction capabilities. ​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    dLLM

    dLLM

    dLLM: Simple Diffusion Language Modeling

    dLLM is an open-source framework designed to simplify the development, training, and evaluation of diffusion-based large language models. Unlike traditional autoregressive models that generate text sequentially token by token, diffusion language models generate text through an iterative denoising process that refines masked tokens over multiple steps. This approach allows models to reason over the entire sequence simultaneously and potentially produce more coherent outputs with bidirectional context. The project provides an integrated pipeline that standardizes how diffusion language models are trained, evaluated, and deployed, helping researchers reproduce experiments and compare results more easily. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DevDocs by CyberAGI

    DevDocs by CyberAGI

    Completely free, private, UI based Tech Documentation MCP server

    ...Instead of sending entire documents to a language model, DevDocs organizes documentation into sections so that only the most relevant portions are retrieved during a query. This approach reduces token usage and improves the accuracy of responses generated by AI coding assistants. The platform is designed to integrate easily with modern developer tools and AI environments such as Cursor, Cline, and Claude-based workflows. It includes a user interface that allows developers to browse documentation repositories and connect them to AI systems while keeping the data private.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    AWS Agent Skills

    AWS Agent Skills

    AWS Skills for Agents

    ...Instead of streaming giant documentation sets or relying on episodic web search, this project compresses AWS best practices, usage patterns, edge cases, and real-world engineering guides into pre-structured skill definitions that are token-efficient and tailored for reasoning. The skills cover critical AWS services such as IAM, Lambda, DynamoDB, S3, API Gateway, EKS, and many more, letting agents offer actionable advice on infrastructure as code, debugging, security configurations, and architectural workflows. Skills are kept up to date with weekly documentation checks, ensuring they reflect current AWS patterns and service changes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    OpenAI Forward

    OpenAI Forward

    An efficient forwarding service designed for LLMs

    OpenAI Forward is an open-source forwarding and reverse proxy service for large language model APIs, designed to sit between client applications and model providers. Its main purpose is to make model access more manageable and efficient by adding operational controls such as request rate limiting, token rate limiting, caching, logging, routing, and key management around existing LLM endpoints. The project can proxy both local and cloud-hosted language model services, which makes it useful for teams that want a single control layer regardless of whether they are using something like LocalAI or a hosted provider compatible with OpenAI-style APIs. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    ProxyPal

    ProxyPal

    A desktop app that lets you use your AI subscriptions

    ...By wrapping a CLI-based proxy API with an intuitive graphical interface, ProxyPal bridges your paid AI services into environments that don’t natively support them, allowing tools like text editors, terminals, and IDE plugins to route requests through a centralized proxy endpoint. This enables developers to use their own AI credentials and models consistently across different clients while monitoring usage, token counts, and performance in real time. ProxyPal detects installed AI clients and partially auto-configures settings, tracks analytics like success rates and response times, and supports multiple OS platforms.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 11
    MiniMax-01

    MiniMax-01

    Large-language-model & vision-language-model based on Linear Attention

    ...MiniMax-Text-01 uses a hybrid attention architecture that blends Lightning Attention, standard softmax attention, and Mixture-of-Experts (MoE) routing to achieve both high throughput and long-context reasoning. It has 456 billion total parameters with 45.9 billion activated per token and is trained with advanced parallel strategies such as LASP+, varlen ring attention, and Expert Tensor Parallelism, enabling a training context of 1 million tokens and up to 4 million tokens at inference. MiniMax-VL-01 extends this core by adding a 303M-parameter Vision Transformer and a two-layer MLP projector in a ViT–MLP–LLM framework, allowing the model to process images at dynamic resolutions up to 2016×2016.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    Tongyi DeepResearch

    Tongyi DeepResearch

    Tongyi Deep Research, the Leading Open-source Deep Research Agent

    ...It’s built to act like a research agent: synthesizing, reasoning, retrieving information via the web and documents, and backing its outputs with evidence. The model is about 30.5 billion parameters in size, though at any given token only ~3.3B parameters are active. It uses a mix of synthetic data generation, fine-tuning and reinforcement learning; supports benchmarks like web search, document understanding, question answering, “agentic” tasks; provides inference tools, evaluation scripts, and “web agent” style interfaces. The aim is to enable more autonomous, agentic models that can perform sustained knowledge gathering, reasoning, and synthesis across multiple modalities (web, files, etc.).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Box MCP Server

    Box MCP Server

    A Box model context protocol server to search, read and access files

    Box MCP Server is a Model Context Protocol (MCP) server designed to facilitate searching, reading, and accessing files stored on the Box cloud storage platform. It enables seamless integration with Box, allowing for efficient file management and retrieval through MCP-compatible clients. ​
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Hermes Desktop

    Hermes Desktop

    Desktop Companion for Hermes Agent

    ...The platform supports multiple AI providers, including OpenAI, Anthropic, Google Gemini, Grok, OpenRouter, and local model endpoints. Its real-time streaming chat interface includes markdown rendering, syntax highlighting, tool progress tracking, and token usage monitoring. Hermes Desktop also integrates messaging platforms, automation features, and customizable personas to create a comprehensive AI productivity environment.
    Downloads: 79 This Week
    Last Update:
    See Project
  • 15
    CodeGraph

    CodeGraph

    Pre-indexed code knowledge graph for Claude Code, Codex, Cursor

    ...It is designed for tools such as Claude Code, Codex, Cursor, and OpenCode, with MCP support for agent access. CodeGraph stores project data locally, which helps reduce token usage and repeated file exploration. Its main purpose is to make agent-assisted coding faster, cheaper, and more context-aware.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 16
    DeepSeek-OCR 2

    DeepSeek-OCR 2

    Visual Causal Flow

    DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 17
    SGLang

    SGLang

    SGLang is a fast serving framework for large language models

    SGLang is a fast serving framework for large language models and vision language models. It makes your interaction with models faster and more controllable by co-designing the backend runtime and frontend language.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 18
    GLM-4.6

    GLM-4.6

    Agentic, Reasoning, and Coding (ARC) foundation models

    GLM-4.6 is the latest iteration of Zhipu AI’s foundation model, delivering significant advancements over GLM-4.5. It introduces an extended 200K token context window, enabling more sophisticated long-context reasoning and agentic workflows. The model achieves superior coding performance, excelling in benchmarks and practical coding assistants such as Claude Code, Cline, Roo Code, and Kilo Code. Its reasoning capabilities have been strengthened, including improved tool usage during inference and more effective integration within agent frameworks. ...
    Downloads: 47 This Week
    Last Update:
    See Project
  • 19
    Skills Janitor

    Skills Janitor

    Audit, track usage, and compare your Claude Code skills

    ...It also includes usage tracking by parsing conversation history, giving visibility into which skills are actively used and which are wasting resources. A notable feature is its token cost analysis, which helps developers understand how much context window space each skill consumes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    WavTokenizer

    WavTokenizer

    SOTA discrete acoustic codec models with 40/75 tokens per second

    WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures. The model uses a single-quantizer design together with temporal compression to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MiniMax-M2

    MiniMax-M2

    MiniMax-M2, a model built for Max coding & agentic workflows

    MiniMax-M2 is an open-weight large language model designed specifically for high-end coding and agentic workflows while staying compact and efficient. It uses a Mixture-of-Experts (MoE) architecture with 230 billion total parameters but only 10 billion activated per token, giving it the behavior of a very large model at a fraction of the runtime cost. The model is tuned for end-to-end developer flows such as multi-file edits, compile–run–fix loops, and test-validated repairs across real repositories and diverse programming languages. It is also optimized for multi-step agent tasks, planning and executing long toolchains that span shell commands, browsers, retrieval systems, and code runners. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    WhisperSpeech

    WhisperSpeech

    An Open Source text-to-speech system built by inverting Whisper

    ...The project aims to be for speech what Stable Diffusion is for images: powerful, hackable, and safe for commercial use, with code under Apache-2.0/MIT and models trained only on properly licensed data. Its architecture follows a token-based, multi-stage pipeline inspired by AudioLM and SPEAR-TTS: Whisper is used to produce semantic tokens, EnCodec compresses the waveform into acoustic tokens, and Vocos reconstructs high-fidelity audio from those tokens. The repository includes notebooks and scripts for inference, long-form synthesis, and finetuning, as well as pre-trained models and converted datasets hosted on Hugging Face. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Mercury Agent

    Mercury Agent

    Soul-driven AI agent with permission-hardened tools, token budgets

    Mercury Agent is a framework for building autonomous AI agents capable of executing complex workflows with minimal human intervention. It focuses on orchestrating tasks across multiple tools and services, enabling agents to perform end-to-end operations. The system includes mechanisms for planning, execution, and feedback, allowing agents to refine their actions iteratively. It supports integration with external APIs and services, making it adaptable to various domains. The architecture is...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 24
    SAM 3

    SAM 3

    Code for running inference and finetuning with SAM 3 model

    SAM 3 (Segment Anything Model 3) is a unified foundation model for promptable segmentation in both images and videos, capable of detecting, segmenting, and tracking objects. It accepts both text prompts (open-vocabulary concepts like “red car” or “goalkeeper in white”) and visual prompts (points, boxes, masks) and returns high-quality masks, boxes, and scores for the requested concepts. Compared with SAM 2, SAM 3 introduces the ability to exhaustively segment all instances of an...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 25
    Ollama Grid Search

    Ollama Grid Search

    A multi-platform desktop application to evaluate and compare LLM

    ...It also includes experiment logging and A/B testing capabilities, which allow users to compare outputs side by side and track performance metrics such as latency or token usage.
    Downloads: 11 This Week
    Last Update:
    See Project
Auth0 Logo