Showing 22 open source projects for "token"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Claude Cognitive

    Claude Cognitive

    Persistent context and multi-instance coordination

    ...It introduces an attention-based context router that prioritizes files and content relevant to the current development discussion — tagging them as HOT, WARM, or COLD based on recency and keyword activation — so Claude Code doesn’t waste token budget rereading irrelevant code. This context routing dramatically reduces redundant token usage and accelerates large codebase interactions by focusing only on what truly matters to the current task. Additionally, Claude-Cognitive includes a pool coordinator to share state across multiple Claude Code instances, preserving what’s been learned or completed and preventing repetitive debugging or redundant exploration.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    claude-devtools

    claude-devtools

    A desktop app that reconstructs exactly what Claude Code did

    ...The tool was created to address the loss of detail in the standard CLI output, which often summarizes actions without exposing the full underlying operations. It surfaces granular information such as file reads, edits, tool calls, token consumption, and subagent activity, enabling developers to understand exactly how the AI interacted with their codebase. Because it runs entirely locally and makes no network calls, it requires no API keys or configuration and works with any previously recorded sessions.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 3
    Oh My OpenCode Slim

    Oh My OpenCode Slim

    Slimmed, cleaned and fine-tuned oh-my-opencode fork

    Oh My OpenCode Slim is a lightweight, optimized fork of the broader oh-my-opencode ecosystem, designed to deliver high-performance multi-agent coding workflows while significantly reducing token consumption and system overhead. It retains the core concept of orchestrating multiple specialized AI agents but streamlines their configuration, execution, and communication to make the system more efficient and practical for everyday use. The framework introduces a structured “pantheon” of agents, each with a defined role such as orchestration, exploration, and execution, allowing tasks to be automatically delegated and completed through coordinated workflows. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    OpenMonoAgent

    OpenMonoAgent

    Terminal-native coding agent powered by local LLMs

    OpenMonoAgent.ai is a self-hosted coding agent designed to run entirely on the user’s own hardware. It pairs a .NET CLI with a local llama.cpp inference server so developers can use agentic coding workflows without cloud subscriptions or per-token billing. The project emphasizes privacy, local control, and ownership of the model, compute, and project data. It includes a terminal-native workflow, built-in tools, Docker sandboxing, and code intelligence features. The system can run on CPU or GPU and is designed to auto-configure itself when possible. OpenMonoAgent.ai is best suited for developers who want a local AI development stack with no API keys, no cloud dependency, and no telemetry.
    Downloads: 3 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 5
    Claude-Mem

    Claude-Mem

    Claude Code plugin that automatically captures everything Claude does

    ...By enabling long-term continuity, Claude-Mem helps Claude “remember” project history, past fixes, and prior reasoning even after restarts or reconnects. Its progressive disclosure approach intelligently injects only the most relevant context, balancing usefulness with token efficiency. Claude-Mem runs automatically in the background with no manual workflow changes required. Designed for serious developers, it transforms Claude Code into a continuously learning, project-aware coding assistant.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    agentsview

    agentsview

    Local-first session intelligence and analytics for coding agents

    ...It indexes conversations from tools like Claude Code, Codex, Gemini CLI, Cursor, OpenHands, and many other agent systems. The project lets users browse, search, and analyze coding-agent activity without creating an account or sending session content to a hosted service. It tracks token usage, cost, models, projects, tools, and session behavior across different agents. Its web interface adds dashboards, heatmaps, full-text search, and live updates while sessions are active. It can also support team-oriented workflows through optional PostgreSQL sync and DuckDB mirroring.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    abtop

    abtop

    Like htop, but for AI coding agents. Monitor Claude Code & Codex CLI

    abtop is a terminal monitoring tool for AI coding agents, inspired by system monitors like htop and btop. It gives users a real-time view of active Claude Code, Codex CLI, and OpenCode sessions from local process and file state. The dashboard helps developers track token usage, context window percentage, rate limits, child processes, open ports, and multiple active profiles. It is read-only, so it does not require API keys or authentication and does not control the agents it observes. abtop is especially useful for developers running several agents across projects who need quick visibility into cost, quota pressure, context growth, and orphaned processes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    DeepSeek-Reasonix

    DeepSeek-Reasonix

    DeepSeek-native AI coding agent for your terminal

    DeepSeek Reasonix is a DeepSeek-native AI coding agent designed for terminal-based software development. It is built around prefix-cache stability, which helps reduce token costs during long sessions and allows users to leave the agent running across extended workflows. Reasonix includes a coding mode with filesystem and shell tools, a lighter chat mode, one-shot task execution, health checks, session utilities, and project-scoped memory. It supports reviewed SEARCH/REPLACE edits, plan mode, MCP servers, web search, hooks, skills, semantic indexing, transcript replay, event logs, and cost or cache tracking. ...
    Downloads: 41 This Week
    Last Update:
    See Project
  • 9
    MiniMax-M2

    MiniMax-M2

    MiniMax-M2, a model built for Max coding & agentic workflows

    MiniMax-M2 is an open-weight large language model designed specifically for high-end coding and agentic workflows while staying compact and efficient. It uses a Mixture-of-Experts (MoE) architecture with 230 billion total parameters but only 10 billion activated per token, giving it the behavior of a very large model at a fraction of the runtime cost. The model is tuned for end-to-end developer flows such as multi-file edits, compile–run–fix loops, and test-validated repairs across real repositories and diverse programming languages. It is also optimized for multi-step agent tasks, planning and executing long toolchains that span shell commands, browsers, retrieval systems, and code runners. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 10
    Context Mode

    Context Mode

    Context window optimization for AI coding agents

    Context Mode is a development approach and tooling concept that enhances how AI-assisted coding environments manage and inject context into language model interactions. It focuses on improving the relevance and accuracy of AI-generated outputs by controlling what information is provided to the model at each step. The project explores structured context management, enabling developers to define how files, code snippets, and metadata are included in prompts. It is particularly useful for large...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 11
    Edgee

    Edgee

    AI gateway with token compression for Claude Code, Codex, and more

    Edgee is an edge-native execution platform designed to run AI-driven logic and data processing directly at the network edge, reducing latency and improving responsiveness for modern applications. It enables developers to deploy functions and workflows closer to users, allowing real-time processing without relying heavily on centralized cloud infrastructure. The platform is built to support event-driven architectures, where actions are triggered by incoming requests, user behavior, or...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 12
    Gemini CLI

    Gemini CLI

    Open source AI agent CLI tool to bring Gemini into your terminal

    Gemini CLI is an open‑source AI agent that brings the capabilities of Google’s Gemini 2.5 Pro large‑language model directly into your terminal, enabling tasks ranging from coding and debugging to content creation and research via natural‑language prompts, with support for multimodal outputs like image and video generation. Gemini CLI integrates with external tools and MCP servers, enabling media generation and enhanced workflow automation. It also includes a built-in Google Search tool to...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 13
    Qwen Code

    Qwen Code

    Qwen Code is a coding agent that lives in the digital world

    Qwen Code is a command-line AI workflow tool designed to enhance developer productivity by leveraging the power of Qwen3-Coder models. Adapted from the Google Gemini CLI, it features an enhanced parser optimized specifically for Qwen-Coder models, enabling deep code understanding and manipulation. The tool supports querying and editing large codebases beyond traditional context limits, making it ideal for modern, complex projects. Qwen Code automates various development workflows, including...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    deepclaude

    deepclaude

    Use Claude Code's agent loop with DeepSeek V4 Pro, OpenRouter & more

    ...It preserves the full Claude Code experience—including file editing, terminal execution, and multi-step agent workflows—while dramatically reducing operational costs. By swapping out the underlying model instead of the interface, deepclaude delivers the same familiar UX with significantly cheaper token pricing. The platform supports seamless backend switching in real time, allowing users to choose between cost efficiency and higher reasoning power when needed. It also includes built-in cost tracking and benchmarking tools to help developers monitor usage and optimize performance. Designed for flexibility and efficiency, deepclaude is ideal for developers who want powerful AI coding agents without the premium price tag.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    MiniMax-M2.5

    MiniMax-M2.5

    State of the art LLM and coding model

    ...The model supports full-stack development across web, mobile, and desktop platforms, covering the entire lifecycle from system design to testing and code review. With native serving speeds of up to 100 tokens per second, it completes complex agentic tasks significantly faster than previous versions while maintaining high token efficiency. M2.5 is built to be highly cost-effective, enabling continuous deployment of powerful AI agents at a fraction of the cost of other frontier models.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Mentat

    Mentat

    Mentat - The AI Coding Assistant

    ...Mentat uses Git, so if your project doesn't already have Git set up, run git init. List the files you would like Mentat to read and edit as arguments. Mentat will add each of them to context, so be careful not to exceed the GPT-4 token context limit.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Qwen-of-Death
    Qwen of Death is a desktop coding assistant with GUI powered by the Qwen API. Users supply their own API key — all billing is handled directly with Openrouter.ai.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    GPT of Death
    GPT of Death is a desktop coding assistant with GUI powered by the OpenAI API. Users supply their own API key — all billing is handled directly with OpenAI.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Grok of Death
    Grok of Death is a desktop coding assistant with GUI powered by the Grok API. Users supply their own API key — all billing is handled directly with xAI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Gemini of Death
    Gemini of Death is a desktop coding assistant with GUI powered by the Googles Gemini API. Users supply their own API key — all billing is handled directly with Google.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Claude of Death
    Claude of Death is a desktop coding assistant with GUI powered by the Anthropic Claude API. Users supply their own API key — all billing is handled directly with Anthropic. 4-24-26: Just added memory to it, like Claude Code. Also improved the save function. 4-25-26: Fixed save function for Macs. 4-28-26: Added Save Code as File, in addition to already present Generate File. 5-9-26: Renamed from Code of Death to Claude of Death.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    lastest

    lastest

    AI-supported visual verification and tests you can actually trust.

    Lastest.cloud is a free, open-source verification of development, self-hosted visual regression and end-to-end testing platform for web applications. An AI agent records you clicking through your running app and generates Playwright tests with multi-selector fallback. Replays are deterministic and token-free, so your CI/CD bill doesn't scale with your test suite. Lastest ships three diff engines side-by-side — pixel (pixelmatch), structural (SSIM), and perceptual (Butteraugli) — so flaky pixel diffs stop crying wolf. A review dashboard tracks baselines, approvals, and audit history. Run it via Docker on your own infrastructure for unlimited screenshots and full data residency, or run on Lastest Cloud for managed CI runners. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo