Page 2 | token free download

Showing 216 open source projects for "token"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

Stop Storing Third-Party Tokens in Your Database
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.

Try Auth0 for Free
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

Humanizer Skill

Claude Code skill that removes signs of AI-generated writing from text

...It also includes functions for transforming camelCase, snake_case, or PascalCase identifiers into spaced and capitalized representations suitable for user interfaces, reports, or documentation. Beyond text formatting, the library can handle pluralization, enumeration formatting (“A, B, and C”), and token expansion so that program-generated content feels more conversational.

Downloads: 118 This Week

Last Update: 6 days ago
See Project
2

Lunary

The production toolkit for LLMs. Observability, prompt management

Lunary helps developers of LLM Chatbots develop and improve them.

Downloads: 8 This Week

Last Update: 2025-10-21
See Project
3

OpenMonoAgent

Terminal-native coding agent powered by local LLMs

OpenMonoAgent.ai is a self-hosted coding agent designed to run entirely on the user’s own hardware. It pairs a .NET CLI with a local llama.cpp inference server so developers can use agentic coding workflows without cloud subscriptions or per-token billing. The project emphasizes privacy, local control, and ownership of the model, compute, and project data. It includes a terminal-native workflow, built-in tools, Docker sandboxing, and code intelligence features. The system can run on CPU or GPU and is designed to auto-configure itself when possible. OpenMonoAgent.ai is best suited for developers who want a local AI development stack with no API keys, no cloud dependency, and no telemetry.

Downloads: 3 This Week

Last Update: 4 days ago
See Project
4

OpenSpace

OpenSpace: Make Your Agents: Smarter, Low-Cost, Self-Evolving

...The platform emphasizes collective intelligence, enabling multiple agents to share learned behaviors and benefit from each other’s experiences. It also focuses on cost efficiency by reducing redundant computations and reusing successful workflows, significantly lowering token usage in repeated tasks. The framework includes monitoring and evaluation mechanisms to track skill performance and ensure reliability as systems evolve. It supports integration with various agent platforms, making it flexible and extensible across different environments.

Downloads: 3 This Week

Last Update: 2026-06-02
See Project
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
5

Step 3.5 Flash

Fast, Sharp & Reliable Agentic Intelligence

Step 3.5 Flash is a cutting-edge, open-source large language model developed by StepFun-AI that pushes the frontier of efficient reasoning and “agentic” intelligence in a way that makes powerful AI accessible beyond proprietary black boxes. Unlike dense models that activate all their parameters for every token, Step 3.5 Flash uses a sparse Mixture-of-Experts (MoE) architecture that selectively engages only about 11 billion of its roughly 196 billion total parameters per token, delivering high-quality reasoning and interaction at far lower compute cost and latency than traditional large models. Its design targets deep reasoning, long-context handling, coding, and real-time responsiveness, making it suitable for building autonomous agents, advanced assistants, and long-chain cognitive workflows without sacrificing performance.

Downloads: 5 This Week

Last Update: 2026-04-03
See Project
6

Tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models

tiktoken is a high-performance, tokenizer library (based on byte-pair encoding, BPE) designed for use with OpenAI’s models. It handles encoding and decoding text to token IDs efficiently, with minimal overhead. Because tokenization is a fundamental step in preparing text for models, tiktoken is optimized for speed, memory, and correctness in model contexts (e.g. matching OpenAI’s internal tokenization). The repo supports multiple encodings (e.g. “cl100k_base”) and lets users switch encoding names to match different model contexts. ...

Downloads: 5 This Week

Last Update: 2026-05-15
See Project
7

MiMo-V2-Flash

MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation

MiMo-V2-Flash is a large Mixture-of-Experts language model designed to deliver strong reasoning, coding, and agentic-task performance while keeping inference fast and cost-efficient. It uses an MoE setup where a very large total parameter count is available, but only a smaller subset is activated per token, which helps balance capability with runtime efficiency. The project positions the model for workflows that require tool use, multi-step planning, and higher throughput, rather than only single-turn chat. Architecturally, it highlights attention and prediction choices aimed at accelerating generation while preserving instruction-following quality in complex prompts. ...

Downloads: 4 This Week

Last Update: 2026-01-08
See Project
8

Claude-Mem

Claude Code plugin that automatically captures everything Claude does

...By enabling long-term continuity, Claude-Mem helps Claude “remember” project history, past fixes, and prior reasoning even after restarts or reconnects. Its progressive disclosure approach intelligently injects only the most relevant context, balancing usefulness with token efficiency. Claude-Mem runs automatically in the background with no manual workflow changes required. Designed for serious developers, it transforms Claude Code into a continuously learning, project-aware coding assistant.

Downloads: 6 This Week

Last Update: 1 day ago
See Project
9

PHP Client For NLP Cloud

NLP Cloud serves high performance pre-trained or custom models for NER

...It is ready for production, served through a REST API. You can either use the NLP Cloud pre-trained models, fine-tune your own models, or deploy your own models. Pass the model you want to use and the NLP Cloud token to the client during initialization. If you are making asynchronous requests, you will always receive a quick response containing a URL.

Downloads: 6 This Week

Last Update: 2024-11-27
See Project
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
10

Stanford CoreNLP

Stanford CoreNLP, a Java suite of core NLP tools

CoreNLP is your one stop shop for natural language processing in Java! CoreNLP enables users to derive linguistic annotations for text, including token and sentence boundaries, parts of speech, named entities, numeric and time values, dependency and constituency parses, coreference, sentiment, quote attributions, and relations. CoreNLP currently supports 6 languages, Arabic, Chinese, English, French, German, and Spanish. The centerpiece of CoreNLP is the pipeline. Pipelines take in raw text, run a series of NLP annotators on the text, and produce a final set of annotations. ...

Downloads: 6 This Week

Last Update: 2025-06-07
See Project
11

hfapigo

Unofficial (Golang) Go bindings for the Hugging Face Inference API

(Golang) Go bindings for the Hugging Face Inference API. Directly call any model available in the Model Hub. An API key is required for authorized access. To get one, create a Hugging Face profile.

Downloads: 8 This Week

Last Update: 2025-11-06
See Project
12

Model Context Protocol TypeScript SDK

The official Typescript SDK for Model Context Protocol servers

The TypeScript SDK for Model Context Protocol simplifies integration with the Model Context Protocol, enabling developers to interact with AI models effectively.

Downloads: 7 This Week

Last Update: 2026-03-30
See Project
13

MiniOneRec

Minimal reproduction of OneRec

...The framework provides an end-to-end pipeline for building generative recommender systems, including semantic identifier construction, supervised fine-tuning, and reinforcement learning-based optimization. Semantic IDs are created using techniques such as quantized variational autoencoders to convert item features into token sequences that can be modeled by transformer architectures. Developers can train and evaluate recommendation models using different backbone language models while benefiting from the generative framework’s parameter efficiency and scalability.

Downloads: 1 This Week

Last Update: 2026-05-14
See Project
14

Claude Code Bridge

Real-time multi-AI collaboration: Claude, Codex & Gemini

...The system allows developers to coordinate interactions between models such as Claude, Codex, and Gemini so that they can work together on programming tasks. By maintaining persistent shared context between these models, the tool reduces redundant prompts and minimizes token usage while allowing each AI system to contribute specialized capabilities. The architecture functions as a unified launcher that manages communication between multiple AI providers and coordinates their responses within the same development session. Developers can run the tool in terminal environments and integrate it with terminal multiplexers such as tmux or advanced terminal emulators.

Downloads: 2 This Week

Last Update: 12 hours ago
See Project
15

MCP Text Editor

Provides line-oriented text file editing capabilities

The MCP Text Editor Server provides line-oriented text file editing capabilities through a standardized API, optimized for integration with Large Language Models (LLMs). It enables efficient partial file access, minimizing token usage while ensuring safe concurrent editing.

Downloads: 6 This Week

Last Update: 2026-01-05
See Project
16

InfiAgent

Build your own Cowork, AI Scientist and other SoTA Agents

...Designed as a “Multi-Level Agent” (MLA) system, it externalizes persistent state to the file system so that agents can operate over unlimited runtime without the need for token-intensive context compression, enabling workflows such as research paper drafting, experiments, coding, and document generation to run reliably. The framework uses a serial multi-agent hierarchy where specialized agents coordinate in tree-structured paths for clear task delegation and minimal tool conflicts, while batch file operations and persistent workspaces ensure reproducibility and traceability. ...

Downloads: 2 This Week

Last Update: 2026-03-30
See Project
17

HunyuanImage-3.0

A Powerful Native Multimodal Model for Image Generation

...It unifies multimodal understanding and generation in a single autoregressive framework, combining text and image modalities seamlessly rather than relying on separate image-only diffusion components. It uses a Mixture-of-Experts (MoE) architecture with many expert subnetworks to scale efficiently, deploying only a subset of experts per token, which allows large parameter counts without linear inference cost explosion. The model is intended to be competitive with closed-source image generation systems, aiming for high fidelity, prompt adherence, fine detail, and even “world knowledge” reasoning (i.e. leveraging context, semantics, or common sense in generation). The GitHub repo includes code, scripts, model loading instructions, inference utilities, prompt handling, and integration with standard ML tooling (e.g. ...

1 Review

Downloads: 2 This Week

Last Update: 2026-02-03
See Project
18

Text Generation Inference

Large Language Model Text Generation Inference

Text Generation Inference is a high-performance inference server for text generation models, optimized for Hugging Face's Transformers. It is designed to serve large language models efficiently with optimizations for performance and scalability.

Downloads: 8 This Week

Last Update: 2025-12-18
See Project
19

JMusicBot

A Discord music bot that's easy to set up and run yourself

A cross-platform Discord music bot with a clean interface, and that is easy to set up and run yourself! Easy to run (just make sure Java is installed, and run!) Fast loading of songs. No external keys are needed (besides a Discord Bot token) Smooth playback. Server-specific setup for the "DJ" role that can moderate the music. Clean and beautiful menus. Supports many sites, including Youtube, Soundcloud, and more. Supports many online radio/streams. Supports local files. Playlist support (both web/youtube and local) This bot (and the source code here) might not be easy to edit for inexperienced programmers. ...

Downloads: 1 This Week

Last Update: 2024-08-05
See Project
20

Transformer Explainer

Learn How LLM Transformer Models Work with Interactive Visualization

...Through visual diagrams and interactive interfaces, the tool reveals how tokens are processed through layers such as embeddings, attention mechanisms, and feed-forward networks. Users can observe how attention weights change as the model predicts the next token, offering insight into how transformer architectures capture relationships between words. The design of the platform emphasizes educational accessibility, allowing students, researchers, and developers to explore complex machine learning concepts without requiring specialized hardware or installations.

Downloads: 2 This Week

Last Update: 2026-03-04
See Project
21

Vectorize MCP Server

Official Vectorize MCP Server

The Vectorize MCP Server is a Model Context Protocol server that integrates with Vectorize, offering advanced vector retrieval and text extraction capabilities.

Downloads: 2 This Week

Last Update: 2025-04-08
See Project
22

agentsview

Local-first session intelligence and analytics for coding agents

...It indexes conversations from tools like Claude Code, Codex, Gemini CLI, Cursor, OpenHands, and many other agent systems. The project lets users browse, search, and analyze coding-agent activity without creating an account or sending session content to a hosted service. It tracks token usage, cost, models, projects, tools, and session behavior across different agents. Its web interface adds dashboards, heatmaps, full-text search, and live updates while sessions are active. It can also support team-oriented workflows through optional PostgreSQL sync and DuckDB mirroring.

Downloads: 0 This Week

Last Update: 24 hours ago
See Project
23

abtop

Like htop, but for AI coding agents. Monitor Claude Code & Codex CLI

abtop is a terminal monitoring tool for AI coding agents, inspired by system monitors like htop and btop. It gives users a real-time view of active Claude Code, Codex CLI, and OpenCode sessions from local process and file state. The dashboard helps developers track token usage, context window percentage, rate limits, child processes, open ports, and multiple active profiles. It is read-only, so it does not require API keys or authentication and does not control the agents it observes. abtop is especially useful for developers running several agents across projects who need quick visibility into cost, quota pressure, context growth, and orphaned processes. ...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
24

RecursiveMAS

Offical Implementation for "Recursive Multi-Agent Systems"

...It also incorporates an inner–outer loop training approach that optimizes the entire system collectively rather than tuning each agent separately. This design improves efficiency, reduces token usage, and stabilizes learning during iterative reasoning.

Downloads: 0 This Week

Last Update: 2026-05-25
See Project
25

Streamdown

Streaming markdown renderer for AI apps with smooth updates

...Streamdown is built to handle partial Markdown input gracefully, progressively enhancing the output as more text becomes available. It is especially relevant for chat interfaces, coding assistants, and any environment where responses are streamed token by token. Streamdown emphasizes performance and simplicity, ensuring that developers can integrate it without unnecessary complexity. It prioritizes correctness in Markdown rendering while maintaining responsiveness during continuous updates. Overall, it serves as a practical solution for improving the user experience of real-time generated text displays.

Downloads: 0 This Week

Last Update: 2026-03-18
See Project