token free download - SourceForge

Showing 216 open source projects for "token"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
1

Token-Oriented Object Notation

Token-Oriented Object Notation (TOON)

Token-Oriented Object Notation is an open specification and toolkit for a data serialization format called Token-Oriented Object Notation (TOON), designed specifically to optimize how structured data is passed to large language models. The format aims to reduce token overhead compared with traditional formats like JSON while remaining human-readable and structurally expressive.

Downloads: 7 This Week

Last Update: 2026-05-20
See Project
2

rtk

CLI proxy that reduces LLM token consumption

rtk is an open-source command-line proxy designed to optimize interactions between AI coding agents and the terminal by reducing unnecessary token consumption. When AI assistants execute shell commands during software development tasks, the resulting terminal output often contains large amounts of repetitive or irrelevant information that can overwhelm the model’s context window. RTK intercepts these command outputs and compresses them into concise summaries before sending them to the language model. ...

Downloads: 44 This Week

Last Update: 1 day ago
See Project
3

DeepSeek-V3

Powerful AI language model (MoE) optimized for efficiency/performance

DeepSeek-V3 is a robust Mixture-of-Experts (MoE) language model developed by DeepSeek, featuring a total of 671 billion parameters, with 37 billion activated per token. It employs Multi-head Latent Attention (MLA) and the DeepSeekMoE architecture to enhance computational efficiency. The model introduces an auxiliary-loss-free load balancing strategy and a multi-token prediction training objective to boost performance. Trained on 14.8 trillion diverse, high-quality tokens, DeepSeek-V3 underwent supervised fine-tuning and reinforcement learning to fully realize its capabilities. ...

1 Review

Downloads: 64 This Week

Last Update: 2025-07-09
See Project
4

Tokscale

A CLI tool for tracking token usage from OpenCode, Claude Code

Tokscale is a CLI and terminal UI tool that tracks token usage and estimated cost across multiple AI coding assistants and development workflows. It treats tokens like a measurable resource, helping developers understand how much “AI energy” they are consuming over time and where it is being spent. The tool aggregates usage across supported platforms and presents it through interactive views that let users filter, sort, and explore trends without leaving the terminal.

Downloads: 7 This Week

Last Update: 4 days ago
See Project
Atera - an All-in-one platform for IT management
Ideal for IT departments and MSPs (managed service providers)

Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!

Try Atera now
5

TokenCost

Easy token price estimates for 400+ LLMs. TokenOps

TokenCost is an open-source developer utility designed to estimate the cost of using large language model APIs by calculating token usage and translating it into real monetary values. The tool focuses on helping developers understand how much their prompts and generated completions cost when interacting with commercial AI models. It works by counting tokens in prompts and responses before or after sending requests and then applying pricing information associated with different models. ...

Downloads: 3 This Week

Last Update: 2026-03-06
See Project
6

Repomix

Repomix is a powerful tool that packs your entire repository

...The tool is particularly valuable for code review, refactoring assistance, and automated documentation workflows where context size matters. Repomix intelligently respects ignore rules and can compress code structure to reduce token usage while preserving meaning. It supports multiple output formats and provides token counting to help developers stay within model limits. The project also includes CLI, browser, and editor integrations that make it easy to incorporate into everyday workflows. Overall, Repomix serves as a bridge between traditional repositories and AI-native development practices.

Downloads: 9 This Week

Last Update: 2026-05-26
See Project
7

FastVLM

This repository contains the official implementation of FastVLM

...Apple’s research brief frames FastVLM as targeting real-time or latency-sensitive scenarios, where lowering visual token pressure is critical to interactive UX. In short, it’s a practical recipe to make VLMs fast without exotic token-selection heuristics.

Downloads: 0 This Week

Last Update: 2025-10-08
See Project
8

OpenAI Privacy Filter

Bidirectional token-classification model for identifiable info

OpenAI Privacy Filter is an open-weight machine learning model designed to detect and mask personally identifiable information in text with high efficiency and contextual awareness. It operates as a bidirectional token classification system that labels sensitive data in a single forward pass rather than generating text sequentially, enabling fast processing for large datasets. The model supports long-context inputs, allowing it to analyze extensive documents without chunking, which improves consistency in redaction tasks. It can run locally on standard hardware, ensuring that sensitive information never leaves the user’s environment and supporting privacy-first workflows. ...

Downloads: 2 This Week

Last Update: 2026-04-24
See Project
9

OpenSquilla

Token-Efficient AI Agent with same budget, higher intelligence density

OpenSquilla is a token-efficient microkernel AI agent runtime designed for CLI, web UI, and chat-based workflows. It routes each turn through a shared loop that can select lower-cost models when appropriate while preserving tool dispatch, retries, memory, and decision logging. The project supports multiple LLM providers through a pluggable provider layer, making it adaptable to different model ecosystems.

Downloads: 5 This Week

Last Update: 2026-06-03
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
10

Claude Cognitive

Persistent context and multi-instance coordination

...It introduces an attention-based context router that prioritizes files and content relevant to the current development discussion — tagging them as HOT, WARM, or COLD based on recency and keyword activation — so Claude Code doesn’t waste token budget rereading irrelevant code. This context routing dramatically reduces redundant token usage and accelerates large codebase interactions by focusing only on what truly matters to the current task. Additionally, Claude-Cognitive includes a pool coordinator to share state across multiple Claude Code instances, preserving what’s been learned or completed and preventing repetitive debugging or redundant exploration.

Downloads: 5 This Week

Last Update: 2026-01-28
See Project
11

Claw Compactor

14-stage Fusion Pipeline for LLM token compression

...It addresses the challenge of finite context windows in language models by compressing or summarizing historical interactions while preserving essential information. The system works by transforming older conversation data into condensed representations that maintain continuity without exceeding token limits. This approach allows long-running agent sessions to continue operating efficiently without losing critical context. It is especially useful in autonomous workflows where agents accumulate large volumes of interaction history over time. The project aligns with broader strategies in AI systems that balance memory retention with computational constraints. ...

Downloads: 8 This Week

Last Update: 2026-03-31
See Project
12

caveman

Why use many token when few token do trick

Caveman is a lightweight and experimental project focused on simplifying backend or full-stack development workflows through minimalistic abstractions and rapid prototyping principles. It is designed to reduce the complexity of modern frameworks by offering a stripped-down approach that prioritizes speed, clarity, and ease of use. The project often serves as a foundation for developers who want to build applications quickly without being constrained by heavy conventions or extensive...

Downloads: 22 This Week

Last Update: 1 day ago
See Project
13

Phenaki - Pytorch

Implementation of Phenaki Video, which uses Mask GIT

Implementation of Phenaki Video, which uses Mask GIT to produce text-guided videos of up to 2 minutes in length, in Pytorch. It will also combine another technique involving a token critic for potentially even better generations. A new paper suggests that instead of relying on the predicted probabilities of each token as a measure of confidence, one can train an extra critic to decide what to iteratively mask during sampling. This repository will also endeavor to allow the researcher to train on text-to-image and then text-to-video. ...

Downloads: 3 This Week

Last Update: 2024-07-29
See Project
14

Solana Agent Kit

Connect any ai agents to solana protocols

solana-agent-kit is an open-source toolkit that enables AI agents to connect with Solana blockchain protocols. It allows any AI agent, regardless of the underlying model, to autonomously perform over 60 Solana actions, including token trading, launching new tokens, lending assets, sending compressed airdrops, executing blinks, launching tokens on Automated Market Makers (AMMs), and bridging tokens across chains.

Downloads: 8 This Week

Last Update: 2025-07-24
See Project
15

Hermes Web UI

The best way to use Hermes Agent from the web or from your phone

...It is built using simple technologies like Python and vanilla JavaScript, avoiding complex frontend frameworks. The UI supports real-time interaction, context tracking, and visualization of token usage. It connects to a self-hosted agent that continuously learns and evolves over time. The project emphasizes usability, accessibility, and seamless integration with existing workflows.

Downloads: 24 This Week

Last Update: 14 hours ago
See Project
16

Claude Code Usage Monitor

Real-time Claude Code usage monitor with predictions and warnings

Claude Code Usage Monitor is a developer-focused terminal tool that provides real-time visibility into Claude Code token consumption and session behavior. The project is designed to help users avoid unexpectedly hitting usage caps by continuously tracking token burn rate, message counts, and estimated costs during active sessions. It presents analytics through a visually rich terminal interface built with modern Python tooling, making it easy to interpret usage trends at a glance. ...

Downloads: 3 This Week

Last Update: 2026-03-02
See Project
17

claude-devtools

A desktop app that reconstructs exactly what Claude Code did

...The tool was created to address the loss of detail in the standard CLI output, which often summarizes actions without exposing the full underlying operations. It surfaces granular information such as file reads, edits, tool calls, token consumption, and subagent activity, enabling developers to understand exactly how the AI interacted with their codebase. Because it runs entirely locally and makes no network calls, it requires no API keys or configuration and works with any previously recorded sessions.

Downloads: 18 This Week

Last Update: 2026-05-13
See Project
18

Step-Audio-EditX

LLM-based Reinforcement Learning audio edit model

...Rather than treating audio editing as low-level waveform manipulation, this model converts speech into a sequence of discrete “audio tokens” (via a dual-codebook tokenizer) — combining a linguistic token stream and a semantic (prosody/emotion/style) token stream — thereby abstracting audio editing into high-level token operations. This allows users to modify not only what is said (the text) but also how it's said: emotion, tone, speaking style, prosody, accent, even paralinguistic cues. Because the model is trained with a “large-margin learning” objective over many synthesized and natural speech samples, it gains robust control over expressive attributes, and can perform iterative editing: e.g. you could record a line, then ask the model to “make it sadder,” “speak slower,” or “change accent to X.”

Downloads: 0 This Week

Last Update: 2026-04-09
See Project
19

Pinchtab

High-performance browser automation bridge and orchestrator

...Implemented as a small standalone HTTP server, it allows any agent or script to interact with web pages using simple API calls instead of heavyweight browser frameworks. The tool emphasizes accessibility-first snapshots that dramatically reduce token usage compared to screenshot-based approaches, making it cost-effective for large-scale automation. It launches and manages its own Chrome instance while remaining framework-agnostic, so it can be used with any language or agent system. Pinchtab also supports persistent sessions, stealth automation, and both headless and headed operation modes. ...

Downloads: 8 This Week

Last Update: 2026-05-31
See Project
20

TONL

TONL (Token-Optimized Notation Language)

TONL is a cutting-edge data platform built around a production-ready serialization format designed to be both compact and powerful, combining human readability with performance features that make it suitable for large-scale applications and AI workflows. It provides a serialization format that significantly reduces token usage compared with traditional JSON, which can result in lower costs and more efficient prompt size utilization in LLM-driven systems. TONL isn’t just a format — it includes a rich API for querying, indexing, modifying, and streaming data, along with tools for schema validation and TypeScript code generation. The platform comes with a complete command-line interface that supports interactive dashboards and cross-platform usage in browsers and server environments, and its high test coverage gives developers confidence in stability.

Downloads: 0 This Week

Last Update: 2026-02-07
See Project
21

DeepSeek R1

Open-source, high-performance AI model with advanced reasoning

...DeepSeek R1 offers unrestricted access for both commercial and academic use. The model employs a Mixture of Experts (MoE) architecture, comprising 671 billion total parameters with 37 billion active parameters per token, and supports a context length of up to 128,000 tokens. DeepSeek-R1's training regimen uniquely integrates large-scale reinforcement learning (RL) without relying on supervised fine-tuning, enabling the model to develop advanced reasoning capabilities. This approach has resulted in performance comparable to leading models like OpenAI's o1, while maintaining cost-efficiency. ...

1 Review

Downloads: 98 This Week

Last Update: 2025-07-09
See Project
22

Lossless Claw

LCM (Lossless Context Management) plugin for OpenClaw

...Instead of relying on traditional sliding-window truncation or lossy summarization, it introduces a lossless architecture that preserves all historical messages while maintaining usable context within token limits. The system stores every interaction in a persistent database and incrementally summarizes older content into a hierarchical directed acyclic graph, allowing efficient compression without discarding information. This structure enables agents to dynamically reconstruct detailed context by expanding summaries when needed, effectively simulating perfect long-term memory.

Downloads: 6 This Week

Last Update: 2026-06-05
See Project
23

webclaw

Fast, local-first web content extraction for LLMs

...It is built in Rust and operates without a headless browser, using advanced techniques such as TLS fingerprinting to bypass common scraping barriers and mimic real browser behavior. The tool addresses a major inefficiency in AI workflows by removing irrelevant elements like navigation menus, ads, and scripts, significantly reducing token usage when feeding data into language models. It supports multiple modes of operation, including CLI usage, REST API access, and an MCP server for direct integration with agent-based systems. Webclaw also provides advanced capabilities such as recursive crawling, structured JSON extraction, summarization, and content comparison, making it suitable for research and data pipelines. ...

Downloads: 9 This Week

Last Update: 3 days ago
See Project
24

Gitingest

Create prompt-friendly codebase digests from any Git repository URL

...In addition to producing the code digest, Gitingest also calculates statistics about the extracted content such as repository structure, total size of the extract, and token count. Gitingest can be used as a command line utility or integrated directly into Python applications.

Downloads: 7 This Week

Last Update: 2026-03-13
See Project
25

Oh My OpenCode Slim

Slimmed, cleaned and fine-tuned oh-my-opencode fork

Oh My OpenCode Slim is a lightweight, optimized fork of the broader oh-my-opencode ecosystem, designed to deliver high-performance multi-agent coding workflows while significantly reducing token consumption and system overhead. It retains the core concept of orchestrating multiple specialized AI agents but streamlines their configuration, execution, and communication to make the system more efficient and practical for everyday use. The framework introduces a structured “pantheon” of agents, each with a defined role such as orchestration, exploration, and execution, allowing tasks to be automatically delegated and completed through coordinated workflows. ...

Downloads: 4 This Week

Last Update: 1 day ago
See Project