Block Diffusion for Ultra-Fast Speculative Decoding
User toolkit for analyzing and interfacing with Large Language Models
An efficient forwarding service designed for LLMs
AWS Skills for Agents
Tongyi Deep Research, the Leading Open-source Deep Research Agent
SGLang is a fast serving framework for large language models
Agentic, Reasoning, and Coding (ARC) foundation models
Large language model and vision-language model based on linear attention
Context management for Claude Code. Hooks maintain state via ledgers
Visual Causal Flow
SOTA discrete acoustic codec models with 40/75 tokens per second
An open-source text-to-speech system built by inverting Whisper
AGiXT is a dynamic AI Automation Platform
The highest-scoring AI memory system ever benchmarked
Open-weight, large-scale hybrid-attention reasoning model
Plug-and-play library to enable agents to call MCP and UTCP tools
Code for running inference and fine-tuning with the SAM 3 model
Qwen3 is the large language model series developed by the Qwen team
Provides convenient access to the Anthropic REST API from any Python 3
Build resilient language agents as graphs
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Uncertainty Quantification for Language Models, a Python package
Performance-optimized AI inference on your GPUs
High-performance Inference and Deployment Toolkit for LLMs and VLMs
A Telegram bot for Large Language Models