Autoregressive Model Beats Diffusion
DepGraph: Towards Any Structural Pruning
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Accessible large language models via k-bit quantization for PyTorch
Integrating LLMs into structured NLP pipelines
Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms
Implement CPU from scratch and play with large model deployments
Gemma open-weight LLM library, from Google DeepMind
Play ChatGPT and other LLM with Xiaomi AI Speaker
A Telegram bot for Large Language Models
Framework and no-code GUI for fine-tuning LLMs
Test-Time Reinforcement Learning
AI-driven multi-agent research assistant automating hypothesis
Synthetic data curation for post-training and data extraction
Easy token price estimates for 400+ LLMs. TokenOps
From nobody to big model (LLM) hero
Deploy your agentic worfklows to production
Modular AI runtime for robots
An open-source, modern-design AI training tracking and visualization
A simple, easy-to-hack GraphRAG implementation
General technology for enabling AI capabilities w/ LLMs and MLLMs
Weaving the Digital Agent Galaxy
Chat with your SQL database
A New Axis of Sparsity for Large Language Models
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible