NeurIPS2025 Spotlight] Quantized Attention
Maimaibot, a (more focused) multi-platform intelligent agent
Llama Chinese community, real-time aggregation
SimpleMem: Efficient Lifelong Memory for LLM Agents
Enables the best performance on NVIDIA RTX Graphics Cards
CoreNet: A library for training deep neural networks
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Flower: A Friendly Federated Learning Framework
FlashMLA: Efficient Multi-head Latent Attention Kernels
A curated list of project tutorials for project-based learning
Learning agent trained in a diffusion world model
A PyTorch-based Speech Toolkit
Integrate AI Assistants with Django to build intelligent applications
MII makes low-latency and high-throughput inference possible
Open-source multi-chain data routing & low-latency scanning framework.
Extremely fast enterprise server framework, can be used in RPC
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
A Personalized LLM-powered Agent Frameworks
Running large language models on a single GPU
The open source post-building layer for agents
Schema-Guided Reasoning (SGR) has agentic system design
Chat with your documents using local AI
Towards Efficient Self-Evolving Agent System
Learning to Reason with Search for LLMs via Reinforcement Learning
A tension reasoning engine over 131 S-class problems