Search all of YouTube from the command line
NeurIPS2025 Spotlight] Quantized Attention
Llama Chinese community, real-time aggregation
SimpleMem: Efficient Lifelong Memory for LLM Agents
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Open source no-code system for text annotation and building of text
FlashMLA: Efficient Multi-head Latent Attention Kernels
A curated list of project tutorials for project-based learning
AI discovers 520000 stable inorganic crystal structures for research
Integrate AI Assistants with Django to build intelligent applications
A PyTorch-based Speech Toolkit
Learning agent trained in a diffusion world model
MII makes low-latency and high-throughput inference possible
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
A Personalized LLM-powered Agent Frameworks
Running large language models on a single GPU
The open source post-building layer for agents
Schema-Guided Reasoning (SGR) has agentic system design
Towards Efficient Self-Evolving Agent System
Learning to Reason with Search for LLMs via Reinforcement Learning
A tension reasoning engine over 131 S-class problems
An Efficient Web-enhanced Question Answering System
An agentless approach to automatically solve software development
Empowering Code Generation with OSS-Instruct
A simple, performant and scalable Jax LLM