Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Large-language-model & vision-language-model based on Linear Attention
Implementation of Make-A-Video, new SOTA text to video generator
Open-source MCP server that gives your coding agent
Run LLM prompts from your shell
An end-to-end Data Scientist
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Fast-stable-diffusion + DreamBooth
Ultimate meta-skill for generating best-in-class Claude Code skills
Persistent context and multi-instance coordination
Multimodal embedding and reranking models built on Qwen3-VL
A New Axis of Sparsity for Large Language Models
Context engineering is the new vibe coding
Instant AI code reviews
LLM training in simple, raw C/CUDA
A simple, secure MCP-to-OpenAPI proxy server
VMZ: Model Zoo for Video Modeling
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
High-resolution models for human tasks
Towards Real-World Vision-Language Understanding
Chat & pretrained large vision language model
Repo of Qwen2-Audio chat & pretrained large audio language model
Extensible AGI Framework
SWE-agent takes a GitHub issue and tries to automatically fix it
Flower: A Friendly Federated Learning Framework