Synthetic data curation for post-training and data extraction
How to optimize some algorithm in cuda
NeurIPS2025 Spotlight] Quantized Attention
Weaving the Digital Agent Galaxy
Unified framework for building enterprise RAG pipelines
An end-to-end Data Scientist
Spark-TTS Inference Code
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Ultimate meta-skill for generating best-in-class Claude Code skills
End-to-end pipeline converting generative videos
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Hunyuan Translation Model Version 1.5
Multimodal embedding and reranking models built on Qwen3-VL
A New Axis of Sparsity for Large Language Models
Code release for Cut and Learn for Unsupervised Object Detection
High-resolution models for human tasks
Ling is a MoE LLM provided and open-sourced by InclusionAI
Scalable data pre processing and curation toolkit for LLMs
PyTorch version of Stable Baselines
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Conditional GAN for generating synthetic tabular data
Library to help with training and evaluating neural networks
Image processing in Python
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
RGBD video generation model conditioned on camera input