First-person shooter, space invader
CLIP, Predict the most relevant text snippet given an image
PyTorch code and models for VJEPA2 self-supervised learning from video
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Synchronized Translation for Videos
PyTorch code and models for V-JEPA self-supervised learning from video
Generate Any 3D Scene in Seconds
Large Multimodal Models for Video Understanding and Editing
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Comprehensive study guide for coding interviews
Mini website for testing both general CS knowledge and enforce coding
A collective list of free APIs
Implementation of the Surya Foundation Model for Heliophysics
SOTA discrete acoustic codec models with 40/75 tokens per second
Diffusion Transformer with Fine-Grained Chinese Understanding
Language modeling in a sentence representation space
Generate 3D objects conditioned on text or images
Deletes junk files to free disk space and improve privacy
4X Space Strategy Game
GTK+ comic book viewer.
General Mission Analysis Tool
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
A small programmer's editor.
Software for molecular simulations and trajectory analysis
Code release for "Detecting Twenty-thousand Classes