Create videos with Stable Diffusion
PyTorch code and models for VJEPA2 self-supervised learning from video
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
CLIP, Predict the most relevant text snippet given an image
Text-space optimizer that trains reusable natural-language skills
A Hyperparameter Tuning Library for Keras
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Topic Modelling for Humans
Based on AI Agent + MCP toolchain + penetration Skill orchestration
A Family of Open Sourced Music Foundation Models
PyTorch code and models for V-JEPA self-supervised learning from video
Recovering the Visual Space from Any Views
Medical imaging toolkit for deep learning
An open-source toolkit for monitoring Language Learning Models (LLMs)
PyTorch version of Stable Baselines
1B text generation model based on the HRM architecture
AI-Driven Exploration in the Space of Code
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Motion-controllable Video Generation via Latent Trajectory Guidance
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Implementation of the Surya Foundation Model for Heliophysics
Generate high-definition story short videos with one click using AI
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Generate Any 3D Scene in Seconds
UI-TARS-desktop version that can operate on your local personal device