Generate blog articles from video or audio
SOTA discrete acoustic codec models with 40/75 tokens per second
Controllable and fast Text-to-Speech for over 7000 languages
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Python examples of popular machine learning algorithms
DeepMind model for tracking arbitrary points across videos & robotics
Code for Mesh R-CNN, ICCV 2019
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
PyTorch code and models for VJEPA2 self-supervised learning from video
Language modeling in a sentence representation space
An AI-powered security review GitHub Action using Claude
Educational framework exploring multi-agent orchestration
Official Python implementation of UTCP. UTCP is an open standard
Proofs, cases, concept supplements, and reference explanations
Spatiotemporal Signal Processing with Neural Machine Learning Models
A simple forecasting package
Probabilistic time series modeling in Python
Best practices on recommendation systems
NeuTTS model built from small LLM backbones
On-device TTS model by Neuphonic
Minimal Deep Q-Learning (DQN & DDQN) implementations in Keras
Interface for OuteTTS models
One-click deployment (including offline integration package)
A TTS model capable of generating ultra-realistic dialogue
Plug-and-play library to enable agents to call MCP and UTCP tools