HunyuanVideo: A Systematic Framework For Large Video Generation Model
Interact with your SQL database, Natural Language to SQL using LLMs
A Unified Framework for Text-to-3D and Image-to-3D Generation
Industrial-level controllable zero-shot text-to-speech system
A guidance language for controlling large language models
Uncover insights, surface problems, monitor, and fine-tune your LLM
The repository provides code for running inference with SAM 2
Free, high-quality text-to-speech API endpoint to replace OpenAI
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A deep learning toolkit for Text-to-Speech, battle-tested in research
Multi-Voice and Prompt-Controlled TTS Engine
Code for the paper Hybrid Spectrogram and Waveform Source Separation
A large open dataset and tools to speed up MRI scans using ML
A method to increase the speed and lower the memory footprint
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Tools to download and clean up Common Crawl data
Time Series Forecasting Best Practices & Examples
An open-source convolutional neural networks platform for research
Fast, modular reference implementation of Instance Segmentation
A Neural Net Training Interface on TensorFlow, with focus on speed
Source-to-source debuggable derivatives in pure Python