Simulation framework for accelerating research
Multilingual speech recognition and audio understanding model
Free, high-quality text-to-speech API endpoint to replace OpenAI
Let's make video diffusion practical
Extensible, parallel implementations of t-SNE
A Web UI for easy subtitle generation using the Whisper model
This repo contains the code for a 1D tokenizer and generator
Investment Research for Everyone, Everywhere
The repository provides code for running inference with SAM 2
Deep learning optimization library: makes distributed training easy
Open-source browser using agents
A guidance language for controlling large language models
Multilingual sentence & image embeddings with BERT
Terminal-based LLM chat tool with multi-model and local support
Pluggable SOTA multi-object tracking modules for segmentation
Spark-TTS Inference Code
Sparsity-aware deep learning inference runtime for CPUs
A high-quality rapid TTS voice cloning model
Library for OCR-related tasks powered by Deep Learning
Industrial-level controllable zero-shot text-to-speech system
Interact with your SQL database: natural language to SQL using LLMs
High-Quality Voice Cloning TTS for 600+ Languages
Block Diffusion for Ultra-Fast Speculative Decoding
Algorithmic Trading in Python with Machine Learning
Sharp Monocular Metric Depth in Less Than a Second