Bash is all you need, write a claude code with only 16 line code
A TTS model capable of generating ultra-realistic dialogue
Audiocraft is a library for audio processing and generation
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Automatically translates the text of a video based on a subtitle file
Real-time voice interactive digital human
Scalable machine learning for time series forecasting
Library for training machine learning models with privacy for data
Z80-μLM is a 2-bit quantized language model
Stable Diffusion with Core ML on Apple Silicon
Implementation of Phenaki Video, which uses Mask GIT
On-device Speech-to-Intent engine powered by deep learning
Omnilingual ASR Open-Source Multilingual SpeechRecognition
LLM-based Reinforcement Learning audio edit model
Python SDK for agent monitoring, LLM cost tracking, benchmarking, etc.
The ChatGPT Retrieval Plugin lets you easily find personal documents
Educational framework exploring multi-agent orchestration
Build applications that make decisions. Chatbots, agents, simulations
A high performance implementation of HDBSCAN clustering
Python framework for adversarial attacks, and data augmentation
Open-source tool designed to enhance the efficiency of workloads
Data Lake for Deep Learning. Build, manage, and query datasets
A refreshing functional take on deep learning
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Open source libraries and APIs to build custom preprocessing pipelines