ClawdBot one-click deployment tool
A simple native web interface that uses ChatTTS to synthesize text
Clone a voice in 5 seconds to generate arbitrary speech in real time
Ready-to-use OCR with 80+ supported languages
Ling is a MoE LLM provided and open-sourced by InclusionAI
Controllable and fast Text-to-Speech for over 7000 languages
Curated list of datasets and tools for post-training
The most intuitive, flexible way for researchers to build models
Defang CLI and sample projects
A straightforward method for training your LLM
Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
Code for running inference with the SAM 3D Body Model (3DB)
Async PHP client/server API for the Telegram MTProto protocol
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Powerful open source image generation model
Code for the paper "Language Models are Unsupervised Multitask Learners"
Inference code for Llama models
Semantic segmentation models, datasets & losses implemented in PyTorch
Large-scale autoregressive pixel model for image generation by OpenAI
Code for the paper "On First-Order Meta-Learning Algorithms"
Deep learning person re-identification in PyTorch
PyTorch implementation of BigGAN with pretrained weights
3D ResNets for Action Recognition (CVPR 2018)
A powerful and intuitive WYSIWYG to create Machine Learning models
Devanagari fonts traineddata for Tesseract OCR