Official PyTorch Implementation
Open-source infrastructure for Computer-Use Agents. Sandboxes
From Images to High-Fidelity 3D Assets
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Utilities intended for use with Llama models
Offline Text To Speech synthesis for python
An implementation of a deep learning recommendation model (DLRM)
Reference PyTorch implementation and models for DINOv3
Awesome multilingual OCR toolkits based on PaddlePaddle
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Python scraper based on AI
Long-form streaming TTS system for multi-speaker dialogue generation
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Framework for building neural networks
Official DeiT repository
Scalable generative AI framework built for researchers and developers
Document Image Parsing via Heterogeneous Anchor Prompting”
ContextGem: Effortless LLM extraction from documents
Open-source, code-first Python toolkit for building, evaluating, etc.
Semantic search and workflows for medical/scientific papers
A SOTA open-source image editing model
Volcano Engine Reinforcement Learning for LLMs
Definitions for AI/ML tasks like dataset creation
Flexible Photo Recrafting While Preserving Your Identity
The largest collection of PyTorch image encoders / backbones