Instant voice cloning by MIT and MyShell. Audio foundation model
Industrial-level controllable zero-shot text-to-speech system
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
Models for the spaCy Natural Language Processing (NLP) library
Create UIs for your machine learning model in Python in 3 minutes
Datasets, transforms and models specific to Computer Vision
SAPIEN Manipulation Skill Framework
TTS with Kokoro and ONNX Runtime
Lightweight Python library for adding real-time multi-object tracking
When LLM Meets Domain Experts
Reference PyTorch implementation and models for DINOv3
ktrain is a Python library that makes deep learning AI more accessible
A Python library for audio data augmentation
PyTorch domain library for recommendation systems
Qlib is an AI-oriented quantitative investment platform
Open source driver assistance system
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Open-source large language model family from Tencent Hunyuan
A Repo For Document AI
Machine learning metrics for distributed, scalable PyTorch applications
A Powerful Native Multimodal Model for Image Generation
MII makes low-latency and high-throughput inference possible
Data loaders and abstractions for text and NLP
The Open Source Memory Layer For Autonomous Agents