gpt-4o for windows, macos and linux
The most powerful local music generation model
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
AutoGluon: AutoML for Image, Text, and Tabular Data
A Python wrapper you can't refuse
A lightweight audio-to-MIDI converter with pitch bend detection
An Open Source implementation of Notebook LM with more flexibility
From Images to High-Fidelity 3D Assets
Speech recognition module for Python
A Lightweight Face Recognition and Facial Attribute Analysis
Code for running inference with the SAM 3D Body Model 3DB
Fast-stable-diffusion + DreamBooth
Source code of PyGAD, Python 3 library for building genetic algorithms
Awesome multilingual OCR toolkits based on PaddlePaddle
Personalize Any Characters with a Scalable Diffusion Transformer
Models for object and human mesh reconstruction
A simple but complete full-attention transformer
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A simple, high-quality voice conversion tool focused on ease of use
The most powerful and modular diffusion model GUI, api and backend
A fast library for AutoML and tuning
Open-source MCP server that gives your coding agent
State-of-the-art Parameter-Efficient Fine-Tuning
OpenCompass is an LLM evaluation platform
Synchronized Translation for Videos