Portia Labs Python SDK for building agentic workflows
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Spark-TTS Inference Code
Diffusion Transformer with Fine-Grained Chinese Understanding
Build Vision Agents quickly with any model or video provider
Qwen2.5-VL is the multimodal large language model series
A batteries-included library for building AI-powered software
One-click deployment (including offline integration package)
Pokee Deep Research Model Open Source Repo
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Tools like web browser, computer access and code runner for LLMs
Time-lapse Video Generation Models as Metamorphic Simulators
Deploy and share agents with open infrastructure
Real-time voice interactive digital human
Habit Tracker for the AI Coding Workshop
SWE-agent takes a GitHub issue and tries to automatically fix it
Language modeling in a sentence representation space
Speech-AI-Forge is a project developed around TTS generation model
Repo of Qwen2-Audio chat & pretrained large audio language model
A neural network that transforms a design mock-up into static websites
A minimal yet professional single agent demo project
An unsupervised and free tool for image and video dataset analysis
Qwen3-omni is a natively end-to-end, omni-modal LLM
Your Fully-Automated Personal AI Assistant