Framework for building real-time voice and multimodal AI agents
HivisionIDPhotos: a lightweight and efficient AI ID photo tool
Designed for training LLM/VLM agents via RL
Build AI WhatsApp Bots with Pure Python
Bringing BERT into modernity via both architecture changes and scaling
A lightweight framework for building LLM-based agents
SDG is a specialized framework
LISA: Reasoning Segmentation via Large Language Model
Build a large language model from scratch with only basic Python knowledge
Accelerate local LLM inference and finetuning
No-code LLM Platform to launch APIs and ETL Pipelines
The absolute trainer to light up AI agents
Simplifies the local serving of AI models from any source
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Official inference library for Mistral models
Extensible, parallel implementations of t-SNE
Convert codebases into structured prompts optimized for LLM analysis
Run all your local AI together in one package
A voice cloning tool with a web interface, using your voice
Sharp Monocular Metric Depth in Less Than a Second
The leading agent orchestration platform for Claude
Generating Immersive, Explorable, and Interactive 3D Worlds
A library for scientific machine learning & physics-informed learning