Sharp Monocular Metric Depth in Less Than a Second
Communicate with an LLM provider using a single interface
Fast image augmentation library and an easy-to-use wrapper
A library for deep learning end-to-end dialog systems and chatbots
Framework for validating and controlling LLM outputs in AI apps
Multi-modal large language model designed for audio understanding
OCR expert VLM powered by Hunyuan's native multimodal architecture
Set of tools to assess and improve LLM security
SDK for building interactive UI components over MCP for AI tools
A Survey of Large Language Models
Open speech-to-speech models and pipelines by Hugging Face
A PyTorch-based Speech Toolkit
Llama Chinese community, real-time aggregation
RAG Search API
AIConfig is a config-based framework to build generative AI apps
A Personalized LLM-powered Agent Framework
Streamlines and simplifies prompt design for both developers
Instruction-tuning LLM with Chinese Medical Knowledge
Robust recipes to align language models with human and AI preferences
LLM-based automatic question answering for local knowledge bases
A library to communicate with ChatGPT, Claude, Copilot, and Gemini
High-Resolution Image Synthesis with Latent Diffusion Models
Run LLMs locally on Cloud Workstations
Software that uses AI to perform real-time voice conversion
A system for quickly generating training data with weak supervision