Build Vision Agents quickly with any model or video provider
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Helping you get the most out of AWS, wherever you use MCP
Open Source TypeScript AI Agent Framework
MII makes low-latency and high-throughput inference possible
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Toolkit for conversational AI
A library for deep learning end-to-end dialog systems and chatbots
.NET Client for Telegram Bot API
A Python toolbox for scalable outlier detection
Stanford NLP Python library for many human languages
High-Fidelity and Controllable Generation of Textured 3D Assets
State-of-the-art (SoTA) text-to-video pre-trained model
OCR expert VLM powered by Hunyuan's native multimodal architecture
Benchmarking synthetic data generation methods
The Cloud-Native API Gateway
An MCP server that provides fast file searching capabilities
Swirl queries any number of data sources with APIs
Extensible workflow development framework
A Non-Official OpenAI RESTful API Client for DotNet
C++ library for Telegram bot API
Evaluate and monitor ML models from validation to production