Interface for OuteTTS models
MARS5 speech model (TTS) from CAMB.AI
Plug-and-play library to enable agents to call MCP and UTCP tools
This repository provides an advanced RAG
An MCP server that autonomously evaluates web applications
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Repository for the Qwen2-Audio chat and pretrained large audio language models
Helping you get the most out of AWS, wherever you use MCP
Python package for AutoML on tabular data with feature engineering
Lightweight Python library for adding real-time multi-object tracking
MII makes low-latency and high-throughput inference possible
Toloka-Kit is a Python library for working with the Toloka API
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
The data structure for multimodal data
Django-friendly finite state machine support
A fast library for AutoML and tuning
Jittor is a high-performance deep learning framework
Implementation of Imagen, Google's Text-to-Image Neural Network
Open Source Differentiable Computer Vision Library
Fast image augmentation library and an easy-to-use wrapper
Build cross-modal and multimodal applications on the cloud
A library for deep learning end-to-end dialog systems and chatbots
A multi-function Discord bot