An Open Source text-to-speech system built by inverting Whisper
Towards Human-Sounding Speech
Interface for OuteTTS models
MARS5 speech model (TTS) from CAMB.AI
A command-line utility for taking automated screenshots of websites
This repository provides an advanced RAG
MetricFlow allows you to define, build, and maintain metrics in code
Chinese and English multimodal conversational language model
Library providing end-to-end GPU-accelerated recommender systems
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Open Source Differentiable Computer Vision Library
Build cross-modal and multimodal applications on the cloud
Static code analysis
Rules engine for cloud security, cost optimization, and governance
Python module that helps you build complex pipelines of batch jobs
Cross-platform application monitoring and error tracking software
Experimental, AI/ML-powered and open sourced Marketing Mix Modeling
LLM-based agent for general purpose software engineering tasks
High-Fidelity and Controllable Generation of Textured 3D Assets
Multi-modal large language model designed for audio understanding
A minimal yet professional single agent demo project
Scalable machine learning for time series forecasting
Chat & pretrained large audio language model proposed by Alibaba Cloud
Run LLMs locally on Cloud Workstations
An advanced paper search agent powered by large language models