A text-to-speech, speech-to-text and speech-to-speech library
Data processing for and with foundation models
Synthetic Data Generation for tabular, relational and time series data
AI Toolkit for Healthcare Imaging
OpenSpace: Make Your Agents: Smarter, Low-Cost, Self-Evolving
A sound cloning tool with a web interface, using your voice
A fast library for AutoML and tuning
Automated translation solution for visual novels
Develop software autonomously
Pre-trained Deep Learning models and demos
The repository provides code for running inference with SAM 2
Your open-source LLM evaluation toolkit
Generate high-definition story short videos with one click using AI
Converts text to speech in realtime
Helping you get the most out of AWS, wherever you use MCP
Generate blog articles from video or audio
GLM-4 series: Open Multilingual Multimodal Chat LMs
Achieving 3+ generation speedup on reasoning tasks
Enhances Tesseract OCR output using LLMs (local or API)
Advanced techniques for RAG systems
PyTorch code and models for the DINOv2 self-supervised learning
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Open-source choice to scale, assess and maintain natural language data
Video Object and Interaction Deletion
Bridging LLM and Recommender System