AI-data warehouse to enrich, transform and analyze unstructured data
Frameworks for Language Model Reinforcement Learning Environments
Virtual AI anchor that combines state-of-the-art technology
Capable of understanding text, audio, vision, video
Controllable & emotion-expressive zero-shot TTS
The official Python SDK for the ElevenLabs API
A text-to-speech, speech-to-text and speech-to-speech library
Implementation of Video Diffusion Models
The most intuitive, flexible way for researchers to build models
Tiny vision language model
Seamlessly integrate LLMs into scikit-learn
OCR expert VLM powered by Hunyuan's native multimodal architecture
A solution to build and deploy MCP agents and applications
Library to facilitate federated learning research
Python client for Telegram's tdlib
Implementation of a U-Net complete with efficient attention
This repository contains the official implementation of FastVLM
Pushing the Limits of Mathematical Reasoning in Open Language Models
Research code artifacts for Code World Model (CWM)
A Unified Framework for Text-to-3D and Image-to-3D Generation
Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, etc.
Speech-AI-Forge is a project developed around TTS generation models
SGLang is a fast serving framework for large language models
Integrating LLMs into structured NLP pipelines
Machine learning, conversational dialog engine for creating chat bots