Reinforcement learning environments framework for language models
ContextGem: Effortless LLM extraction from documents
Controllable & emotion-expressive zero-shot TTS
Virtual AI anchor that combines state-of-the-art technology
The official Python SDK for the ElevenLabs API
A text-to-speech, speech-to-text and speech-to-speech library
Capable of understanding text, audio, vision, and video
Implementation of Video Diffusion Models
The most intuitive, flexible way for researchers to build models
Tiny vision language model
Library to facilitate federated learning research
Python client for Telegram's tdlib
Seamlessly integrate LLMs into scikit-learn
Implementation of a U-net complete with efficient attention
OCR expert VLM powered by Hunyuan's native multimodal architecture
This repository contains the official implementation of FastVLM
Pushing the Limits of Mathematical Reasoning in Open Language Models
Research code artifacts for Code World Model (CWM)
A solution to build and deploy MCP agents and applications
Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, etc.
A Unified Framework for Text-to-3D and Image-to-3D Generation
SGLang is a fast serving framework for large language models
Integrating LLMs into structured NLP pipelines
Speech-AI-Forge is a project developed around TTS generation models
Machine-learning conversational dialog engine for creating chatbots