Images to inference with no labeling
Taming Stable Diffusion for Lip Sync
Speech-AI-Forge is a project developed around TTS generation models
Agent framework and applications built upon Qwen>=3.0
Helping you get the most out of AWS, wherever you use MCP
A TTS that fits in your CPU (and pocket)
The Simple Agent Development Kit
Synthetic data generators for tabular and time-series data
Trainable models and NN optimization tools
Decentralized deep learning in PyTorch. Built to train models
A comprehensive set of fairness metrics for datasets
Data loaders and abstractions for text and NLP
Streamline your ML workflow
Open deep learning compiler stack for CPU, GPU, etc.
Geometric deep learning extension library for PyTorch
High-performance library for gradient boosting on decision trees
Provider-agnostic, open-source evaluation infrastructure
Controllable and fast Text-to-Speech for over 7000 languages
Foundational model for human-like, expressive TTS
Towards Human-Level Text-to-Speech through Style Diffusion
A TTS model capable of generating ultra-realistic dialogue
An open-source end-to-end VLM-based GUI Agent
Solve end-to-end problems using the Llama model family
Interact with your SQL database: natural language to SQL using LLMs
Monte Carlo tree search in JAX