Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Python library and CLI tool to interface with Google Translate
The official Python SDK for the ElevenLabs API
Multi-lingual large voice generation model, providing inference
AI discovers 520,000 stable inorganic crystal structures for research
Run Llama 2 inference in one file of pure C
The repository provides code for running inference with SAM 2
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Open Source Document Management System for Digital Archives
Python SDK for the Computer Use model Lux, developed by OpenAGI
Large Multimodal Models for Video Understanding and Editing
OCR expert VLM powered by Hunyuan's native multimodal architecture
Chat & pretrained large audio language model proposed by Alibaba Cloud
On-device Speech-to-Intent engine powered by deep learning
Curated list of classic, high-quality computer science books
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Structured RAG: ingest, index, query
Shared repository for open-sourced projects from the Google AI Lang
A package for the sparse identification of nonlinear dynamical systems
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
Master the fundamentals of machine learning, deep learning
Numerical differential equation solvers in JAX
Helps data scientists define testable self-documenting dataflows
AI tool that removes hardcoded subtitles and text from videos locally
Unsupervised Learning for Image Registration