Ready-to-use OCR with 80+ supported languages
Lets make video diffusion practical
Free open source tool for real-time PC hardware sensor monitoring
Create videos with Stable Diffusion
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Official implementation of DreamCraft3D
Selenium-python but lighter: Helium is the best Python library
Community-maintained approach to improving access to GitHub services
MySQL client library for Python
TikZ figures for concepts in physics/chemistry/ML
Toolkit for running TensorFlow training scripts on SageMaker
Create web-based user interfaces with Python
HY-Motion model for 3D character animation generation
A game engine powered by python and panda3d
Controllable and fast Text-to-Speech for over 7000 languages
PyTorch code and models for VJEPA2 self-supervised learning from video
Large Multimodal Models for Video Understanding and Editing
Open-source infrastructure for Computer-Use Agents. Sandboxes
CLIP, Predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
Personalize Any Characters with a Scalable Diffusion Transformer
Official inference repo for FLUX.1 models
Universal Command Line Interface for Amazon Web Services
Powerful framework for controlling Android and iOS devices
Open-source, code-first Python toolkit for building, evaluating, etc.