Controllable and fast Text-to-Speech for over 7000 languages
DeepMind model for tracking arbitrary points across videos & robotics
Code for Mesh R-CNN, ICCV 2019
Designed for text embedding and ranking tasks
The standard data-centric AI package for data quality and ML
Qiling Advanced Binary Emulation Framework
AI Toolkit for Healthcare Imaging
The Triton Inference Server provides an optimized cloud
AutoGluon: AutoML for Image, Text, and Tabular Data
Best practices on recommendation systems
A SOTA open-source image editing model
FAIR Chemistry's library of machine learning methods for chemistry
MII makes low-latency and high-throughput inference possible
A free (libre), open-source mobile OS for Ethereum
Build AI-powered semantic search applications
Python module that helps you build complex pipelines of batch jobs
Deep learning library
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For Sa2VA: Marrying SAM2 with LLaVA
Multi-modal large language model designed for audio understanding
Physical Symbolic Optimization
Build your chatbot within minutes on your favorite device
Capable of understanding text, audio, vision, video
Django-friendly finite state machine support