MARS5 speech model (TTS) from CAMB.AI
An open-source, modern-design AI chat framework
Foundational model for human-like, expressive TTS
Implementations for various Generative AI Agent techniques
Sharp Monocular Metric Depth in Less Than a Second
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Python example app from the OpenAI API quickstart tutorial
Provides convenient access to the Anthropic REST API from any Python 3
Python SDK for the Computer Use model Lux, developed by OpenAGI
Interaction model for connecting buyers to complete purchases
Interview guide for machine learning, mathematics, and deep learning
A collection of various deep learning architectures, models, and tips
A simple, secure MCP-to-OpenAPI proxy server
The most powerful Android RPA agent framework
Implementation of "MobileCLIP" (CVPR 2024)
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
Video understanding codebase from FAIR for reproducing video models
🤖 Assemble, configure & deploy autonomous AI Agents in your browser
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
C++ inference library for multiple SVC/TTS
Provides CTP stock options and Zhongtai Securities XTP
The TypeScript AI agent framework
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second