README file generator, powered by AI
Qwen3-omni is a natively end-to-end, omni-modal LLM
The largest collection of PyTorch image encoders / backbones
Towards Human-Sounding Speech
Converts text to speech in realtime
A simple, secure MCP-to-OpenAPI proxy server
The most powerful Android RPA agent framework
Implementation of "MobileCLIP" CVPR 2024
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
Official implementation of Watermark Anything with Localized Messages
Training Large Language Model to Reason in a Continuous Latent Space
Video understanding codebase from FAIR for reproducing video models
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Uniform Manifold Approximation and Projection
Gorilla: An API store for LLMs
Low-code framework for building custom LLMs, neural networks
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Open platform for training, serving, and evaluating language models
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
MMEditing is a low-level vision toolbox based on PyTorch
Modular quant framework
Library to help with training and evaluating neural networks
Concatenate a directory full of files into a single prompt