GLM-4 series: Open Multilingual Multimodal Chat LMs
A simple, secure MCP-to-OpenAPI proxy server
The most powerful Android RPA agent framework
Implementation of "MobileCLIP" CVPR 2024
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
Training Large Language Model to Reason in a Continuous Latent Space
Video understanding codebase from FAIR for reproducing video models
CLIP, Predict the most relevant text snippet given an image
Talk to Your AI Agents from Anywhere
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Supercharge Your LLM Application Evaluations
A system for quickly generating training data with weak supervision
Uniform Manifold Approximation and Projection
Automate browser-based workflows with LLMs and Computer Vision
Gorilla: An API store for LLMs
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
Time series forecasting with PyTorch
MMEditing is a low-level vision toolbox based on PyTorch
Library to help with training and evaluating neural networks
21 Lessons, Get Started Building with Generative AI
Lightweight framework for evaluating large language model performance
A command-line productivity tool powered by AI large language models
ChatGPT interface with better UI
Global weather forecasting model using graph neural networks and JAX