FAIR Sequence Modeling Toolkit 2
code for Mesh R-CNN, ICCV 2019
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
PyTorch code and models for VJEPA2 self-supervised learning from video
GPT4V-level open-source multi-modal model based on Llama3-8B
Proofs, cases, concept supplements, and reference explanations
PyTorch extensions for fast R&D prototyping and Kaggle farming
Outcome driven agent development framework that evolves
A SOTA open-source image editing model
Multi-Agent daTa geneRation Infra and eXperimentation framework
This repository provides an advanced RAG
Chinese and English multimodal conversational language model
A fast library for AutoML and tuning
Hub of ready-to-use datasets for ML models
Build cross-modal and multimodal applications on the cloud
Deep learning library
Python SDK for the Computer Use model Lux, developed by OpenAGI
Powering Amazon custom machine learning chips
Free, high-quality text-to-speech API endpoint to replace OpenAI
Full stack AI software engineer
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Automatically Visualize any dataset, any size
An efficient forwarding service designed for LLMs
Chinese Llama-3 LLMs) developed from Meta Llama 3
High-Resolution Image Synthesis with Latent Diffusion Models