PyTorch code and models for V-JEPA self-supervised learning from video
A Python package for segmenting geospatial data with the SAM
The unofficial python package that returns response of Google Bard
Aider is AI pair programming in your terminal
Get a ChatGPT plugin up and running in under 5 minutes
LTX-Video Support for ComfyUI
Code to accompany "A Method for Animating Children's Drawings"
"Big Model" trains a visual multimodal VLM with 26M parameters
Ling is a MoE LLM provided and open-sourced by InclusionAI
An Open Source text-to-speech system built by inverting Whisper
A Systematic Framework for Interactive World Modeling
OCR expert VLM powered by Hunyuan's native multimodal architecture
OpenLIT is an open-source LLM Observability tool
Multi-Agent daTa geneRation Infra and eXperimentation framework
Diversity-driven optimization and large-model reasoning ability
Code release for Cut and Learn for Unsupervised Object Detection
CLIP, Predict the most relevant text snippet given an image
Low-code framework for building custom LLMs, neural networks
Open platform for training, serving, and evaluating language models
Library to help with training and evaluating neural networks
Solve end to end problems using Llama model family
A python library for self-supervised learning on images
LLM powered fuzzing via OSS-Fuzz
RL implementations
An API standard for multi-agent reinforcement learning environments