Unified Multimodal Understanding and Generation Models
Collection of common code shared among different research projects
PyTorch code and models for VJEPA2 self-supervised learning from video
Language modeling in a sentence representation space
How to improve NGINX performance, security, and other important things
Evals is a framework for evaluating LLMs and LLM systems
Educational framework exploring multi-agent orchestration
AV1 Image File Format Specification - ISO-BMFF/HEIF derivative
AI-powered MCP server for desktop file and terminal automation
Examples and guides for using the OpenAI API
Retrieval Augmented Generation (RAG) framework
Request recommended movies, TV shows and anime to Jellyseer/Overseer
GitLab automatic code review tool based on large models
This repo contains the code for 1D tokenizer and generator
A Universal Customization Method for Single and Multi Conditioning
A Unified Framework for Image Customization
Flexible Photo Recrafting While Preserving Your Identity
Multi-Agent daTa geneRation Infra and eXperimentation framework
MARS5 speech model (TTS) from CAMB.AI
A command-line utility for taking automated screenshots of websites
This repository provides an advanced RAG
MetricFlow allows you to define, build, and maintain metrics in code
LLM powered fuzzing via OSS-Fuzz
Zig game engine & graphics toolkit
Toolkit for audio, music, and speech generation