Visual Studio Code client for Tabnine
A neural network that transforms a design mock-up into a static website
This repo contains the code for the 1D tokenizer and generator
Code for running inference and finetuning with the SAM 3 model
LTX-Video Support for ComfyUI
Extensible workflow development framework
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Python inference and LoRA trainer package for the LTX-2 audio–video
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Guiding Instruction-based Image Editing via Multimodal Large Language
Inference script for Oasis 500M
ICLR2024 Spotlight: curation/training code, metadata, distribution
Learning multi-scale deep model correcting over- and under-exposed
Flexible Photo Recrafting While Preserving Your Identity
Official code for Style Aligned Image Generation via Shared Attention
Code release for the ConvNeXt model
A real-time approach for mapping all human pixels of 2D RGB images
Simulating worlds in a computer
OpenAI’s compact 20B open model for fast, agentic, and local use
OpenAI’s open-weight 120B model optimized for reasoning and tooling
Vision-language-action model for robot control via images and text