Visual Studio Code client for Tabnine
A neural network that transforms a design mock-up into static websites
This repo contains the code for 1D tokenizer and generator
Code for running inference and finetuning with SAM 3 model
LTX-Video Support for ComfyUI
Extensible workflow development framework
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Guiding Instruction-based Image Editing via Multimodal Large Language
Python inference and LoRA trainer package for the LTX-2 audio–video
Learning multi-scale deep model correcting over- and under- exposed
Inference script for Oasis 500M
ICLR2024 Spotlight: curation/training code, metadata, distribution
Flexible Photo Recrafting While Preserving Your Identity
Official code for Style Aligned Image Generation via Shared Attention
AI Powered Open Source Platform to Easily Build Enterprise Web Apps
Code release for ConvNeXt model
A real-time approach for mapping all human pixels of 2D RGB images
Simulating worlds in a computer
OpenAI’s compact 20B open model for fast, agentic, and local use
OpenAI’s open-weight 120B model optimized for reasoning and tooling
Vision-language-action model for robot control via images and text