Official repo for Sa2VA: Marrying SAM2 with LLaVA
OCR expert VLM powered by Hunyuan's native multimodal architecture
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Elyra extends JupyterLab with an AI-centric approach
Guiding Instruction-based Image Editing via Multimodal Large Language
Go package for computer vision using OpenCV 4 and beyond
Open-source framework for conversational voice AI agents
Inference script for Oasis 500M
ICLR2024 Spotlight: curation/training code, metadata, distribution
Open source MVVM framework for Web Apps
Flexible Photo Recrafting While Preserving Your Identity
Python package for AutoML on Tabular Data with Feature Engineering
Learning multi-scale deep model correcting over- and under-exposed
AI-Powered Open Source Platform to Easily Build Enterprise Web Apps
Official code for Style Aligned Image Generation via Shared Attention
Creation of a Taylorplot for several machine learning models
Visualize the diagrams of your projects
An extension for using Cursor in Visual Studio Code
A reactive runtime for building durable AI agents
Coframe brings your UX to life with AI-powered optimization
Real-time collision detection and multi-physics simulation for VR
Code release for ConvNeXt model
GLIDE: a diffusion-based text-conditional image synthesis model
Tool for building chatbots, apps and custom integrations