Flexible Photo Recrafting While Preserving Your Identity
Open-source framework for conversational voice AI agents
Lightning fast C++/CUDA neural network framework
OCR expert VLM powered by Hunyuan's native multimodal architecture
Chat & pretrained large vision language model
airda (Air Data Agent)
Virtual AI anchor that combines state-of-the-art technology
Visual Automation IDE — automate anything you see on screen
AçorOS: Debian with multiple desktops, easy for beginners.
AI Powered Open Source Platform to Easily Build Enterprise Web Apps
Plug-n-play module turning text-to-image models into animation
Visual Instruction Tuning: Large Language-and-Vision Assistant
Visual AI Workflow Builder
Visualize the diagrams of your projects
Visual Studio Code client for Tabnine
Design system skills for agentic tools
computer vision projects | Fun AI projects related to computer vision
Guiding Instruction-based Image Editing via Multimodal Large Language
Open-source tool to visualise your RAG
CS2, Valorant, Fortnite, APEX, every game
Library of self-supervised methods for visual representation
Learning multi-scale deep model correcting over- and under-exposed
Official code for Style Aligned Image Generation via Shared Attention