Wan2.2: Open and Advanced Large-Scale Video Generative Model
ComfyUI wrapper nodes for HunyuanVideo
Visual Causal Flow
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
A Telegram RSS bot that cares about your reading experience
An AI-powered file management tool that ensures privacy
Qwen-Image is a powerful image generation foundation model
This repo contains the code for 1D tokenizer and generator
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
Visual intelligence for your home.
Qwen2.5-VL is the multimodal large language model series
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
A Repo For Document AI
An unsupervised and free tool for image and video dataset analysis
Secure local-first microVM sandbox for running untrusted code fast
Enhances Tesseract OCR output using LLMs (local or API)
Distill your ex into an AI Skill
Document content and metadata extraction microservice
A python library for self-supervised learning on images
Native and Compact Structured Latents for 3D Generation
Unified Multimodal Understanding and Generation Models
Package and deploy machine learning models using Docker containers
Multimodal embedding and reranking models built on Qwen3-VL
Collection of Gemma 3 variants that are trained for performance
Official implementation of Watermark Anything with Localized Messages