Public opinion analysis system
Simplifies the local serving of AI models from any source
A Unified Framework for Text-to-3D and Image-to-3D Generation
Personalize Any Characters with a Scalable Diffusion Transformer
WhatsApp MCP server enabling AI access to chats and messaging
AI framework for automated short video creation and editing tools
Local RAG engine for private multimodal knowledge search on devices
Advanced AI Explainability for computer vision
AI assistant based on large models that can actively think and plan
Stable Diffusion web UI
Generate Any 3D Scene in Seconds
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
A PyTorch library for implementing flow matching algorithms
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A feature rich discord Modmail bot
Gracefully face hCaptcha challenge with multimodal llms
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Context data platform for building observable, self-learning AI agents
Data Lake for Deep Learning. Build, manage, and query datasets
PyTorch extensions for fast R&D prototyping and Kaggle farming
A lightweight vision library for performing large object detection
Create HTML profiling reports from pandas DataFrame objects
A Universal Customization Method for Single and Multi Conditioning