LTX-Video Support for ComfyUI
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
General-purpose image editing model that delivers high-fidelity
Outcome-driven agent development framework that evolves
A personal context-agent that learns how you work
State-of-the-art (SoTA) text-to-video pre-trained model
Framework for building real-time multimodal voice AI agent apps
Open-source MCP server that gives your coding agent
Sharp Monocular Metric Depth in Less Than a Second
A Customizable Image-to-Video Model based on HunyuanVideo
Accelerate local LLM inference and finetuning
Experimental, AI/ML-powered, open-source Marketing Mix Modeling
Diffusion Transformer with Fine-Grained Chinese Understanding
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Democratizing AI scientists with ToolUniverse
AI video generator optimized for low-VRAM and older GPUs
Chat with your documents using local AI
A command-line productivity tool powered by AI large language models
Official inference library for Mistral models
Fast multimodal LLM for real-time voice interaction and AI apps
AI multi-agent framework for automating data-driven R&D workflows
An MCP server for VikingDB storage and search
CLIP: Predict the most relevant text snippet given an image
A solution to build and deploy MCP agents and applications
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)