Full-stack AI Red Teaming platform
Qwen3-ASR is an open-source series of ASR models
Document Index for Vectorless, Reasoning-based RAG
Recovering the Visual Space from Any Views
AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories
Spark-TTS Inference Code
Swing Music is a beautiful, self-hosted music player
Multi-agent autonomous startup system for Claude Code
Claude Code skill implementing Manus-style persistent planning
Connect any LLM to your internal knowledge sources
Socket.IO integration for Flask applications
Official repository for LTX-Video
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Inspiration database for Internet practitioners with no ads
Google CTF
Fast and accurate AI powered file content types detection
VMZ: Model Zoo for Video Modeling
Towards Real-World Vision-Language Understanding
Code for the paper "Evaluating Large Language Models Trained on Code"
Tool for exploring and debugging transformer model behaviors
CLIP, Predict the most relevant text snippet given an image
Contains the code for CM-SS13
A TLS MITM proxy for Non-HTTP traffic, with support for TLS upgrades
The NVIDIA AgentIQ toolkit is an open-source library
A framework to enable multimodal models to operate a computer