Document Image Parsing via Heterogeneous Anchor Prompting
Implementation of Vision Transformer, a simple way to achieve SOTA
4M: Massively Multimodal Masked Modeling
Refer and Ground Anything Anywhere at Any Granularity
A Model Context Protocol (MCP) Gateway & Registry
Hackable and optimized Transformers building blocks
AI-powered tool for developers, simplifying coding tasks
A library for accelerating Transformer models on NVIDIA GPUs
Open-source choice to scale, assess and maintain natural language data
Trainable models and NN optimization tools
End-to-End Library for Continual Learning based on PyTorch
Easy-to-use, modular and extensible package of deep-learning models
Probabilistic reasoning and statistical analysis in TensorFlow
A feature-rich Discord Modmail bot
Deep universal probabilistic programming with Python and PyTorch
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Full stack AI software engineer
Open platform connecting AI agents to tools via unified MCP server
Multi-user UI tool for managing and running Stable Diffusion workflows
Ship AI Agents to Google Cloud in minutes, not months
AI-powered document analysis and tagging for Paperless-ngx
Context-aware desktop AI assistant that understands screen content
Multilingual speech recognition and audio understanding model
Enterprise platform for building and orchestrating AI agent workflows