Chat & pretrained large vision language model
Repo of Qwen2-Audio chat & pretrained large audio language model
An orchestration platform for the development, production
The Model Zoo of cognitive diagnosis models
Command-line tool to delete merged Git branches
Douyin TikTok Download API
High quality, fast, modular reference implementation of SSD in PyTorch
Hub of ready-to-use datasets for ML models
Rules engine for cloud security, cost optimization, and governance
Enables the best performance on NVIDIA RTX Graphics Cards
Spanish-language course repository that teaches fundamentals of SQL
Rename anything
High-quality implementations of standard and SOTA methods
Fast and accurate AI powered file content types detection
A simple, secure MCP-to-OpenAPI proxy server
The most powerful Android RPA agent framework
Implementation of "MobileCLIP" CVPR 2024
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
VMZ: Model Zoo for Video Modeling
Mobile manipulation research tools for roboticists
CoreNet: A library for training deep neural networks
High-resolution models for human tasks
Video understanding codebase from FAIR for reproducing video models
Towards Real-World Vision-Language Understanding