Fast and accurate AI-powered file content type detection
Less Code, Lower Barrier, Faster Deployment
PPTAgent: Generating and Evaluating Presentations
A simple, secure MCP-to-OpenAPI proxy server
Official implementation of Watermark Anything with Localized Messages
Video understanding codebase from FAIR for reproducing video models
Implements weak-to-strong learning for training stronger ML models
Multimodal Diffusion with Representation Alignment
Extensible AGI Framework
LLM-based autonomous agent that does comprehensive online research
Harness LLMs with Multi-Agent Programming
Superfast AI decision-making and processing of multi-modal data
Open source no-code system for text annotation and building of text
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Conditional GAN for generating synthetic tabular data
Modular quant framework
Deep learning optimization library making distributed training easy
OCR expert VLM powered by Hunyuan's native multimodal architecture
airda (Air Data Agent)
Spatiotemporal Signal Processing with Neural Machine Learning Models
Data science on data without acquiring a copy
Python package built to ease deep learning on graph
Technical principles related to large models
[NeurIPS 2023 Spotlight] LightZero
Advanced evolutionary computation library built on top of PyTorch