NLP Cloud serves high performance pre-trained or custom models for NER
Implementation of Phenaki Video, which uses MaskGIT
An open source object detection toolbox based on PyTorch
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Bring the notion of Model-as-a-Service to life
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Easy-to-use and powerful NLP library with awesome model zoo
The Multi-Agent Framework
A Systematic Framework for Interactive World Modeling
Open source libraries and APIs to build custom preprocessing pipelines
A chatbot built on a large model
Convert AI papers to GUI
Generate high-definition short story videos with one click using AI
A Universal Customization Method for Single and Multi Conditioning
A state-of-the-art open visual language model
AI-data warehouse to enrich, transform and analyze unstructured data
The data structure for multimodal data
Implementation of Vision Transformer, a simple way to achieve SOTA
Data Lake for Deep Learning. Build, manage, and query datasets
Datasets, transforms and models specific to Computer Vision
PyTorch code and models for the DINOv2 self-supervised learning
PaddlePaddle End-to-End Development Toolkit
Implementation of Make-A-Video, new SOTA text-to-video generator