Large Multimodal Models for Video Understanding and Editing
A subtitle generator for Japanese Adult Videos.
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
FAIR's research platform for object detection research
Let us control diffusion models
This repository contains the official implementation of research
Resources, corpora, and tools for Chinese natural language processing
Enable sending and receiving images during chatting
Flash enables you to easily configure and run complex AI recipes
Implementation of NÜWA, attention network for text to video synthesis
OpenMMLab Video Perception Toolbox
Implementation of BEVFormer, a camera-only framework
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
Convolutional Neural Network for 3D meshes in PyTorch
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Next-generation platform for object detection and segmentation
DeepImageTranslator: a deep-learning utility for image translation
Semantic segmentation models, datasets & losses implemented in PyTorch
Automatically remove the mosaics in images and videos, or add mosaics
Auto-diff neural network library for high-dimensional sparse tensors
Gluon CV Toolkit
Chinese synonyms, chat robot, intelligent question and answer toolkit
Pre-trained and Reproduced Deep Learning Models
fastNLP: A Modularized and Extensible NLP Framework