UniVL is a video-language pretrain model. It is designed with four modules and five objectives for both video language understanding and generation tasks. It is also a flexible model for most of the multimodal downstream tasks considering both efficiency and effectiveness.
Features
- Finetune on YoucookII
- Documentation available
- Examples available
- Run caption task on YoucookII
- Pretrain on HowTo100M
- Licensed under the MIT License
License
MIT LicenseFollow UniVL
Other Useful Business Software
Gemini 3 and 200+ AI Models on One Platform
Build generative AI apps with Vertex AI Studio. Switch between models without switching platforms.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of UniVL!