HunyuanVideo: A Systematic Framework For Large Video Generation Model
YOLOv5 is the world's most loved vision AI
A generative speech model for daily dialogue
Code release for Cut and Learn for Unsupervised Object Detection
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
The official Meta Llama 3 GitHub site
Large Multimodal Models for Video Understanding and Editing
A simple native web interface that uses ChatTTS to synthesize text
An experimental version of DeepSeek model
Models for the spaCy Natural Language Processing (NLP) library
CodeGeeX4-ALL-9B, a versatile model for all AI software development
An open source implementation of CLIP
Official inference library for Mistral models
Ling is a MoE LLM provided and open-sourced by InclusionAI
Enable AI to control your desktop, mobile and HMI devices
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
Improve human sleep through scientifically
Gorilla: An API store for LLMs
E2M converts various file types (doc, docx, epub, html, htm, url
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
State-of-the-art (SoTA) text-to-video pre-trained model
Inference code for CodeLlama models
Data science on data without acquiring a copy
GUI Exploration Lab. One of the best GUI agent solutions