Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
YOLOv5 is the world's most loved vision AI
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A generative speech model for daily dialogue
Code release for Cut and Learn for Unsupervised Object Detection
Large Multimodal Models for Video Understanding and Editing
The official Meta Llama 3 GitHub site
Official inference library for Mistral models
An experimental version of DeepSeek model
Models for the spaCy Natural Language Processing (NLP) library
An open source implementation of CLIP
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
CodeGeeX4-ALL-9B, a versatile model for all AI software development
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Enable AI to control your desktop, mobile and HMI devices
Inference code for CodeLlama models
Data science on data without acquiring a copy
A simple native web interface that uses ChatTTS to synthesize text
Improve human sleep through scientifically
Ling is a MoE LLM provided and open-sourced by InclusionAI
Gorilla: An API store for LLMs
GUI Exploration Lab. One of the best GUI agent solutions
Building a Secure and Interoperable Future for AI-Driven Payments
E2M converts various file types (doc, docx, epub, html, htm, url