Open-weight, large-scale hybrid-attention reasoning model
Multilingual Document Layout Parsing in a Single Vision-Language Model
Build multimodal AI applications with cloud-native stack
The official implementation of RAPTOR
Ready-to-run cloud templates for RAG
Code for the paper "Evaluating Large Language Models Trained on Code"
Integrate ChatGPT into your own discord bot
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Open source AI model for generating full songs from lyrics prompts
Korvus is a search SDK that unifies the entire RAG pipeline
MII makes low-latency and high-throughput inference possible
State-of-the-art diffusion models for image and audio generation
Implementation of Make-A-Video, new SOTA text to video generator
Autonomous LLM agent for end-to-end data science workflows
AI Slack bot for reading, summarizing, and chatting with content
A Personalized LLM-powered Agent Frameworks
AI framework for automated short video creation and editing tools
Play couplet with seq2seq model
Running large language models on a single GPU
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
LongBench v2 and LongBench (ACL 25'&24')
Traditional Mandarin LLMs for Taiwan
Benchmark LLMs by fighting in Street Fighter 3
LISA: Reasoning Segmentation via Large Language Model
Skywork-R1V is an advanced multimodal AI model series