AI framework for automated short video creation and editing tools
Play couplet with seq2seq model
Running large language models on a single GPU
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
LongBench v2 and LongBench (ACL 25'&24')
Traditional Mandarin LLMs for Taiwan
Benchmark LLMs by fighting in Street Fighter 3
LISA: Reasoning Segmentation via Large Language Model
Skywork-R1V is an advanced multimodal AI model series
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Self-learning data agent that grounds its answers in layers of content
Chinese Llama-3 LLMs) developed from Meta Llama 3
The best ChatGPT that $100 can buy
A Model Context Protocol server for searching and analyzing arXiv
Refer and Ground Anything Anywhere at Any Granularity
ICLR2024 Spotlight: curation/training code, metadata, distribution
Official implementation of DreamCraft3D
Towards Real-World Vision-Language Understanding
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
LLM-based agent for general purpose software engineering tasks
Practical productivity tools for Claude Code, Codex-CLI
LLM Large Model of Selling Anchor
Generative AI reference workflows
Context data platform for building observable, self-learning AI agents
Language modeling in a sentence representation space