Hypernetworks that adapt LLMs for specific benchmark tasks
Qwen3-omni is a natively end-to-end, omni-modal LLM
Unifying 3D Mesh Generation with Language Models
Code and models for ICML 2024 paper, NExT-GPT
Large-language-model & vision-language-model based on Linear Attention
Knowledge Graph Generation from Any Text
A high-quality PDF to Markdown tool based on large language model
Autoregressive Model Beats Diffusion
Enhances Tesseract OCR output using LLMs (local or API)
Multilingual sentence & image embeddings with BERT
A modular graph-based Retrieval-Augmented Generation (RAG) system
Tensor search for humans
Scalable data pre processing and curation toolkit for LLMs
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
User toolkit for analyzing and interfacing with Large Language Models
Data Infrastructure providing an approach to multimodal AI workloads
Build multimodal language agents for fast prototype and production
Unleashing 10,000+ Word Generation from Long Context LLMs
Chinese and English multimodal conversational language model
Using AI models to automatically provide commentary and edit videos
Benchmark LLMs by fighting in Street Fighter 3
LISA: Reasoning Segmentation via Large Language Model
Gemma open-weight LLM library, from Google DeepMind
A lightweight framework for building LLM-based agents
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning