Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
Chat with your documents using local AI
Uncertainty Quantification for Language Models, is a Python package
MemoryOS is designed to provide a memory operating system
Real-time multi-AI collaboration: Claude, Codex & Gemini
A dataset consists of 15,140 ChatGPT prompts from Reddit
A system for agentic LLM-powered data processing and ETL
Retrieval and Retrieval-augmented LLMs
A lightweight vLLM implementation built from scratch
Utilities intended for use with Llama models
GLM-4-Voice | End-to-End Chinese-English Conversational Model
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Practical productivity tools for Claude Code, Codex-CLI
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Gracefully face hCaptcha challenge with multimodal llms
A course of learning LLM inference serving on Apple Silicon
Simple, Pythonic building blocks to evaluate LLM applications
Qwen3-omni is a natively end-to-end, omni-modal LLM
Access large language models from the command-line
Ongoing research training transformer models at scale
A guidance language for controlling large language models
LLM abstractions that aren't obstructions
PyTorch library of curated Transformer models and their components
Open source libraries and APIs to build custom preprocessing pipelines
Integrating LLMs into structured NLP pipelines