Document (PDF, Word, PPTX ...) extraction and parse API
Toolkit for conversational AI
A high-quality PDF to Markdown tool based on large language model
Enhances Tesseract OCR output using LLMs (local or API)
Open source libraries and APIs to build custom preprocessing pipelines
AI-Powered Data Processing: Use LOTUS to process all of your datasets
A system for agentic LLM-powered data processing and ETL
SQL-Driven RAG Engine
Knowledge Graph Generation from Any Text
Structured data extraction and instruction calling with ML, LLM
Scalable data pre processing and curation toolkit for LLMs
Large-language-model & vision-language-model based on Linear Attention
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Using AI models to automatically provide commentary and edit videos
The official repo of Qwen chat & pretrained large language model
Build a large language model from 0 only with Python foundation
LLM Large Model of Selling Anchor
Generative AI reference workflows
Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM
Framework that is dedicated to making neural data processing
An interpretable and efficient predictor using pre-trained models