Document (PDF, Word, PPTX ...) extraction and parse API
Parse files for optimal RAG
A high-quality PDF to Markdown tool based on large language model
Open image model at the forefront of design
Contexts Optical Compression
Structured data extraction and instruction calling with ML, LLM
Reading book source
Knowledge Graph Generation from Any Text
Generate blog articles from video or audio
OCR model for complex documents with layout-aware structured outputs
AI agent to evaluate and score resumes
A Family of Open Sourced Music Foundation Models
Generate audiobooks from e-books
Document content and metadata extraction microservice
Using AI models to automatically provide commentary and edit videos
Open source healthcare AI
OCR expert VLM powered by Hunyuan's native multimodal architecture
Renderer for the harmony response format to be used with gpt-oss
Autonomous LLM agent for end-to-end data science workflows
AI-Researcher: Autonomous Scientific Innovation
A simple, high-quality voice conversion tool focused on ease of use
AI framework for automated short video creation and editing tools
Pushing the Frontier of Long Audio-Visual Generation
Voice Recognition to Text Tool
Public opinion analysis system