ExtractThinker is a Document Intelligence library for LLMs
Did you say you like data?
Structured data extraction and instruction calling with ML, LLM
No-code LLM Platform to launch APIs and ETL Pipelines
ContextGem: Effortless LLM extraction from documents
Document content and metadata extraction microservice
Make websites accessible for AI agents
A high-quality tool for converting PDF to Markdown and JSON
Synthetic data curation for post-training and data extraction
Document (PDF, Word, PPTX ...) extraction and parsing API
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
End-to-end pipeline converting generative videos
An on-premises, OCR-free unstructured data extraction
Superlinked is a Python framework for AI Engineers
A Simple and Universal Swarm Intelligence Engine
From Paper to Presentation in One Click
kaldi-asr/kaldi is the official location of the Kaldi project
Get your documents ready for gen AI
Tools to build web AI agents that can authenticate
LLM
Online machine learning in Python
PyTorch code and models for the DINOv2 self-supervised learning
Open-Source Financial Large Language Models
Open-source evaluation toolkit of large multi-modality models (LMMs)
Automate browser-based workflows with LLMs and Computer Vision