ExtractThinker is a Document Intelligence library for LLMs
A persistent, network resilient, full text search library
Persian NLP Toolkit
The most accurate natural language detection library for Rust
Structured data extraction and instruction calling with ML, LLM
WikiChat is an improved RAG
OpenVINO™ Toolkit repository
Pretrained model hub for Keras 3
A high-quality PDF to Markdown tool based on large language model
LLM.swift is a simple and readable library
Robust Speech Recognition via Large-Scale Weak Supervision
The pluggable natural language linter for text and markdown
Sparsity-aware deep learning inference runtime for CPUs
A system for agentic LLM-powered data processing and ETL
Enhances Tesseract OCR output using LLMs (local or API)
General natural language facilities for node
A curated list of data mining papers about fraud detection
Hub of ready-to-use datasets for ML models
A Heterogeneous Benchmark for Information Retrieval
Open source libraries and APIs to build custom preprocessing pipelines
Chinese XLNet pre-trained model
AI-powered tool for generating, optimizing, and translating subtitles
Semantic search and workflows for medical/scientific papers
Efficient Retrieval Augmentation and Generation Framework
A fast, helpful, and open-source document parser