An on-premises, OCR-free unstructured data extraction
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
A community-supported supercharged version of paperless
Multi-tool for semantic search
Library for OCR-related tasks powered by Deep Learning
The official Python client for the Huggingface Hub
Ready-to-use OCR with 80+ supported languages
Topic Modelling for Humans
Explainability and Interpretability to Develop Reliable ML models
Solve end to end problems using Llama model family
Deepnote is a drop-in replacement for Jupyter
A very simple framework for state-of-the-art NLP
List of references in my private & single document
Running large language models on a single GPU
Python implementation of TextRank algorithms
An Open Toolkit for Knowledge Graph Extraction and Construction
Document papers compiled daily in computer vision/deep learning
A Unified Toolkit for Deep Learning Based Document Image Analysis
CPU/GPU inference server for Hugging Face transformer models
Repository to track the progress in Natural Language Processing (NLP)
Converting text to a structured representation
Facilitating the design, comparison and sharing of deep text models
A natural language frame semantics parser
AiLearning, data analysis plus machine learning practice
DSTK - DataScience ToolKit for All of Us