An on-premises, OCR-free unstructured data extraction
A community-supported supercharged version of paperless
Multi-tool for semantic search
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Library for OCR-related tasks powered by Deep Learning
Ready-to-use OCR with 80+ supported languages
The official Python client for the Huggingface Hub
Topic Modelling for Humans
Solve end to end problems using Llama model family
Explainability and Interpretability to Develop Reliable ML models
A very simple framework for state-of-the-art NLP
Deepnote is a drop-in replacement for Jupyter
Running large language models on a single GPU
List of references in my private & single document
Python implementation of TextRank algorithms
An Open Toolkit for Knowledge Graph Extraction and Construction
A Unified Toolkit for Deep Learning Based Document Image Analysis
CPU/GPU inference server for Hugging Face transformer models
Repository to track the progress in Natural Language Processing (NLP)
Facilitating the design, comparison and sharing of deep text models
A natural language frame semantics parser
AiLearning, data analysis plus machine learning practice
DSTK - DataScience ToolKit for All of Us
Beautiful visualizations of how language differs among document types
A technical report on convolution arithmetic in deep learning