Apache OpenNLP
OpenVINO™ Toolkit repository
Hub of ready-to-use datasets for ML models
A full spaCy pipeline and models for scientific/biomedical documents
Chinese XLNet pre-trained model
State of the Art Natural Language Processing
Bring the notion of Model-as-a-Service to life
Go efficient multilingual NLP and text segmentation
A Repo For Document AI
Han Language Processing
Trained models & code to predict toxic comments
WikiChat is an improved RAG
ReFT: Representation Finetuning for Language Models
Easy-to-use and high-performance NLP and LLM framework
ExtractThinker is a Document Intelligence library for LLMs
Superlinked is a Python framework for AI Engineers
Fast and customizable framework for automatic ML model creation
Semantic search and workflows for medical/scientific papers
A Heterogeneous Benchmark for Information Retrieval
Data and tools for generating and inspecting OLMo pre-training data
Efficient Retrieval Augmentation and Generation Framework
Dealing with all unstructured data, such as reverse image search
Libraries for applying sparsification recipes to neural networks
Data processing for and with foundation models
Build AI-powered semantic search applications