Apache OpenNLP
OpenVINO™ Toolkit repository
Hub of ready-to-use datasets for ML models
A full spaCy pipeline and models for scientific/biomedical documents
Chinese XLNet pre-trained model
Han Language Processing
State-of-the-Art Natural Language Processing
Data processing for and with foundation models
Efficient multilingual NLP and text segmentation in Go
A Repo For Document AI
Large Language Model Text Generation Inference
Trained models & code to predict toxic comments
WikiChat is an improved RAG
ReFT: Representation Finetuning for Language Models
Pretrained model hub for Keras 3
Easy-to-use and high-performance NLP and LLM framework
ExtractThinker is a Document Intelligence library for LLMs
Superlinked is a Python framework for AI Engineers
Fast and customizable framework for automatic ML model creation
Semantic search and workflows for medical/scientific papers
A Heterogeneous Benchmark for Information Retrieval
Data and tools for generating and inspecting OLMo pre-training data
Efficient Retrieval Augmentation and Generation Framework
Handling all kinds of unstructured data, for tasks such as reverse image search
Libraries for applying sparsification recipes to neural networks