C++ implementation of ChatGLM-6B, ChatGLM2-6B, ChatGLM3, and GLM4(V)
Sparsity-aware deep learning inference runtime for CPUs
Industrial-strength Natural Language Processing (NLP)
Efficient Retrieval Augmentation and Generation Framework
Han Language Processing
Training data (data labeling, annotation, workflow) for all data types
Unified embedding model
Pretrained model hub for Keras 3
Large Language Model Text Generation Inference
Transformers4Rec is a flexible and efficient library
The Classical Language Toolkit
ExtractThinker is a Document Intelligence library for LLMs
Data and tools for generating and inspecting OLMo pre-training data
Toolkit for conversational AI
Obsei is a low-code, AI-powered automation tool
ReFT: Representation Finetuning for Language Models
Bring the notion of Model-as-a-Service to life
Evaluation code for various unsupervised automated metrics
Extract schema, statistics and entities from datasets
A repository for Document AI
A full spaCy pipeline and models for scientific/biomedical documents
Easy-to-use and powerful NLP library with an awesome model zoo
Hub of ready-to-use datasets for ML models
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
A tool for learning vector representations of words and entities