Page 2 | language processing free download

ChatGLM.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

ChatGLM.cpp is a C++ implementation of the ChatGLM-6B model, enabling efficient local inference without requiring a Python environment. It is optimized for running on consumer hardware.

Downloads: 3 This Week

Last Update: 2025-01-21

See Project

Keras Hub is a repository of pre-trained models for Keras 3, offering a collection of ready-to-use models for various machine-learning tasks. KerasHub is an extension of the core Keras API; KerasHub components are provided as Layer and Model implementations. If you are familiar with Keras, congratulations. You already understand most of KerasHub.

Downloads: 5 This Week

Last Update: 6 days ago

See Project

Chinese-XLNet

Chinese XLNet pre-trained model

Chinese-XLNet is a Chinese language pre-trained model based on the XLNet architecture, providing an advanced foundation for natural language processing tasks in Mandarin and other Chinese dialects. Unlike traditional masked language modeling, XLNet uses a permutation language modeling objective that captures bidirectional context more effectively by training over all possible token orderings, yielding richer contextual representations.

Downloads: 0 This Week

Last Update: 2026-04-19

See Project

WikiChat

WikiChat is an improved RAG

WikiChat is a chatbot framework designed to interactively retrieve and summarize Wikipedia information, allowing users to ask questions and get context-aware responses?

Downloads: 0 This Week

Last Update: 2025-04-29

See Project

natural

General natural language facilities for node

"Natural" is a general natural language facility for nodejs. It offers a broad range of functionalities for natural language processing. Tokenizing, stemming, classification, phonetics, tf-idf, WordNet, string similarity, and some inflections are currently supported. It’s still in the early stages, so we’re very interested in bug reports, contributions and the like. Note that many algorithms from Rob Ellis’s node-nltools are being merged into this project and will be maintained from here onward. ...

Downloads: 0 This Week

Last Update: 2026-02-18

See Project

Apache OpenNLP

Apache OpenNLP is a machine learning-based NLP library that provides tools for text-processing tasks such as tokenization, sentence segmentation, and named entity recognition.

Downloads: 0 This Week

Last Update: 2026-04-28

See Project

NVIDIA NeMo

Toolkit for conversational AI

NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. ...

Downloads: 1 This Week

Last Update: 2026-04-22

See Project

textlint

The pluggable natural language linter for text and markdown

Textlint is an extensible linting tool for text and markdown files, designed to enforce style guidelines, detect errors, and improve writing quality.

Downloads: 0 This Week

Last Update: 2026-05-18

See Project

BettaFish

Public opinion analysis system

...It also integrates multimodal processing, enabling it to parse images and video alongside text.

Downloads: 0 This Week

Last Update: 2026-02-17

See Project

Datasets

Hub of ready-to-use datasets for ML models

Datasets is a library for easily accessing and sharing datasets, and evaluation metrics for Natural Language Processing (NLP), computer vision, and audio tasks. Load a dataset in a single line of code, and use our powerful data processing methods to quickly get your dataset ready for training in a deep learning model. Backed by the Apache Arrow format, process large datasets with zero-copy reads without any memory constraints for optimal speed and efficiency. ...

Downloads: 2 This Week

Last Update: 3 days ago

See Project

API-for-Open-LLM

Openai style api for open large language models

API-for-Open-LLM is a lightweight API server designed for deploying and serving open large language models (LLMs), offering a simple way to integrate LLMs into applications.

Downloads: 0 This Week

Last Update: 2025-01-22

See Project

deepdoctection

A Repo For Document AI

DeepDoctection is a document AI framework that applies deep learning techniques to analyze and extract structured data from scanned documents, PDFs, and images. deepdoctection is a Python library that orchestrates document extraction and document layout analysis tasks using deep learning models. It does not implement models but enables you to build pipelines using highly acknowledged libraries for object detection, OCR and selected NLP tasks and provides an integrated frameworks for...

Downloads: 0 This Week

Last Update: 2026-05-15

See Project

PaperAI

Semantic search and workflows for medical/scientific papers

PaperAI is an open-source framework for searching and analyzing scientific papers, particularly useful for researchers looking to extract insights from large-scale document collections.

Downloads: 3 This Week

Last Update: 2025-07-01

See Project

FastRAG

Efficient Retrieval Augmentation and Generation Framework

fastRAG is a research framework for efficient and optimized retrieval augmented generative pipelines, incorporating state-of-the-art LLMs and Information Retrieval. fastRAG is designed to empower researchers and developers with a comprehensive tool set for advancing retrieval augmented generation.

Downloads: 3 This Week

Last Update: 2025-01-24

See Project

Recognizers-Text

Recognition and resolution of numbers, units, date/time, etc.

Recognizers-Text is a multilingual text recognition library that extracts structured information such as dates, numbers, and currency values from unstructured text.

Downloads: 3 This Week

Last Update: 2025-02-12

See Project

NNCF

Neural Network Compression Framework for enhanced OpenVINO

NNCF (Neural Network Compression Framework) is an optimization toolkit for deep learning models, designed to apply quantization, pruning, and other techniques to improve inference efficiency.

Downloads: 4 This Week

Last Update: 2026-06-01

See Project

Search-Index

A persistent, network resilient, full text search library

Search-Index is a lightweight and fast JavaScript-based search engine that enables full-text search indexing and retrieval for web applications.

Downloads: 1 This Week

Last Update: 2025-03-12

See Project

BEIR

A Heterogeneous Benchmark for Information Retrieval

BEIR is a benchmark framework for evaluating information retrieval models across various datasets and tasks, including document ranking and question answering.

Downloads: 1 This Week

Last Update: 2025-06-04

See Project

AdalFlow

The library to build & auto-optimize LLM applications

AdalFlow is a framework for building AI-powered automation workflows, enabling users to design and execute intelligent automation pipelines with minimal coding.

Downloads: 1 This Week

Last Update: 2025-09-25

See Project

Super comprehensive deep learning notes

Super Comprehensive Deep Learning Notes

Super comprehensive deep learning notes is a massive and well-structured collection of deep learning notebooks that serve as a comprehensive study resource for anyone wanting to learn or reinforce concepts in computer vision, natural language processing, deep learning architectures, and even large-model agents. The repository contains hundreds of Jupyter notebooks that are richly annotated and organized by topic, progressing from basic Python and PyTorch fundamentals to advanced neural network designs like ResNet, transformers, and object detection algorithms. It’s not just a dry code repository; it includes theoretical explanations alongside hands-on examples, loss function explorations, optimization routines, and full end-to-end experiments on real datasets, making it highly suitable for both self-study and classroom use.

Downloads: 2 This Week

Last Update: 2026-04-27

See Project

Open Interpreter

A natural language interface for computers

Open Interpreter is an open-source tool that provides a natural-language interface for interacting with your computer. It lets large language models (LLMs) run code locally (Python, JavaScript, shell, etc.), enabling you to ask your computer to do tasks like data analysis, file manipulation, browsing, etc. in human terms (“chat with your computer”), with safeguards. Runs locally or via configured remote LLM servers/inference backends, giving flexibility to use models you trust or have...

Downloads: 14 This Week

Last Update: 2025-09-12

See Project

SetFit

Efficient few-shot learning with Sentence Transformers

SetFit is an efficient and prompt-free framework for few-shot fine-tuning of Sentence Transformers. It achieves high accuracy with little labeled data - for instance, with only 8 labeled examples per class on the Customer Reviews sentiment dataset, SetFit is competitive with fine-tuning RoBERTa Large on the full training set of 3k examples.

Downloads: 3 This Week

Last Update: 2025-08-05

See Project

Diffgram

Training data (data labeling, annotation, workflow) for all data types

...Annotation is required because raw media is considered to be unstructured and not usable without it. That’s why training data is required for many modern machine learning use cases including computer vision, natural language processing and speech recognition.

Downloads: 4 This Week

Last Update: 2024-10-14

See Project

tidytext

Text mining using tidy tools

tidytext brings tidy data principles to text mining by converting text into a tidy data frame format. It provides tools for tokenization, sentiment analysis, n‑gram creation, and term‑document matrices, enabling interoperability with dplyr, ggplot2, and other tidyverse workflows.

Downloads: 0 This Week

Last Update: 2025-07-30

See Project

Superlinked

Superlinked is a Python framework for AI Engineers

Superlinked is a Python framework designed for AI engineers to build high-performance search and recommendation applications that combine structured and unstructured data.

Downloads: 0 This Week

Last Update: 2025-10-22

See Project

Search Results for "language processing" - Page 2

Showing 298 open source projects for "language processing"

ChatGLM.cpp

Keras Hub

Chinese-XLNet

WikiChat

natural

Apache OpenNLP

NVIDIA NeMo

textlint

BettaFish

Datasets

API-for-Open-LLM

deepdoctection

PaperAI

FastRAG

Recognizers-Text

NNCF

Search-Index

BEIR

AdalFlow

Super comprehensive deep learning notes

Open Interpreter

SetFit

Diffgram

tidytext

Superlinked

Search Results for "language processing" - Page 2

Showing 298 open source projects for "language processing"

ChatGLM.cpp

Keras Hub

Chinese-XLNet

WikiChat

natural

Apache OpenNLP

NVIDIA NeMo

textlint

BettaFish

Datasets

API-for-Open-LLM

deepdoctection

PaperAI

FastRAG

Recognizers-Text

NNCF

Search-Index

BEIR

AdalFlow

Super comprehensive deep learning notes

Open Interpreter

SetFit

Diffgram

tidytext

Superlinked

Related Searches

Related Categories