Page 4 | nlp free download

Showing 196 open source projects for "nlp"

View related business solutions

Python Clear Filters & Widen Search

Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

Label Sleuth

Open source no-code system for text annotation and building of text

An open-source no-code system for text annotation and building text classifiers. No AI knowledge needed. From task definition to working model in just a few hours! While domain experts label their data, Label Sleuth automatically trains in the background-appropriate machine learning models. To avoid wasted labeling effort, Label Sleuth employs active learning techniques to guide the user in what they should be labeled next. Domain experts can quickly start labeling their data through an...

Downloads: 3 This Week

Last Update: 2024-06-17
See Project
2

DeepEval

...DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that run locally on your machine for evaluation. Whether your application is implemented via RAG or fine-tuning, LangChain, or LlamaIndex, DeepEval has you covered. With it, you can easily determine the optimal hyperparameters to improve your RAG pipeline, prevent prompt drifting, or even transition from OpenAI to hosting your own Llama2 with confidence.

Downloads: 2 This Week

Last Update: 2026-04-28
See Project
3

Arize Phoenix

Uncover insights, surface problems, monitor, and fine tune your LLM

Phoenix provides ML insights at lightning speed with zero-config observability for model drift, performance, and data quality. Phoenix is an Open Source ML Observability library designed for the Notebook. The toolset is designed to ingest model inference data for LLMs, CV, NLP and tabular datasets. It allows Data Scientists to quickly visualize their model data, monitor performance, track down issues & insights, and easily export to improve. Deep Learning Models (CV, LLM, and Generative) are an amazing technology that will power many of future ML use cases. A large set of these technologies are being deployed into businesses (the real world) in what we consider a production setting.

Downloads: 1 This Week

Last Update: 9 hours ago
See Project
4

model2Vec

Fast State-of-the-Art Static Embeddings

model2vec is an innovative embedding framework that converts large sentence transformer models into compact, high-speed static embedding models while preserving much of their semantic performance. The project focuses on dramatically reducing the computational cost of generating embeddings, achieving significant improvements in speed and model size without requiring large datasets for retraining. By using a distillation-based approach, it can produce lightweight models that run efficiently on...

Downloads: 0 This Week

Last Update: 2026-03-27
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
5

Uncertainty Baselines

High-quality implementations of standard and SOTA methods

...Rather than offering toy scripts, it provides end-to-end recipes—data input, model architectures, training loops, evaluation metrics, and logging—so results are comparable across runs and research groups. The library spans canonical modalities and tasks, from image classification and NLP to tabular problems, with baselines that cover both deterministic and probabilistic approaches. Techniques include deep ensembles, Monte Carlo dropout, temperature scaling, stochastic variational inference, heteroscedastic heads, and out-of-distribution detection workflows. Each baseline emphasizes reproducibility: fixed seeds, standard splits, and strong metrics such as calibration error, AUROC for OOD, and accuracy under shift.

Downloads: 0 This Week

Last Update: 2026-03-24
See Project
6

AiLearning-Theory-Applying

Quickly get started with AI theory and practical applications

AiLearning-Theory-Applying is a comprehensive educational repository designed to help learners quickly understand artificial intelligence theory and apply it in practical machine learning and deep learning projects. The repository provides extensive tutorials covering mathematical foundations, machine learning algorithms, deep learning concepts, and modern large language model architectures. It includes well-commented notebooks, datasets, and implementation examples that allow learners to...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
7

AI Engineer Headquarters

A collection of scientific methods, processes, algorithms

AI-Engineer-Headquarters is a comprehensive educational repository designed to help developers become advanced AI engineers through a structured learning path and practical system-building exercises. The project serves as a curated collection of resources, methodologies, and tools covering topics across the entire artificial intelligence development lifecycle. Rather than focusing only on theoretical knowledge, the repository emphasizes applied learning and encourages engineers to build real...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
8

newspaper4k

Python library for scraping and analyzing online news articles easily

Newspaper4k is a Python library designed for extracting, processing, and analyzing news articles from websites. It is a continuation and active fork of the original newspaper3k library, which had stopped receiving updates, with the goal of keeping the ecosystem maintained while adding improvements and bug fixes. It provides developers with tools to automatically download web pages, extract the main article content, and collect associated metadata such as titles, authors, images, and...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
9

Kaggle Solutions

Collection of Kaggle Solutions and Ideas

Kaggle Solutions is an open-source repository that compiles winning solutions, insights, and educational resources from hundreds of Kaggle data science competitions. The repository acts as a knowledge base for competitive machine learning by collecting solution write-ups, discussion threads, code notebooks, and tutorial resources shared by top Kaggle participants. Each competition entry typically includes information about the dataset, evaluation metrics, modeling strategies, and techniques...

Downloads: 0 This Week

Last Update: 2026-04-05
See Project
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
10

TigerBot

TigerBot: A multi-language multi-task LLM

TigerBot is an open-source family of large language models designed to support multilingual and multi-task natural language processing applications. The project focuses on building high-performance models capable of handling both English and Chinese tasks while maintaining strong reasoning and conversational abilities. TigerBot models are based on modern transformer architectures and are trained on large datasets that cover multiple domains and languages. The project provides both base...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
11

llms-from-scratch-cn

Build a large language model from 0 only with Python foundation

llms-from-scratch-cn is an educational open-source project designed to teach developers how to build large language models step by step using practical code and conceptual explanations. The repository provides a hands-on learning path that begins with the fundamentals of natural language processing and gradually progresses toward implementing full GPT-style architectures from the ground up. Rather than focusing on using pre-trained models through APIs, the project emphasizes understanding...

Downloads: 0 This Week

Last Update: 2026-03-26
See Project
12

PyTorch-Tutorial-2nd

CV, NLP, LLM project applications, and advanced engineering deployment

PyTorch-Tutorial-2nd is an open-source educational repository that provides structured tutorials for learning deep learning with the PyTorch framework. The project serves as a practical companion to a second edition of a PyTorch learning guide and is designed to help learners understand neural network concepts through hands-on coding examples. The repository covers a wide range of topics including tensor operations, neural network construction, model training workflows, and optimization...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
13

Flax

Flax is a neural network library for JAX

Flax is a flexible neural-network library for JAX that embraces functional programming while offering ergonomic module abstractions. Its design separates pure computation from state by threading parameter collections and RNGs explicitly, enabling reproducibility, transformation, and easy experimentation with JAX transforms like jit, pmap, and vmap. Modules define parameterized computations, but initialization and application remain side-effect free, which pairs naturally with JAX’s staging...

Downloads: 0 This Week

Last Update: 2026-04-22
See Project
14

OpenCompass

OpenCompass is an LLM evaluation platform

Just like a compass guides us on our journey, OpenCompass will guide you through the complex landscape of evaluating large language models. With its powerful algorithms and intuitive interface, OpenCompass makes it easy to assess the quality and effectiveness of your NLP models. OpenCompass is a one-stop platform for large model evaluation, aiming to provide a fair, open, and reproducible benchmark for large model evaluation. Pre-support for 20+ HuggingFace and API models, a model evaluation scheme of 50+ datasets with about 300,000 questions, comprehensively evaluating the capabilities of the models in five dimensions. ...

Downloads: 0 This Week

Last Update: 2026-02-13
See Project
15

Poetiq

Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1

poetiq-arc-agi-solver is the open-source codebase from Poetiq that replicates their record-breaking submission to the challenging benchmark suite ARC-AGI (both ARC-AGI-1 and ARC-AGI-2). The project demonstrates a system that orchestrates large language models (LLMs) — like those from major providers — with carefully engineered prompting, reasoning workflows, and dynamic strategies, to tackle the abstract, logic-heavy problems in ARC-AGI. Instead of relying on a single prompt or fixed...

Downloads: 0 This Week

Last Update: 2025-12-07
See Project
16

crème de la crème of AI courses

This repository is a curated collection of links to various courses

crème de la crème of AI courses is an open-source repository that serves as a curated directory of high-quality educational resources related to artificial intelligence, machine learning, and modern data science. The project aggregates links to online courses, tutorials, lecture series, and learning materials from universities, research labs, and independent educators. The repository organizes courses by topic, difficulty level, format, and release year, allowing learners to quickly identify...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
17

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis

AutoGPTQ is an implementation of GPTQ (Quantized GPT) that optimizes large language models (LLMs) for faster inference by reducing their computational footprint while maintaining accuracy.

Downloads: 0 This Week

Last Update: 2025-01-21
See Project
18

Eng2BN CSV Translator

Translate English to Bangla using CSV file format and range wise.

Eng2BN CSV Translator user-friendly Python tool that enables efficient translation of English text to Bangla within CSV files. The application supports large datasets and allows users to translate specific row ranges, making it ideal for batch processing.

Downloads: 0 This Week

Last Update: 2025-12-06
See Project
19

Wikipedia2Vec

A tool for learning vector representations of words and entities

Wikipedia2Vec is an embedding learning tool that creates word and entity vector representations from Wikipedia, enabling NLP models to leverage structured and contextual knowledge.

Downloads: 0 This Week

Last Update: 2025-01-24
See Project
20

Transformers4Rec

Transformers4Rec is a flexible and efficient library

Transformers4Rec is an advanced recommendation system library that leverages Transformer models for sequential and session-based recommendations. The library works as a bridge between natural language processing (NLP) and recommender systems (RecSys) by integrating with one of the most popular NLP frameworks, Hugging Face Transformers (HF). Transformers4Rec makes state-of-the-art transformer architectures available for RecSys researchers and industry practitioners. Traditional recommendation algorithms usually ignore the temporal dynamics and the sequence of interactions when trying to model user behavior. ...

Downloads: 8 This Week

Last Update: 2025-01-24
See Project
21

Textinator

Internationalized highly customizable annotation and evaluation tool

Textinator is an open-source internationalized highly-customizable annotation and evaluation tool for Natural Language Processing (NLP) tasks. The tool offers a web interface with a user-friendly UI and supports a number of NLP tasks out of the box. The recommended way of deploying Textinator is through building a production version of the Docker container, as described in the Deployment guidelines. Note that the production version is most definitely more secure and reliable than the development version. ...

Downloads: 0 This Week

Last Update: 2024-02-04
See Project
22

LexiFinder

AI-powered semantic indexing: automating the creation of book indexes

LexiFinder is a tool to generate analytic indexes from documents automatically. Given one or more source documents and a set of keywords, it extracts all nouns, compares them semantically to the keywords using a pretrained NLP model, and produces a structured, hierarchical index ready to be included in a book or manuscript. LexiFinder works in two ways: as a command-line tool for scripting, automation, and batch processing, and as a graphical application for a guided, point-and-click experience. Both interfaces share the same underlying engine and support the same features. ...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
23

Pytente

Uma Ferramenta Computacional para Análise e Recuperação de Patentes

O Pytente é uma solução avançada para automatizar o processo de coleta, armazenamento e tratamento de dados bibliográficos de patentes. A ferramenta foi projetada para simplificar a coleta de grandes volumes de dados em repositórios de acesso aberto. O Pytente garante o armazenamento estruturado das informações, além da validação e eliminação de registros duplicados. Dentre as diversas funcionalidades disponibilizadas pela ferramenta, destacam-se a extração personalizada de subconjuntos de...

Downloads: 0 This Week

Last Update: 2025-11-03
See Project
24

Kalshi-Quant-TeleBot

Kalshi Advanced Quantitative Trading Bot is an enterprise-grade

Kalshi Advanced Quantitative Trading Bot is an enterprise-grade automated trading system designed for the Kalshi event-based prediction market. Built with cutting-edge quantitative algorithms and professional risk management, it provides institutional-quality trading capabilities with user-friendly control The Kalshi Advanced Quantitative Trading Bot is a professional-grade automated trading system designed specifically for event-based markets on the Kalshi platform. This bot leverages...

1 Review

Downloads: 5 This Week

Last Update: 2026-03-12
See Project
25

Chinese-LLaMA-Alpaca 2

Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project

This project is developed based on the commercially available large model Llama-2 released by Meta. It is the second phase of the Chinese LLaMA&Alpaca large model project. The Chinese LLaMA-2 base model and the Alpaca-2 instruction fine-tuning large model are open-sourced. These models expand and optimize the Chinese vocabulary on the basis of the original Llama-2, use large-scale Chinese data for incremental pre-training, and further improve the basic semantics and command understanding of...

Downloads: 0 This Week

Last Update: 2024-01-23
See Project