BigQuery automates data prep, analysis, and predictions with built-in AI assistance.
Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
Try Free
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
A JAVA class with a small functionality that is stemming Arabic words
A JAVA Arabic stemmer that is based on Shereen Khoja algorithm. This java class offers a function called stemWrod which takes an arabic word and return the stem of it.
MongoLantern is an open source full text search server using MongoDB as index storage, which allows MongoLantern to migrate any changes very easily into account using MongoDB API. It's written originally written in PHP can be migrated to any desired language as required using it's future APIs.
MongoLantern 0.7 - Stable/Production Release:
1. MongoLantern API support enabled.
2. CSV indexer added as a plugin.
3. node.js API client added.
Stemka - the free stemming tool for creating fast, safe and powerful stemmers. Includes convertor for ISpell dictionaries and ready-to-use dictionaries for Russian and Ukrainian.
Guia Brasil de recursos turisticos econônomicos e mapeamento etnico e comportamental, para definir u parametro de gostos e tendencias, incorporação do algoritimo de Poter Stemming em datamine.
The Neurpheus Morphological Analyser performs morphological analysis, stemming or word form generation tasks using sophisticated classification methods for an analysis of words unseen in a training dictionary.
Based on the Buckwalter Morphological Analyzer (Version 1.0) for doing Arabic stemming and POS tagging. Includes a rewrite of the original Perl script, with better documentation and more flexible options, and a C++ interface (usable as a library or app).
Cubit is an Azureus plugin that enables decentralized, approximate keyword search of torrents within the Azureus client. It provides accurate and useful results even with errors in the search terms, stemming from typos and common spelling variations.
Secure File Transfer for Windows with Cerberus by Redwood
Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.
Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
Word stemming library (Porter algorithms). Stemming refers to stripping a word down to a simpler prefix. This is useful for "fuzzy" word matching by finding common prefixes between similar words.
FinIndex is an client-server full-text indexing and search tool build around finlib with focus on Czech language support. It can do diacritics restoration and stemming for Czech. It contains fast-cgi interface for searching in mailing lists.
Perl module uses MySQL database backend to index files, web documents and database fields. Supports must include, can include, and cannot include words and phrases. Support for boolean (AND/OR) queries, stop words and stemming.
Contextual rule-based text normalization engine written in java, that can be used to implement stemming algorithms or phonetic normalizers. The project includes a french stemmer/phonetic normalizer, and a soundex implementation. Includes a Lucene Analyzer
A collection of implementations of the Lovins stemmer as described in "Development of a Stemming Algorithm" by Julie Beth Lovins (published in "Mechanical translation and computational linguistics", 11:22-31, 1968).
...The stem() method will return the stem as will toString() after stem() has been
called). The clear() method will wipe the Stemmer buffer and allow a new word to be input.
This version extends Martin Porter's original stemming algorithm by allowing capital letters to exist in words. This version should also be plugged in wherever the old algorithm is used with
few accommodations necessary.
The code in this version is more readable (in my opinion) than the old version. There is a main at the bottom that shows how to use the Stemmer.
Advanced Full Text Search + AI Assistant + Local Server for LLMs
TARILIO Pro platform for Information Retrieval that can work on a LAN as both client and LLM server. Advanced features: set LLM sampler parameters, scrolling index vocabulary, multilingual stemming, synonyms. MCP Client. UI can be translated using a free Language File Translator. Uses Llama Sharp and Lucene search engine . Open source and free. TARILIO PRO commercial version.
A set of classes for Natural Language Processing in PHP for:
1. Part of speech Tagging - Brill, n-gram, HMM
2. Princeton Wordnet querying and access
3. Document summarization
4. Document classification - EM, Bayes
5. Stemming - Porter, Lancaster