Showing 36 open source projects for "stemming"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 1
    natural

    natural

    General natural language facilities for node

    "Natural" is a general natural language facility for nodejs. It offers a broad range of functionalities for natural language processing. Tokenizing, stemming, classification, phonetics, tf-idf, WordNet, string similarity, and some inflections are currently supported. It’s still in the early stages, so we’re very interested in bug reports, contributions and the like. Note that many algorithms from Rob Ellis’s node-nltools are being merged into this project and will be maintained from here onward...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Vespa

    Vespa

    The open big data serving engine

    Make AI-driven decisions using your data, in real-time. At any scale, with unbeatable performance. Vespa is a full-featured text search engine and supports both regular text search and fast approximate vector search (ANN). This makes it easy to create high-performing search applications at any scale, whether you want to use traditional techniques or a modern vector-based approach. You can even combine both approaches efficiently in the same query, something no other engine can do....
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Code Quality and Security for Java

    Code Quality and Security for Java

    SonarSource Static Analyzer for Java Code Quality and Security

    ... coding issues with just a click. Dozens of rules to ensure your tests are always as clean as your code! Dedicated rules to detect vulnerabilities including ones stemming from OWASP & CWE Top 25 guidelines. It all comes from a powerful analysis engine that we constantly refine. Sonar employs advanced rules along with smart, exclusive analysis techniques to find the trickiest, most elusive issues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Smile

    Smile

    Statistical machine intelligence and learning engine

    Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers the state-of-art performance. Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Boost your bottom line in less time | IBM Cognos Analytics Icon
    Boost your bottom line in less time | IBM Cognos Analytics

    Business intelligence solution for businesses wanting to make better analytics and confident decisions

    IBM Cognos Analytics acts as your trusted co-pilot for business with the aim of making you smarter, faster, and more confident in your data-driven decisions.
    Learn More
  • 5
    Anti-Spam SMTP Proxy Server

    Anti-Spam SMTP Proxy Server

    Anti-Spam SMTP Proxy Server implements multiple spam filters

    The Anti-Spam SMTP Proxy (ASSP) Server project aims to create an open source platform-independent SMTP Proxy server which implements auto-whitelists, self learning Hidden-Markov-Model and/or Bayesian, Greylisting, DNSBL, DNSWL, URIBL, SPF, SRS, Backscatter, Virus scanning, attachment blocking, Senderbase and multiple other filter methods. Click 'Files' to download the professional version 2.8.1 build 24261. A linux(ubuntu 20.04 LTS) and a freeBSD 12.2 based ready to run OVA of ASSP V2 are...
    Leader badge
    Downloads: 52,931 This Week
    Last Update:
    See Project
  • 6
    GoldenDict
    A feature-rich dictionary lookup program, supporting multiple dictionaries' formats, perfect article rendering with the complete markup, illustrations and other content retained, and allowing to type in words without any accents or correct case.
    Leader badge
    Downloads: 1,280 This Week
    Last Update:
    See Project
  • 7

    VecText

    Converting text to a structured representation

    VecText is an application that converts raw text to a structured format suitable for various data mining software. The application is written in interpreted programming language Perl. A part of the functionality is realized by external modules (e.g., Lingua::Stem::Snowball for stemming). The graphical user interface enables user-friendly software employment without requiring specialized technical skills and knowledge of a particular programming language, names of libraries and their functions...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    mbFXWords

    mbFXWords

    Analyze text. Diagonal read subject, predicate, obj. Search other pdf.

    Version 1.04. Applies and builds upon Apache OpenNLP. For English, French and German files. JavaFX Application, runs with Oracle Java Runtime Environment version 8 that is including JavaFX. NLP extensions: - Divide sentences in subclauses: segmentation. - Divide plain text: subject, predicate, object. - Count words: stemming. - Search for similar content: pdf's. Gives out subject, predicate and object of sentences of pdf and plain text files. Provides comfortable GUI. Automatic...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    DSTK - Data Science TooKit 3

    DSTK - Data Science TooKit 3

    Data and Text Mining Software for Everyone

    DSTK - Data Science Toolkit 3 is a set of data and text mining softwares, following the CRISP DM model. DSTK offers data understanding using statistical and text analysis, data preparation using normalization and text processing, modeling and evaluation for machine learning and algorithms. It is based on the old version DSTK at https://sourceforge.net/projects/dstk2/ DSTK Engine is like R. DSTK ScriptWriter offers GUI to write DSTK script. DSTK Studio offers SPSS Statistics like GUI...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Nectar: Employee Recognition Software to Build Great Culture Icon
    Nectar: Employee Recognition Software to Build Great Culture

    Nectar is an employee recognition software built for the modern workforce.

    Our 360 recognition & rewards platform enables everyone (peer to peer & manager to employees alike) to send meaningful recognition rooted in core values. Nectar has the most extensive rewards catalog so users can choose from company branded swag, Amazon products, gift cards or custom reward types. Integrate with your other tools like Slack and Teams to make sending recognition easy. We support top organizations like MLB, SHRM, Redfin, Heineken and more.
    Learn More
  • 10
    The Corpora contains 81,000 tagged words of Arabic resources (Contemporary Arabic (CCA) [1] and Arabic Wikipedia [2]) text with the basic tags (verb, noun, adjective). [1] http://www.comp.leeds.ac.uk/eric/latifa/research.htm. [2] http://ar.wikipedia.org.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    DSTK - DataScience ToolKit is an opensource free software for statistical analysis, data visualization, text analysis, and predictive analytics. Newer version and smaller file size can be found at: https://sourceforge.net/projects/dstk3/ It is designed to be straight forward and easy to use, and familar to SPSS user. While JASP offers more statistical features, DSTK tends to be a broad solution workbench, including text analysis and predictive analytics features. Of course you may specify...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Resource for Urdu stemmer

    Urdu affix list, Urdu stem words list, stop words list and to remove i

    We presents the Urdu resources for Urdu text stemming such as affixes list, stop words list, stem words list and stemming rules to remove the infixes letter/ letters and recoding to extract correct stem. Here, we collect 1169 affixes, 1100 stop words, 40904 stem word list and 35 rules with their various variations to remove the infixes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    ooPorter

    A Porter stemming or stemmer algorithm coded in ooRexx

    This is an ooRexx line-by-line port from Ansi-C to ooRexx of the stemming routine published by Martin Porter 1980. The original source code from Porter has been commented out and emulated by the corresponding (oo)Rexx code as far as possible. This is not an example of good or fast (oo)Rexx programming, it is merely a demonstration of the Porter stemming routine ported to ooRexx, use and modify as necessary. Input to the program is a textfile with words to be stemmed, one per line. Output...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ... on Turkish and make experimental results re-producable. The TTC-3600 data set has 4 different forms in terms of pre-processing: 1. Original: No pre-processing step is applied. 2. FPS-5: The first five characters of terms are selected as stem and stop-words elimination is performed. 3. FPS-7: The first seven characters of terms are selected as stem and stop-words elimination is performed. 4. Zemberek-Stemmed: Zemberek NLP toolkit is utilized for stemming and stop-words elimination is perfo
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Stemmer Gujarati

    Stemmer Gujarati

    Offline stemmer for Gujarati , which is one of 22 Indian languages.

    This is a Gujarati stemmer in Java. Stemming is a process in which affixes are removed form the root word (stem). It relates morphological variant words to corresponding common root. For example "પ્રતિઉપયોગી" is word which has stem " ઉપયોગ". Stemmers are language specific tools. The design of a stemming algorithm requires a significant level of linguistic expertise. There has been lot of significant work in the development and evaluation of stemmer for non-Indian languages, but very less...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    NHunspell Spellcheck-Hyphen-Thesaurus

    NHunspell Spellcheck-Hyphen-Thesaurus

    Spell Checker, Hypehnation and Thesaurus for .NET

    NHunspell is a .NET version of the open office spell checker. Functions: spell checking, hyphenation, thesaurus with the open office dictionaries. It is free (GPL,LGPL, MPL) and can be used in closed source software. NuGet Package: http://nuget.org/packages/NHunspell/ Release Files and Samples: http://download.crawler-lib.net/NHunspell/ Support: http://www.crawler-lib.net/boards/forum/5/nhunspell
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    The Apelon DTS (Distributed Terminology System) is an integrated set of open source components that provides comprehensive terminology services in distributed application environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    TextProcessor

    A Java package to preprocess text datasets for posterior text analysis

    The TextProcessor Java package is a text processing toolkit, which provides some frequently used text processing functions such as stemming, removing stop-words, generating a term vocabulary, and calculating the term-doc frequency matrix. Basic topic mining models such as LDA and sparse NMF are also supported. The package can also generate feature files from a given text dataset with LDA and LIBSVM format for posterior procedures such as classification or clustering. The toolkit is also being...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Auto Summarization tool using java

    Auto summarization provides a concise summary for a document.

    ... are removed from the document. The document after removing the stop words is revised again for the unique words. Unique words are the one which have the same meaning or might be redundant in the document. These are removed by a method called stemming. By using the Stemming mechanism the occurrence of a word is calculated and the results are displayed in the format of how many times they occur and the number of sentences they have occurred.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Arabic Computational Linguistics resources and Tools, Arabic Text Mining Tools, Arabic Language tools, Arabic Morphological Analysis (Stemming / Light Stemming), Arabic text preprocessing, Arabic Corpora, Open Source Arabic Corpora OSAC, Comparable Corpora. For more information: http://sites.google.com/site/motazsite
    Leader badge
    Downloads: 74 This Week
    Last Update:
    See Project
  • 21
    Stemka - the free stemming tool for creating fast, safe and powerful stemmers. Includes convertor for ISpell dictionaries and ready-to-use dictionaries for Russian and Ukrainian.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    JAVA Arabic Stemmer

    A JAVA class with a small functionality that is stemming Arabic words

    A JAVA Arabic stemmer that is based on Shereen Khoja algorithm. This java class offers a function called stemWrod which takes an arabic word and return the stem of it.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    MongoLantern - MongoDB Fulltext Search

    MongoLantern - MongoDB Fulltext Search

    Open Source MongoDB Fulltext Search Server

    MongoLantern is an open source full text search server using MongoDB as index storage, which allows MongoLantern to migrate any changes very easily into account using MongoDB API. It's written originally written in PHP can be migrated to any desired language as required using it's future APIs. MongoLantern 0.7 - Stable/Production Release: 1. MongoLantern API support enabled. 2. CSV indexer added as a plugin. 3. node.js API client added.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Guia Brasil de recursos turisticos econônomicos e mapeamento etnico e comportamental, para definir u parametro de gostos e tendencias, incorporação do algoritimo de Poter Stemming em datamine.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    The Neurpheus Morphological Analyser performs morphological analysis, stemming or word form generation tasks using sophisticated classification methods for an analysis of words unseen in a training dictionary.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next