Showing 60 open source projects for "text based"

View related business solutions
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    An open-source corpus analysis class library written in C#. GUI of Tenka Text 0.1.3 comes with Wordlister - an advanced, extremely fast graphical wordlist tool and a simple regex concordance tool. Tenka Text - the open-source answer to WordSmith Tool
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    TextMarker
    TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    SEMANTIXS is a semantic information extraction system that can extract, represent and visualize domain-specific information from free-text in the form of complex (and simple) relationships. Refer - http://www.cs.iastate.edu/~semantix/ for more info.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    GamesCrafters is an undergraduate research group at the University of California, Berkeley. GAMESMAN is our software for building text & X11 apps for finite, 2-person abstract perfect-information games (e.g., TicTacToe & chess). We've authored 40+ games
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    Ontea - Pattern based Semantic Annotation Platform. Ontea search or create semantic meta data from text or documents using pattern based approaches. Implementation currently includes regular expressions (regex) patterns
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Contextor
    Contextor is a light-weight simple-to-use Java based library to help developers and researchers working with the general concept of a resource; as examples, resources can be text resources, web resources, images and videos.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Optex Analyzer is a software to analyze and compare algorithms to solve approximately optimization problems. It has a GUI that allows select a set of input files containing raw algorithm results. The analysis is shown with tables and charts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    A simple java program using PDFBox and Prefuse libraries to extract text from pdf and display as a graph. Can also link words based on their contexts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 10
    LAITOR is a text mining software developed to find co-occurrence of biological entities (gene/protein terms) together with biointeractions and concepts term from customized dictionaries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    * Java classes for parsing text, conversion to XML or to evaluate in Java. The parser is textual-script-controlled with a syntax near Backus Naur Format, named ZBNF. * Some routines for conversion: C-Header or Java to XMI, XML-Documentation generation,
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    OCR c++ library. Include: contour recognition; vectorisation; matrix letter feature recognition; auto page segmentation and detect rotation; SS3 ASM core; XML base; web-based GUI; 99,6% printed Unicode text recognition; letter base up to 1200 letters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    TextMine is for the Perl hacker who is grappling with the problems of managing unstructured text from various sources. You can use these text mining tools to search the Web, index text, extract entities, categorize your e-mail, and summarize documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    T-Rex (Trainable Relation Extraction) is a highly configurable machine learning-based Information Extraction from Text framework, which includes tools for document classification, entity extraction and relation extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Plagiasi is a plagiarism detection software, deployed as a web application. It can detect plagiarism in a text, which will be compared to local documents and/or online documents (automatically seleted using web search engines).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    iDocs is a intellectual document work flow with text mining options project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    LACE means "Lucene Analyzer for CJK (Chinese/Japanese/Korean) & English". It's a simple tokenizer that can handle English-CJK mixed text. Chinese words are handled using a dictionary based method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The UIMA Annotator (called BRUTUS - Business Rules from Unstructured Text and Unstructured Sources) is a component for the UIMA Framework that allows for capturing business knowledge formalized in Structured English syntax (based on OMG's SBVR) with MOF
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    hypKNOWsys aims at developing a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    An approximate gazetteer for GATE (General Architecture for Text Engineering), based on Levenshtein's Distance. Strings can be matched and found even in texts with noise and errors. More Info: http://bruno-wp.blogspot.com/search/label/Software
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    JTextPro: A Java-based Text Processing tool that includes sentence boundary detection (using maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The Text Annotation Environment (tae) can be used to annotate natural language text manually or automatically (UIMA Annotator) with meta information (tokens, part-of-speech, named entities, ...). Tae is based on Eclipse and IBM's UIMA.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Kriterion is a document retrieval and categorization engine capable of full text searching. There is no need for keyword or context-based information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    SENTENSA Knowledge Miner is a platform independent tool for searching any text. SENTENSA uses robust methods of indexing and searching text, leveraging on experience from more than 20 years of information retrieval.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB