Showing 85 open source projects for "linux text editor"

View related business solutions
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    T-Rex (Trainable Relation Extraction) is a highly configurable machine learning-based Information Extraction from Text framework, which includes tools for document classification, entity extraction and relation extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    The main purpose of AMATOOL is to create an application for semiautomatic mark of text, using XML tags. The texts is typical can be archaeological reports or midleagetextscripts. It is a semiautomtaic editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    GoldenOrb is a java library under the Apache License V2.0 for correlation, summarization and clustering of text information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    iDocs is a intellectual document work flow with text mining options project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    The Java Text Categorizing Library (JTCL) is a pure java implementation of libTextCat which in turn is "a library that was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy."
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    LACE means "Lucene Analyzer for CJK (Chinese/Japanese/Korean) & English". It's a simple tokenizer that can handle English-CJK mixed text. Chinese words are handled using a dictionary based method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    The UIMA Annotator (called BRUTUS - Business Rules from Unstructured Text and Unstructured Sources) is a component for the UIMA Framework that allows for capturing business knowledge formalized in Structured English syntax (based on OMG's SBVR) with MOF
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    hypKNOWsys aims at developing a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    An approximate gazetteer for GATE (General Architecture for Text Engineering), based on Levenshtein's Distance. Strings can be matched and found even in texts with noise and errors. More Info: http://bruno-wp.blogspot.com/search/label/Software
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, as text classification, text clustering or information retrieval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display the difficulty associated with comprehending using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    The NS-Mapper ad-hoc scenario editor is improved and extended by adding more realistic strategies, such as random based node placement, movement and traffic to the ad-hoc simulation of the Network Simulator 2 (NS-2).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    JTextPro: A Java-based Text Processing tool that includes sentence boundary detection (using maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The Text Annotation Environment (tae) can be used to annotate natural language text manually or automatically (UIMA Annotator) with meta information (tokens, part-of-speech, named entities, ...). Tae is based on Eclipse and IBM's UIMA.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    This project intends to create an indexing search engine, for knowledge management. The primary object is to apply an information retrieval core. And implement a knowledge data discovery theory such as data mining algorithm, text mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    TopicMapTools allows the creation, manipulation and visualization of XML Topic Maps. It is based on the TM4J framework. The editor can be used in genealogy for the visualisation of e.g. family trees.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    This Software takes an text with algebra symbols and converts it to real sql, which then may be run with an extra database backend like mysql or postgresql.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    SENTENSA Knowledge Miner is a platform independent tool for searching any text. SENTENSA uses robust methods of indexing and searching text, leveraging on experience from more than 20 years of information retrieval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    DotPlot is an Eclipse plug-in to graphically compare word sequences of any type of text. Matches will be plotted as dots on a graph. Similarities in thousands of lines of text or code will result in typical textures and diagonals in the plot.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Thetis is a Java (OS-independent) application written to allow the linguistic and statistical analysis of the Homeric and Hesiodic Epic. Current achievement is the creation of a complete (and free) Thesaurus of the Homeric and Hesiodic poems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    It is an universal language translator and written in Java. All languages are translated to an unique language (interlingua) and generate any native language from the interlingua. The wordbooks are XML. It use the context of a text, rules and a grammar.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    AutoSummary uses Natural Language Processing to generate a contextually-relevant synopsis of plain text. It uses statistical and rule-based methods for part-of-speech tagging, word sense disambiguation, sentence deconstruction and semantic analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A tool kit for multiplexing annotations and management of features for textual annotation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    The "Universal Content Evaluation and Categorisation Software" is a program for analysing a website’s, or more generally, a text’s content. The text is arranged in dozens of categories, permitting more efficient web searches and information processing.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB