Showing 27 open source projects for "apache open office"

  • 1
    JoBimText

    Linking Language to Knowledge with Distributional Semantics

    JoBimText is a software solution for automatic text expansion using contextualized distributional similarity. It provides text analysis tools for large corpora and has capabilities to create distributional semantic models (JoBimText models) and multi-word expressions.
    Downloads: 1 This Week
  • 2
    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    Live Transcribe Speech Engine provides on-device speech recognition components that power real-time transcription for accessibility and everyday voice-first experiences. Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models....
    Downloads: 0 This Week
  • 3
    Leseratte is a Java parser for German written language. Currently, it contains a German lexicon (based on the Wiktionary), inflexion rules, a grammar and a parser. (Semantics component planned.) Usable as a Java library, also provides a graphical UI.
    Downloads: 0 This Week
  • 4
    Corpus Toolkit

    A text management tool for linguistic purposes...

    Downloads: 0 This Week
  • 5
    Helsinki Finite-State Technology
    The Helsinki Finite-State Transducer toolkit is intended for processing natural language morphologies. The toolkit is demonstrated by wide-coverage implementations of a number of languages of varying morphological complexity.
    Downloads: 4 This Week
  • 6

    LexSub

    A Lexical Substitution Framework

    Lexical substitution framework for supervised all-words lexical substitution using delexicalized features. For a runnable (but GPL-licensed) version of LexSub, see LexSub-GPL (sf.net/p/lexsub/lexsub-gpl)
    Downloads: 0 This Week
  • 7

    ISO GrAF

    Experimental Java library for reading and writing GrAF/XML files.

    The Graph Annotation Framework (GrAF) models linguistic annotations using a data model based on Graph theory and algorithms. The GrAF standard is a work product of ISO TC37SC4 Working Group 1. This Java library is NOT part of the GrAF standard and standoff annotation files produced by the library may not be GrAF compliant.
    Downloads: 0 This Week
  • 8

    Aelius Brazilian Portuguese POS-Tagger

    Python, NLTK-based package for shallow parsing of Brazilian Portuguese

    Aelius is an ongoing open source project aiming at developing a suite of Python, NLTK-based modules and interfaces to external freely available tools for shallow parsing of Brazilian Portuguese. It also includes language resources such as language models, sample texts, and gold standards. Presently, Aelius already offers facilities for POS-tagging and chunking corpora and outputting annotations in different formats, such as in XML in the TEI P5 encoding scheme.
    Downloads: 0 This Week
  • 9
    This package contains tools that add NLP capabilities to Lucene 4.x (tested with Lucene versions 4.6.x to 4.8.1). Although it was originally developed for German, it is mostly language-independent. It allows the user to lemmatize words to be indexed, to weight terms by their parts of speech (e.g. weighting nouns more heavily than pronouns), and to add synonyms taken from GermaNet or from a list you provide to the search index, thereby increasing Lucene's recall.
    Downloads: 0 This Week
  • 10

    CLEiM

    Cross Lingual Education in Medicine

    CLEiM (Cross Lingual Education in Medicine) is an open-source version of an intelligent system that extracts concepts from medical texts and provides qualified information. It integrates information from various sources. The system was developed by the Intelligent System Group GSI (http://www.esi.uem.es/gsi/) at UEM University. NER (Named Entity Recognition) is based on the GATE platform. Installation is simple, and the system can be used as a Web application. It has been tested under...
    Downloads: 0 This Week
  • 11

    TML - Text Mining Library for LSA & CMM

    TML is a Java Library for LSA and extracting Concept Maps from text

    TML has moved to http://www.villalon.cl/tml.html and the code to https://github.com/villalon/tml
    Downloads: 0 This Week
  • 12

    CoocViewer

    Viewer for co-occurrences and positional co-occurrences

    A Demo is available at: http://coocviewer.sourceforge.net/coocviewer/index.php
    Downloads: 0 This Week
  • 13

    Semiotics

    A story generation program.

    This is a study on automatic generation of stories. The aim is to explore the cognitive and literary fields to eventually implement models and tools to help you create new stories, poems, and scenarios from a minimum set of generic parameters.
    Downloads: 0 This Week
  • 14

    miac-p

    Code for syntactic parsing and other NLP apps.

    Code for syntactic parsing and other natural language processing applications.
    Downloads: 0 This Week
  • 15
    SweetOnionCCG2PTBConverter

    A tool that converts CCGBank to PTB

    Conversion between grammar frameworks is important for comparing the performance of parsers developed on them. This tool converts CCG derivations to PTB trees using maximum entropy models and can visualize the tree graphs. The main technical innovation is the conversion method, which achieves an F-score over 95%.
    Downloads: 0 This Week
  • 16

    Java Analogical Modeling

    Analogical Modeling module for Java

    Analogical Modeling is an exemplar-based approach to machine learning which imitates human behavior in outcome prediction. Its design has been applied to many natural language and other phenomena which exhibit variable behavior. A Perl XS implementation is available from http://humanities.byu.edu/am/ . This project is a Java implementation of the same. For more information on Analogical Modeling, see http://en.wikipedia.org/wiki/Analogical_modeling .
    Downloads: 0 This Week
  • 17
    Distributed phrase-based machine translation training tool based on Hadoop.
    Downloads: 0 This Week
  • 18
    This is a PHP-5 library for language detection.
    Downloads: 0 This Week
  • 19
    The Kyoto FST Decoder is a general decoding engine for Weighted Finite State Transducers. It features flexible XML-based configuration, beam-search decoding, and is able to output separate weights for weight tuning.
    Downloads: 0 This Week
  • 20
    Sylli
    Sylli is a universal syllabifier. Developed for Italian, it can easily be adapted to any language that is claimed to respect the SSP. Sylli divides timit, strings, files and directories into syllables.
    Downloads: 0 This Week
  • 21
    oopinyinguide
    OO Pinyin Guide is a Java extension for OpenOffice 3 or higher. It enables the user to add pinyin transliteration over Chinese characters inside a text document. This tool can be useful for people learning or teaching Chinese.
    Downloads: 5 This Week
  • 22
    suffix arrays for phrase extraction
    Java suffix array library for phrase discovery. Inspired initially by the classic paper of Yamamoto & Church, with newer ideas from Abouelhoda et al. and Kim et al. Adapted for large alphabets so that words can be tokenized as alphabet characters.
    Downloads: 0 This Week
  • 23
    This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.
    Downloads: 0 This Week
  • 24
    BANAL - Banal And Not A Language. A prototyping notation compatible with Java and C# (via the largest possible common footprint between the two).
    Downloads: 0 This Week
  • 25

    cafetiere

    Rule-based information extraction.

    UIMA-compliant text analytics using a rule language in which to express context-sensitive constraints on syntactic and semantic text elements.
    Downloads: 0 This Week