Showing 282 open source projects for "java open source"

View related business solutions
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • Financial reporting cloud-based software. Icon
    Financial reporting cloud-based software.

    For companies looking to automate their consolidation and financial statement function

    The software is cloud based and automates complexities around consolidating and reporting for groups with multiple year ends, currencies and ERP systems with a slice and dice approach to reporting. While retaining the structure, control and validation needed in a financial reporting tool, we’ve managed to keep things flexible.
    Learn More
  • 1
    This project has been moved to https://github.com/loomchild/maligna . All further development will be done there.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    The Parenthesis Classifier takes the contents of a set of parentheses and classifies it into one of several categories. It includes a parenthesized-data extractor and the classifier.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    HanNanum - Korean POS Tagger
    HanNanum is a Korean Morphological Analyzer and POS Tagger. A plug-in component-based architecture is adapted to the new Java version for flexible use. You can find the work flow for morphological analysis, POS tagging, noun extraction, etc. Contact: kschoi@kaist.ac.kr hjjeong@world.kaist.ac.kr
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Distributed phrase-based machine translation training tool based on Hadoop.
    Downloads: 0 This Week
    Last Update:
    See Project
  • HOA Software Icon
    HOA Software

    Smarter Community Management Starts Here

    Simplify HOA management with software that handles everything from financials to communication.
    Learn More
  • 5
    A system to perform analysis of large documents for the purpose of cataloging similar documents. Similarity is based upon contextual analysis of these documents done by identifying common words and proper nouns.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    jWords is a port of WORDS (by William Whitaker, a free latin-to-english dictionary program written in Ada), to Java. Besides the dictionary will be translated to the German language.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    WQuery is a domain-specific query language designed to process WordNet-like lexical databases. It may be used as a standalone application or as an API to a lexical database in Java based systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    ELIA(Eyegaze Language Integration Analysis) supports the analysis of eye-tracking data for studies in language processing. ELIA eases early analysis of data to enable iterative development of experiments in response to spoken language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    The Simple Semantic Classifier classifies short chunks of natural language text into broad semantic classes that correspond to the OBO ontologies provided as input.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Total Network Visibility for Network Engineers and IT Managers Icon
    Total Network Visibility for Network Engineers and IT Managers

    Network monitoring and troubleshooting is hard. TotalView makes it easy.

    This means every device on your network, and every interface on every device is automatically analyzed for performance, errors, QoS, and configuration.
    Learn More
  • 10
    This project is used to segment text into semantic parts by meaning of language model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    CORPSE (CORPus SEarch) is a powerful search engine written in Java. The aim is to provide an efficient implementation of a word level inverted index search with various cool functions that can be used on very large corpora.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A tool for large richly annotated parallel corpora preprocessing and Moses phrase-table extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Java program to create a (potentially multilingual) glossary of the unique words in any given Lojban text. Note that the Sourceforge page for this was superceded by the Bitbucket repository: https://bitbucket.org/pretoriusjf/vlastezba/overview Any further updates will be made there.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A linguistic tool to aid in the study of Linguistics/Phonology, specifically distinctive features of possible language sounds. Comprised of both a Visual C++ .NET version as well as a Java based web applet version. The C++ version has all but been ab
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Span-Gles
    A free Spanish - English Translator for Linux. It will translate a phrase (via internet) or single word (built-in dictionary.) Has capability to learn new words and is smart enough to find plural and feminine words. Written using Python/GTK under GPL
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    This is a PHP-5 library for language detection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    LexBase is a configurable lexical database manager. It reads lexical and semantic information from WordNet, allows flexible querying of the database, and supports programmatic addition and deletion of terms, word senses, and relations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    BD-1 is a configurable database manager designed to provide efficient search and natural representations of annotated text, storing key-value pairs, triples, or n-tuples of text or binary data. It runs memory-resident or from disk.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A simple Java GUI tool for looking at the Spectrum and Cepstrum of a sound clip.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    The Kyoto FST Decoder is a general decoding engine for Weighted Finite State Transducers. It features flexible XML-based configuration, beam-search decoding, and is able to output separate weights for weight tuning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    es-ve

    Diccionarios en Español para Venezuela

    Migrado a GithUB EN: https://github.com/christiangda/es-ve La mejor opción para verificar y corregir la gramática de tus documentos de LibreOffice escritos en español. La extensión incluye: Corrector Ortográfico, Tesauro de Sinónimos y Separación Silábica. ¡Hecho en Venezuela! Sus principales características son: Más DE 87.000 lemas y sus respectivas conjugaciones. Contiene el Lemario actualizado de RAE (Real Academia Española) Términos financieros e informáticos...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Alkindus is an automated solver for short monoalphabetic substitution ciphers without word divisions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    NooJ is used by linguists to describe linguistic phenomena and apply the formalized morphological, syntactic or semantic rules to corpora . It is used by non linguists in fields like psychology, sociology, history, literature studies as well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    PyAnnotation is a Python Library to access and manipulate linguistically annotated corpus files. Supported file formats are Kura XML, Elan XML and Toolbox files. A Corpus Reader API is provided to support statistical analysis within the NLTK.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Editor for formal grammars. Attempts to be universal – customizable for any grammatical formalism and any syntax. Provides features such as syntax checking and highlighting, transformations (refactoring) and advanced rule editor.
    Downloads: 0 This Week
    Last Update:
    See Project