Showing 350 open source projects for "java open source"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 1
    TF-IDF.jar is a Java Archive file to measure TF-IDF of each document in a document collection (corpus). The jar can be used to (a) get all the terms in the corpus (b) get the document frequency (DF) and inverse document frequency (IDF) of all the terms in the corpus (c) get the TF-IDF of each document in the corpus (d) get each term with their frequency (no. of presence), term frequency (TF) and TF-IDF in every document
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    gannu

    gannu

    Java API and tools for performing NLP and other AI tasks

    Java API and tools for performing a wide range of AI tasks such as: word sense disambiguation (released), optimization (5 Evolutionary Algorithms Implemented ETA February 2014), opinion mining (ETA November 2014) and text wikification (ETA July 2014). Gannu includes some graphical interfaces for scientific purposes. When using Gannu please cite: *Jiménez, F.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    Language Constructor

    Complete tool for constructing/manipulating languages in digital form

    With this tool you can easily design a new language, digitize an existing one or incrementally reconstruct an ancient language. It allows for free experimentation of all aspects of the language, so it does not have to be made consistent on paper first. You can edit script, syntax, grammar, morphology, lexicon and phonology, as well as write documents in the language, as it might be too complex to be handled by current font technology. The information is stored in xml format for easy...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Java application for training and deploying text processing applications such as part-of-speech taggers, based on a re-implementation of Brill's algorithm in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Automate contact and company data extraction Icon
    Automate contact and company data extraction

    Build lead generation pipelines that pull emails, phone numbers, and company details from directories, maps, social platforms. Full API access.

    Generate leads at scale without building or maintaining scrapers. Use 10,000+ ready-made tools that handle authentication, pagination, and anti-bot protection. Pull data from business directories, social profiles, and public sources, then export to your CRM or database via API. Schedule recurring extractions, enrich existing datasets, and integrate with your workflows.
    Explore Apify Store
  • 5

    t2t-pipe

    automatic alignment pipeline for parallel treebanks

    The *Tree-to-Tree (t2t) Alignment Pipe* is a collection of python scripts, co-ordinating the process of automatic alignment of parallel treebanks from plain text files with a single call from a unix command line. Supported Languages: DE, FR, EN
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    TML - Text Mining Library for LSA & CMM

    TML is a Java Library for LSA and extracting Concept Maps from text

    TML has moved to http://www.villalon.cl/tml.html and the code to https://github.com/villalon/tml
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    BioLemmatizer

    Lemmatization tool for morphological analysis of biomedical literature

    The BioLemmatizer is a domain-specific lemmatization tool for the morphological analysis of biomedical literature. It is tailored to the biological domain through integration of several published lexical resources related to molecular biology. It focuses on the inflectional morphology of English, including the plural form of nouns, the conjugations of verbs, and the comparative and superlative form of adjectives and adverbs. README:...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Dex to Wiktionary

    Dex to Wiktionary

    Linguistic analysis

    Scopul proiectului este conversia definițiilor din DEX '98, luate de pe dexonline.ro, din text simplu în formatul structurat folosite de Wiktionary. Programul convertește definițiile în 3 pași: - descarcă și extrage definițiile de pe dexonline.ro - parsează definițiile și le împarte în părți mai mici care sunt clasificate - asamblează bucățile într-un format arborescent, pe care apoi îl parsează și generează formatul cerut de Wiktionary
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Perstem
    Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 10
    Donatus is an on-going project consisting of Python, NLTK-based tools and grammars for deep parsing and syntactical annotation of Brazilian Portuguese corpora. It includes a user-friendly graphical user interface for building syntactic parsers with the NLTK, providing some additional functionalities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    iGREAT is an open-source, statistical machine translation software toolkit based on finite-state models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    LanguageTool

    Proofreading Software for 20+ Languages

    LanguageTool is an Open Source language/grammar checker. *** THIS REPOSITORY IS OUT OF DATE, see https://github.com/languagetool-org INSTEAD ***
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    pdf2mp3

    Simply convert your PDF files into audio books

    Summary: Your eyes are tired of looking into the tablet or cell-phone screen reading ebooks? You have difficulty reading from LCD screen specially in a driving vehicle? This software is for you! It converts your PDF files to MP3 audio books. Special Features (Compared to similar projects): Each page is in a separate MP3 file. Created MP3 files have ID3v2 tags showing Book name and page number. Multi-threaded conversion, means all CPU cores will be used thus multiple times faster conversion.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Web site to translate text from Spanish to a regular Spanish called "espanes". This lenguage adaptation is very useful for learning Spanish because ia a simplified version with less verbal modes, accents enhanced, prefix, infix and suffix reduced....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Better PO Editor is an editor for .po files, used to generate compiled gettext .mo files which are used by many programs and websites to localize the user interface. It offers great features... It's worth to give it a try! PLEASE NOTE: the project moved to GitHub: see https://github.com/mlocati/betterpoeditor/releases
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A simulation package for investigating the dynamics of complex controversy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Fast Fuzzy Inference System
    FFIS or Fast Fuzzy Inference System is a portable and optimized implementation of Fuzzy Inference Systems. It supports both Mamdani and Takagi-Sugeno methods. The main idea behind this tool, is to provide case-special techniques rather than general solutions to resolve complicated mathematical calculations. This will lead to have more efficient defuzzification algorithms for Mamdani's model. Most systems in Mamdani's model can be defuzzified in O(n²) or even O(n) time which n is number of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    CRIS-IE-Smoking

    CRIS-IE-Smoking

    GATE based app to extract patient smoking status from free text

    This application was developed by the NIHR Biomedical Research Centre at the Institute of Psychiatry and South London and Maudsley NHS Foundation Trust, in collaboration with the University of Sheffield. Its purpose is to identify the smoking status of a individual, based on text evidence in clinical notes. Currently, it classifies patients as 'current', 'past' or 'never'. It runs on the GATE infrastructure, available at http://gate.ac.uk/. Please contact richard.g.jackson@slam.nhs.uk for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A simple java library for text and object oriented code. Among the different available packages, there are for text analysis (levenshtein and ngram fingerprinting), a grammar framework, simple object persistence (very light and dependence free), ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    CoocViewer

    Viewer for co-occurrences and positional co-occurrences

    A Demo is available at: http://coocviewer.sourceforge.net/coocviewer/index.php
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Hermes Natural Language Processing

    A repository of software, documentation and data for NLP

    Hermes is a repository of software, documentation and data for NLP. I am currently adding corpora extracted from Wikipedia (mostrly in Romance languages).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    ValiTerms

    ValiTerms

    Validation of terms in corpus

    ValiTerms is a tool that helps the validation of terms in corpus. It finds their occurrences and allows terminologists to choose if a term is relevant or not. ValiTerms is developed at LIPN (http://www-lipn.univ-paris13.fr), RCLN team. Please consult the wiki for instructions about installation and usage.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    NetBeans Dictionaries

    Additional dictionary files for the NetBeans spellchecker.

    Additional dictionary files for the NetBeans spellchecker.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    Obeliks

    Obeliks: Oblikoslovni označevalnik za slovenski jezik

    Dolgoročni cilj projekta Obeliks je izdelava in nadgrajevanje najbolj natančnega statističnega označevalnika za slovenski jezik. Oblikoskladenjsko označevanje je proces pripisovanja oblikoslovnih (in deloma skladenjskih) lastnosti besedam v poljubnem besedilu. Tako označeno besedilo je predpogoj za delovanje večine aplikacij, ki temeljijo na analizi naravnega jezika. Označevanje slovenskih besedil je zelo težak problem, saj mora algoritem za označevanje pravilno izbirati med skoraj dva tisoč...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Semiotics

    A story generation program.

    This is a study on automatic generation of stories. The aim is to explore the cognitive and literary fields to eventually implement models and tools to help you create new stories, poems, and scenarios from a minimum set of generic parameters.
    Downloads: 0 This Week
    Last Update:
    See Project