Showing 350 open source projects for "java open source"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • 1
    SweetOnionCCG2PTBConverter

    SweetOnionCCG2PTBConverter

    A tool that converts CCGBank to PTB

    Conversion between different grammar frameworks is of great importance to comparative performance analysis of the parsers developed on them. This tool can convert CCG derivations to PTB trees by using Max Entropy models as well as visualizing the tree graphs. The main technical innovation presented here is the effective conversion method which achieves a F score over 95%.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    WordSegment

    WordSegment

    wordseg project is a word segment module implemented by C#

    wordseg project is a word segment module implemented by C#. It is used to segment text into tokens and to label token's attribute according its context and semantic by front-maximum matching and CRF algorithms. The following are some sentences need to be segmented: 张晓晨和付仲恺一起坐在家(西坝河东里社区)里的沙发上看非诚勿扰。 百度公司的名字源于“众里寻他千百度”这诗句。 After above sentences be segmented by wordseg, the result as follows for each sentence: 张晓晨[PER] 和 付仲恺[PER] 一起 坐 在 家 ( 西坝河东里社区[LOC] ) 里 的 沙发[PDT] 上 看 非 诚 勿扰 。...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    CRFSharp

    CRFSharp

    CRFSharp is a .NET(C#) implementation of Conditional Random Field

    CRFSharp(aka CRF#) is a .NET(C#) implementation of Conditional Random Fields, an machine learning algorithm for learning from labeled sequences of examples. It is widely used in Natural Language Process (NLP) tasks, for example: word breaker, postagging, named entity recognized, query chunking and so on. CRF#'s mainly algorithm is the same as CRF++ written by Taku Kudo. It encodes model parameters by L-BFGS. Moreover, it has many significant improvement than CRF++, such as totally...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Roaat Regular Typographic Features
    I created a Khmer font. Because Khmer ist quite a descendant of the Brahmi script, adding OpenType and/or AAT (Apple Advanced Typography) features is necessary for realizing the letter rearrangement and reshaping. Because of the complexity of the Khmer script this task is quite challenging, and I still have problems in making the AAT features work properly. This project is meant to make the development process visible for everyone and allowing others to give valuable hints and comments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 5
    Maskouk : Arabic Collocations
    Maskouk: Arabic Collocations Dictionary المسكوكات اللفظية العربيو، المتلازمات المتواردات
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6

    BioContext

    Software for extraction of biomedical information from literature

    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    jaf_Utils

    A C++ library for Statistical Language Processing tasks.

    A C++ library for Statistical Language Processing tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    jaf_MT

    This implements a phrased-based hidden semi-Markov Model for SMT

    This package implements the phrased-based hidden semi-Markov model described: Jesús Andrés-Ferrer, Alfons Juan. A phrase-based hidden semi-Markov approach to machine translation. Procedings of European Association for Machine Translation (EAMT), 2009. pp. 168-175. This project depends on jaf_Utils: http://sourceforge.net/projects/jafutils/ Install it prior installation of jaf_MT.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    jaf_Kernels

    Similarity Word-Sequence Kernels for Sentence Clustering toolkit

    This project implements the techniques used in this paper: @INPROCEEDINGS{Andres10a, author = {Jesús Andrés-Ferrer and Germán Sanchis-Trilles and Francisco Casacuberta}, title = {Similarity Word-Sequence Kernels for Sentence Clustering}, booktitle = {Proceedings of the 8th International Workshop on Statistical Pattern Recognition}, year = {2010}, } This project depends on jaf_Utils: http://sourceforge.net/projects/jafutils/ Install it prior...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 10

    Java Analogical Modeling

    Analogical Modeling module for Java

    Analogical Modeling is an exemplar-based approach to machine learning which imitates human behavior in outcome prediction. Its design has been applied to many natural language and other phenomena which exhibit variable behavior. A Perl XS implementation is available from http://humanities.byu.edu/am/ . This project is a Java implementation of the same. For more information on Analogical Modeling, see http://en.wikipedia.org/wiki/Analogical_modeling .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    LMchess

    Language model demonstration using a fixed chess domain

    A chess game interface which returns the real time perplexity and entropy at turn intervals. Also provides probability and graphing over time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Mansour

    Arabic text analyzer

    Mansour is a simple application for analyzing digital text written in Arabic.منصور هو تطبيق مكتبي بسيط لتحليل النصوص الرقمية المكتوبة باللغة العربية.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted. The method details are explained in the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    This project has been moved to https://github.com/loomchild/maligna . All further development will be done there.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The Parenthesis Classifier takes the contents of a set of parentheses and classifies it into one of several categories. It includes a parenthesized-data extractor and the classifier.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    HanNanum - Korean POS Tagger
    HanNanum is a Korean Morphological Analyzer and POS Tagger. A plug-in component-based architecture is adapted to the new Java version for flexible use. You can find the work flow for morphological analysis, POS tagging, noun extraction, etc. Contact: kschoi@kaist.ac.kr hjjeong@world.kaist.ac.kr
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Distributed phrase-based machine translation training tool based on Hadoop.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    A system to perform analysis of large documents for the purpose of cataloging similar documents. Similarity is based upon contextual analysis of these documents done by identifying common words and proper nouns.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    jWords is a port of WORDS (by William Whitaker, a free latin-to-english dictionary program written in Ada), to Java. Besides the dictionary will be translated to the German language.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    WQuery is a domain-specific query language designed to process WordNet-like lexical databases. It may be used as a standalone application or as an API to a lexical database in Java based systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    ELIA(Eyegaze Language Integration Analysis) supports the analysis of eye-tracking data for studies in language processing. ELIA eases early analysis of data to enable iterative development of experiments in response to spoken language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    This is a project to determine an appropriate framework for ontologies for communication with and between robots, as well as possibly tools for reasoning about information in the framework.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The Simple Semantic Classifier classifies short chunks of natural language text into broad semantic classes that correspond to the OBO ontologies provided as input.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    This project is used to segment text into semantic parts by meaning of language model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    CORPSE (CORPus SEarch) is a powerful search engine written in Java. The aim is to provide an efficient implementation of a word level inverted index search with various cool functions that can be used on very large corpora.
    Downloads: 0 This Week
    Last Update:
    See Project