Showing 17 open source projects for "php-simple-html-dom-parser"

  • 1
    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. Download the latest version of TXM: http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerful CQP...
    Downloads: 6 This Week
  • 2

    LANGANA-E

    LANGANA-E is an English natural language parser

    LANGANA-E is an English natural language parser. It is the main structure that automatic understanding will be built upon. Automatic understanding can be used for answering questions automatically or finding an answer from a reference text archive.
    Downloads: 0 This Week
  • 3
    Leseratte is a Java parser for German written language. Currently, it contains a German lexicon (based on the Wiktionary), inflexion rules, a grammar and a parser. (Semantics component planned.) Usable as a Java library, also provides a graphical UI.
    Downloads: 0 This Week
  • 4

    Ghawwas_V4

    An open source system for Arabic corpora processing

    ...Two corpora frequency profile comparison based on MI, CHI, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient f. Accept Windows and UTF-8 character encoding g. Accept TXT, DOC, DOCX, RTF and HTML formats h. Export the processing results in CSV file format
    Downloads: 0 This Week
  • 5

    BioC

    A simple XML format to share text documents and annotations

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented.

    Project files contain:
    - simple code to hold/read/write data and perform sample processing
    - BioC-formatted corpora
    - BioC tools that work with BioC corpora

    BioC goals:
    - simplicity
    - interoperability
    - broad use
    - reuse

    There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.
    Downloads: 9 This Week
  • 6

    eNTranslator

    To aid translation of satsangs of Paramhamsa Nithyananda

    ...The auto-generated translations are then enriched with human alteration using an easy graphical user interface. Timestamp information may be synced, and a subtitle file or a simple textual output may be generated. Additionally, it is planned to use Google voice tools to add a voice-over from the translated text. Finally, the subtitles and translated audio (if any) would be muxed with the original video and uploaded.
    Downloads: 0 This Week
  • 7
    Mechaglot, Calculate Semantic Similarity

    Calculate semantic similarity for any human and human-like languages

    ...This is an alpha release; expect many things to improve, including the algorithms. Please go to Browse All Files to read a full description. The goal of this project is simple: input two sentences in the same language and obtain a number (from 0 to 1) denoting the similarity between them according to semantic categories. This project is modeled on my previous project: https://sourceforge.net/projects/semantics/ The difference is that this project does not use any database and accepts any strings as input. ...
    Downloads: 0 This Week
  • 8

    CLEiM

    Cross Lingual Education in Medicine

    ...This system has been developed by the Intelligent System Group GSI (http://www.esi.uem.es/gsi/) at UEM University. We do NER (Named Entity Recognition) based on the GATE platform. Installation is simple, and you can use it as a web application. It has been tested under apache-tomcat. The original system has been successfully used to carry out active learning activities with medical students. However, it could be of interest in many other fields of knowledge.
    Downloads: 0 This Week
  • 9

    Dendrarium

    A system for maintaining constituency parse trees

    Dendrarium is used to select and verify constituency parse trees generated by the Świgra parser. The system is used at the Institute of Computer Science of the Polish Academy of Sciences (Instytut Podstaw Informatyki PAN) to build Składnica, a treebank for Polish.
    Downloads: 0 This Week
  • 10
    gannu

    Java API and tools for performing NLP and other AI tasks

    Java API and tools for performing a wide range of AI tasks such as: word sense disambiguation (released), optimization (5 Evolutionary Algorithms Implemented ETA February 2014), opinion mining (ETA November 2014) and text wikification (ETA July 2014). Gannu includes some graphical interfaces for scientific purposes. When using Gannu please cite: Jiménez, F. V., Gelbukh, A. F. & Sidorov, G. (2013). Simple Window Selection Strategies for the Simplified Lesk Algorithm for Word Sense Disambiguation. In F. Castro, A. F. Gelbukh & M. González (eds.), MICAI (1) (pp. 217-227). Springer. ISBN: 978-3-642-45113-3. The zip file contains the Gannu jar, source, API documentation and the resources necessary for performing research. Gannu uses the following projects: Weka, JExcel API, Stanford POS Tagger and WordNet. ...
    Downloads: 0 This Week
  • 11
    A simple Java library for text and object-oriented code. The available packages include text analysis (Levenshtein and n-gram fingerprinting), a grammar framework, simple object persistence (very light and dependency-free), ...
    Downloads: 0 This Week
  • 12

    FullFiller

    Database benchmarking tool

    A very simple tool to automate benchmarking tests on MySQL DBs. It fills MySQL table columns, performs customized tests, and outputs the results in CSV format. It uses Xeger, a Java package for generating random text from regular expressions (http://code.google.com/p/xeger/). Xeger uses the dk.brics.automaton Java package developed by Anders Møller (http://cs.au.dk/~amoeller/automaton/index.html).
    Downloads: 0 This Week
  • 13

    JAVA Arabic Stemmer

    A Java class with one small function: stemming Arabic words

    A Java Arabic stemmer based on Shereen Khoja's algorithm. This Java class offers a function called stemWrod, which takes an Arabic word and returns its stem.
    Downloads: 0 This Week
  • 14
    A system to perform analysis of large documents for the purpose of cataloging similar documents. Similarity is based upon contextual analysis of these documents done by identifying common words and proper nouns.
    Downloads: 0 This Week
  • 15
    A simple Java GUI tool for looking at the Spectrum and Cepstrum of a sound clip.
    Downloads: 0 This Week
  • 16
    CHALICE
    Connecting Historical Authorities with Links, Contexts and Entities. CHALICE is a historic placename gazetteer for the UK, published as Linked Data and linked to other widely-used sources of placename reference information on the semantic web.
    Downloads: 0 This Week
  • 17
    This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.
    Downloads: 0 This Week