Showing 23 open source projects for "java html parser"

  • 1
    FastQC

    A quality control analysis tool for high throughput sequencing data

    FastQC is a quality control analysis tool designed to spot potential problems in high throughput sequencing datasets. Its goal is to provide a simple way by which to check the quality of raw sequence data coming from high throughput sequencing pipelines. It does this by running a modular set of analyses on one or more raw sequence files in fastq or bam format. It then produces a report summarizing the results, and highlighting any areas where the library may appear unusual. This should then...
    Downloads: 22 This Week
  • 2
    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. Download the latest version of TXM: http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerful CQP...
    Downloads: 7 This Week
  • 3

    Ghawwas_V4

    An open source system for Arabic corpora processing

    ... encoding g. Accept TXT, DOC, DOCX, RTF and HTML formats h. Export the processing results in CSV file format
    Downloads: 3 This Week
  • 4
    Universal Tag Finder

    A tool for querying HTML content

    Universal Tag Finder is a tool for querying HTML content in the file system. It lets developers find relevant elements during troubleshooting and validation, without having to write massive regular expressions.
    Downloads: 0 This Week
  • 5
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
  • 6

    P-VCD

    Content-Based Video Copy Detection

    Software for Video Copy Detection. The software has been described in some scientific papers, e.g. http://dx.doi.org/10.1109/ICME.2011.6012212 and http://dx.doi.org/10.1007/s11042-011-0915-x. This software is the result of my PhD at the University of Chile and the participation at TRECVID Content-Based Copy Detection (CCD) evaluation task 2010 and 2011. More details in http://www-nlpir.nist.gov/projects/tv2011/tv2011.html#ccd and in http://www-nlpir.nist.gov/projects/tvpubs/tv11.papers...
    Downloads: 0 This Week
  • 7

    HadStat

    HadStat is a cloud service for data analysis using Hadoop MapReduce.

    HadStat is a cloud service that lets you analyze data in the cloud and returns the results as graphs. The service is free; you can redistribute it and/or modify it under the terms of the GNU General Public License. It uses many technologies, such as Hadoop MapReduce, HTML, PHP, web service applications, a Linux server, Java, and the Eclipse IDE, and offers many indicators: Simple moving average (SMA), Exponential moving average (EMA), Smoothed simple moving average (SMMA), Linear weighted...
    Downloads: 0 This Week
  • 8

    ssSearchEngine

    keyword search engine for semi-structured data (Tables, lists,...)

    This application implements an approach to keyword-based search over semi-structured data available in HTML pages.
    Downloads: 0 This Week
  • 9
    Optex Analyzer is software for analyzing and comparing algorithms that approximately solve optimization problems. Its GUI lets you select a set of input files containing raw algorithm results. The analysis is presented in tables and charts.
    Downloads: 0 This Week
  • 10
    * Java classes for parsing text and either converting it to XML or evaluating it in Java. The parser is controlled by a textual script whose syntax, named ZBNF, is close to Backus-Naur Form. * Some conversion routines: C header or Java to XMI, XML documentation generation,
    Downloads: 0 This Week
  • 11
    Web-as-corpus tools in Java. * Simple crawler (with Nutch and Heritrix integration) * HTML cleaner to remove boilerplate code * Language recognition * Corpus builder
    Downloads: 0 This Week
  • 12
    OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.
    Downloads: 2 This Week
  • 13
    Open extensible system analysis report tool for Java, based on numerous open source analysis initiatives. The XML/XSL batch-processing framework produces integrated HTML/SVG reports of the system's current state and its development over time.
    Downloads: 3 This Week
  • 14
    A bioinformatics package for easily analysing ESTs (Expressed Sequence Tags). It provides functionality for EST submission, batch BLAST, a BLAST result parser, etc. easiEST contains Java GUI applications for analysing ESTs or other sequences
    Downloads: 0 This Week
  • 15
    Provides a GUI for the grammatical structure and relations (as parsed by the Stanford Parser) of any text. Contains a grammatical relation editor to modify, import, and export grammatical relation definitions (tregex patterns and features).
    Downloads: 1 This Week
  • 16
    Docgen plugin for Protege provides a quick export of all the content of an ontology (classes, instances and documentation) in various formats (html, pdf, fo...). Images, graphs, and URLs are readily included in reports.
    Downloads: 1 This Week
  • 17
    This program helps you understand the content of any MPEG compressed video stream by displaying its structure and content in a tree-based view.
    Downloads: 0 This Week
  • 18
    The aim of MIEX (Metadata and Information Extractor from small XML documents) is to create a wrapper for the Stanford Parser, to extract and store metadata (syntactic structures, relationships among words...) from simple XML documents.
    Downloads: 0 This Week
  • 19
    TagHybrida is a French hybrid syntactic parser. TagHybrida is a four-stage parser combining hand-written and corpus-based information.
    Downloads: 0 This Week
  • 20
    XML Parser, Data Processing API and Specific Implementations for SensorML.
    Downloads: 0 This Week
  • 21
    HyperSpider (Java app) collects the link structure of a website. Data import/export from/to database and CSV-files. Export to Graphviz DOT, Resource Description Framework (RDF/DC), XML Topic Maps (XTM), Prolog, HTML. Visualization as hierarchy and map.
    Downloads: 0 This Week
  • 22
    OBIEE UDML (Universal Database Markup Language) parser that provides XML output, enabling other applications to take advantage of the metadata.
    Downloads: 0 This Week
  • 23
    A knowledge management system written in Java, running under JBoss 4.2.3 Server with RichFaces 3.3.0BETA4. Includes file conversion from HTML to PDF and a rich:editor component that requires no special syntax.
    Downloads: 0 This Week