Showing 39 open source projects for "html source extractor"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables. Also computes readability indexes (Gunning-Fog, Coleman-Liau, Automated Readability Index (ARI), SMOG grade,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Fuzzy machine learning framework

    Fuzzy machine learning framework

    A library and a GUI front-end for fuzzy machine learning

    Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on the intuitionistic fuzzy sets and the possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    TXM

    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP...
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    FastQC

    FastQC

    A quality control analysis tool for high throughput sequencing data

    FastQC is a quality control analysis tool designed to spot potential problems in high throughput sequencing datasets. Its goal is to provide a simple way by which to check the quality of raw sequence data coming from high throughput sequencing pipelines. It does this by running a modular set of analyses on one or more raw sequence files in fastq or bam format. It then produces a report summarizing the results, and highlighting any areas where the library may appear unusual. This should then...
    Downloads: 27 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    CloudMapper

    CloudMapper

    CloudMapper analyzes your Amazon Web Services (AWS) environments

    CloudMapper helps you analyze your Amazon Web Services (AWS) environments. The original purpose was to generate network diagrams and display them in your browser (functionality no longer maintained). It now contains much more functionality, including auditing for security issues. If you want to add your own private commands, you can create a private_commands directory and add them there. You must have AWS credentials configured that can be used by the CLI with reading permissions for the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    gml4gtk

    gml4gtk

    small console GML or graphviz DOT graph viewer

    directed graph layout program for GML (Graph-Markup-Language) graph or partial graphviz DOT graph
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Parallel text aligner dessigned to generate transation memories (TMX files) from two files tagged with any kind of XML-based tags. The application uses the tag structure and the text blok length to perform the alignment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    codelyzer

    codelyzer

    Static analysis for Angular projects

    ...In the configuration file can implement custom pre-processing and template resolution logic. Lint rules encode logic for syntactic & semantic checks of TypeScript, HTML, CSS and Angular expressions source code.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9

    bwfinder

    Exogenous feature extractor from brainwaves

    This program for Scilab searches for primary peaks in the segments of brainwave signal spectrum which are much above the signal spectrum average. Brainwave segments of 1 s are Fourier transformed, thresholded and merged if contiguous. Selected segments are drawn in time domain, frequency domain and a narrow-band analysis at 75 Hz is attempted. This program should detect strong external signals in brainwave recordings. Weak signals are not detected. This program requires the "edf2ascii"...
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 10

    Ghawwas_V4

    An open source system for Arabic corpora processing

    ...Accept Windows and UTF-8 character encoding g. Accept TXT, DOC, DOCX, RTF and HTML formats h. Export the processing results in CSV file format
    Downloads: 20 This Week
    Last Update:
    See Project
  • 11
    scriptLattes is an open-source system to create academic reports of groups based on curricula vitae of the Brazilian Lattes Database.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 12
    Universal Tag Finder

    Universal Tag Finder

    Is tool for query HTML content

    Universal Tag Finder is a tool to query against html content in the file system. For developers it allows to find out relevant elements during the troubleshooting and validating periods. Does not need to bother on massive regex to find out elements.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    **CODE MOVED TO GITHUB: https://github.com/bitextor ** Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    angulartics

    angulartics

    Analytics for AngularJS applications

    Avoid dealing with vendor-specific code. If your provider of choice is not among the existing plugins, just write your own. Google Analytics, Kissmetrics, Mixpanel, Flurry, Piwik, Segment.io and more are supported out of the box. Just add Angulartics to your app dependencies and you'll get automatic pageview tracking for your application routes. Download Angulartics and the Angulartics plugin for your vendor. Note: All plugins list Angulartics as a dependency, so they will be downloaded...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    P-VCD

    Content-Based Video Copy Detection

    Software for Video Copy Detection. The software has been described in some scientific papers, e.g. http://dx.doi.org/10.1109/ICME.2011.6012212 and http://dx.doi.org/10.1007/s11042-011-0915-x. This software is the result of my PhD at the University of Chile and the participation at TRECVID Content-Based Copy Detection (CCD) evaluation task 2010 and 2011. More details in http://www-nlpir.nist.gov/projects/tv2011/tv2011.html#ccd and in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The National Library of New Zealand's Metadata Extraction Tool automatically extracts preservation-related metadata from digital files, then output that metadata in XML formats. It can be used through a graphical user interface or command-line interface. Please take the latest code from 'https://github.com/DIA-NZ/Metadata-Extraction-Tool.git'. The code on source forge will not be updated henceforth as it is moved to github.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 19
    wview is a unix application for weather stations. It archives weather data from the station. Generates full html web sites with graphics. ftp/ssh export capability. MySQL/PostgreSQL archiving. International. Weather alarms. Data feed. RSS feed.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    Census2
    Census is a comprehensive cross-platform project manager and integration hub for NONMEM, the gold standard tool for model-based pharmacometric data analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    HadStat

    HadStat is service on cloud,for data analysis using Hadoop MapReduce.

    HadStat is service on the cloud, allow you to analysis the data on the cloud and return the result in nice graph,this service is free, you can redistribute it and/or modify it under the terms of the GNU General Public License. this service using many technologies , like Hadoop mapreduce, HTML, PHP, Web Service applications, linux server, java, eclipse IDE, with many indicators:Simple moving average (SMA),Exponential moving average (EMA),Smoothed simple moving average (SMMA),Linear...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    DeMMaTTouL
    Software de Mapas de Argumentos con interfaz gráfica diseñado con el objetivo específico (aunque no excluyente) de analizar la estructura argumentativa de demostraciones matemáticas, utilizando el modelo de Toulmin. Además, el contenido de los elementos puede ser tanto texto como imágenes, lo que permite guardar como imagen expresiones matemáticas escritas en LaTeX (utilizando software de apoyo como KLatexFormula). Se puede imprimir el diagrama y exportarlo a formato imagen (varios...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Feedisto is your personal newspaper. Your newspaper is collocted from rss feeds, parsed by a bayesian filter to rate its relevance and served as a static html page or on a webserver to train the filter. Writing plugins is very easy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Optex Analyzer is a software to analyze and compare algorithms to solve approximately optimization problems. It has a GUI that allows select a set of input files containing raw algorithm results. The analysis is shown with tables and charts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB