Showing 35 open source projects for "html source extractor"

  • 1
    folium

    Python data, Leaflet.js maps

    folium builds on the data wrangling strengths of the Python ecosystem and the mapping strengths of the Leaflet.js library. Manipulate your data in Python, then visualize it on a Leaflet map via folium. folium makes it easy to visualize data that’s been manipulated in Python on an interactive Leaflet map. It enables both the binding of data to a map for choropleth visualizations as well as passing rich vector/raster/HTML visualizations as markers on the map. The library has a number of...
    Downloads: 13 This Week
  • 2
    wttr.in

    The right way to check the weather

    wttr.in is a console-oriented weather forecast service that supports various information representation methods like terminal-oriented ANSI-sequences for console HTTP clients (curl, httpie, or wget), HTML for web browsers, or PNG for graphical viewers. Originally started as a small project, a wrapper for wego, intended to demonstrate the power of the console-oriented services, wttr.in became a popular weather reporting service, handling tens of millions of queries daily.
    Downloads: 1 This Week
  • 3
    Selenium-python Helium

    Selenium-python but lighter: Helium is the best Python library

    Under the hood, Helium forwards each call to Selenium. The difference is that Helium's API is much more high-level. In Selenium, you need to use HTML IDs, XPaths and CSS selectors to identify web page elements. Helium on the other hand lets you refer to elements by user-visible labels. As a result, Helium scripts are typically 30-50% shorter than similar Selenium scripts. What's more, they are easier to read and more stable with respect to changes in the underlying web page. Selenium-python...
    Downloads: 0 This Week
  • 4
    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables. Also computes readability indexes (Gunning-Fog, Coleman-Liau, Automated Readability Index (ARI), SMOG grade,...
    Downloads: 3 This Week
  • 5
    CAMPARI

    Software for molecular simulations and trajectory analysis

    We are proud to introduce version 5 of CAMPARI. We have added a number of new features, most notably a Python interface for interpreting user-supplied code (with the help of ForPy), a novel trajectory storage standard (with the help of libpqxx/PostgreSQL), and a module for performing transition path theory. Naturally, CAMPARI continues to provide the reference implementation of the ABSINTH force field paradigm and implicit solvation model. CAMPARI is a joint package for performing and...
    Downloads: 8 This Week
  • 6
    Argos Translate

    Open-source offline translation library written in Python

    Argos Translate uses OpenNMT for translations and can be used as either a Python library, command-line, or GUI application. Argos Translate supports installing language model packages which are zip archives with a ".argosmodel" extension containing the data needed for translation. LibreTranslate is an API and web-app built on top of Argos Translate. Argos Translate also manages automatically pivoting through intermediate languages to translate between languages that don't have a direct...
    Downloads: 138 This Week
  • 7
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
    Downloads: 0 This Week
  • 8
    Graphical tool for managing database of geocaches and waypoints, importing GPX files and exporting geocaches to many formats including GPX, POI, HTML and others.
    Downloads: 0 This Week
  • 9
    weewx weather software

    Python program for getting the most from your weather station

    WeeWX is a free, open-source software program, written in Python, which interacts with your weather station to produce graphs, reports, and HTML pages. Downloads can be found at http://weewx.com/downloads. Development happens at https://github.com/weewx/weewx.
    Downloads: 0 This Week
  • 10
    ADOMA
    ADOMA stands for: Alternative Display Of Multiple Alignment. ADOMA can create four different displays of a multiple sequence alignment: a ClustalW alignment in HTML format, a simplified ClustalW alignment in HTML and/or txt format and a colored ClustalW alignment in HTML format. For examples of these output files check the screenshots. ADOMA uses ClustalW to create the multiple alignment from DNA or protein sequences and displays them slightly differently than the normal output of ClustalW....
    Downloads: 1 This Week
  • 11

    quichem

    Utilities for fast input of chemical formulae and equations.

    *** New releases of quichem will only be available on GitHub. *** quichem is a utility written in pure Python allowing for fast input and parsing of chemical formulae and equations. Entering data does not require any modifier keys or keystrokes to be pressed. This SourceForge page provided pre-built Windows binaries of quichem-pyside. **To run quichem on Mac OS X or Linux**, or with the wxPython front-end, please download the source from quichem's GitHub page at: github.com/spamalot/quichem.
    Downloads: 0 This Week
  • 12

    WebChemViewer

    A simple program for sharing molecular structures with associated data

    Sharing lists of molecular structures with associated chemical properties is a common task in computer-aided drug design and medicinal chemistry. WebChem Viewer is a simple, free, open-source program that generates HTML-formatted output that can be viewed in any modern web browser, on any operating system (including mobile), without requiring the installation of additional software. The output can also be easily incorporated into existing web pages. WebChem Viewer is released under the FreeBSD license. It was created by Jacob Durrant, a post-doc in the lab of Rommie E. ...
    Downloads: 0 This Week
  • 13
    pyMantis
    pyMantis is a data-management system for (systems) biology built on the web2py framework. It features: tree-based file explorer, relational db table wizard with automated creation of user interfaces, internal and external access management, wiki, ..
    Downloads: 0 This Week
  • 14
    WTMParse

    Python script to parse Unix-like WTMP files

    Python script that was originally intended for use in forensic examinations that parses WTMP files from Unix-like operating systems and generates a CSS-styled HTML report containing login terminal, username, log start date and login time/date in a table. Good for postmortem forensic examinations or as a way of getting "last" like information where you don't have the ability to boot the machine in question but can grab the wtmp.
    Downloads: 0 This Week
  • 15
    Cross-platform (Python 2.5) tool for exporting data via SMTP (email) to an Endress-Hauser Fieldgate DM3 Server. This tool may be used with any MySQL database historian, even though it was originally an extension for mod_openopc.
    Downloads: 0 This Week
  • 16
    GUANO

    GUANO - Graphical User interface for performing ANalysis Of variance

    Free and open source standalone program capable of conducting between, within, and mixed analyses of variance (ANOVA). Provides a simple graphical user interface for specifying analyses and interaction plots (analyses performed by http://code.google.com/p/pyvttbl/).

    Features:
    - Capable of high order factorial designs (> 2 factors)
    - Within and mixed analyses of variance provide corrections for violations of sphericity (Huynh-Feldt, Greenhouse-Geisser, Box)
    - A variety of data transformations can be applied (log10, reciprocal, arcsine, square-root, and Windsor)
    - Generalized eta-squared measures of effect size
    - Post-hoc power analysis (should match G*Power)
    - Outputs include tables of estimated marginal means
    - Up to 4-way interaction plots with errorbars (png, svg)
    - Confidence intervals account for within-subject variability (where applicable; Loftus and Masson, 1994)
    - Non-proprietary HTML output files
    - Non-proprietary codebase

    Gotchas:
    - Assumes balanced designs
    Downloads: 0 This Week
  • 17

    WebSynonymExtractor

    a synonym extractor based on web-corpora and a multilingual translator

    This project is an approach to synonym extraction and to extending WordNet with the synonyms found. The Python application is realised as a kind of pipeline that starts with a web-corpus reader, which is followed by several workers (tokenizers, lemmatizers, ...) and finally completed by a result writer. In contrast to state-of-the-art approaches, this implementation is based on single words found on the web, which is used as a corpus, and translated into other languages. If translations of different...
    Downloads: 0 This Week
  • 18
    TinyGPX
    Application for geocaching or waymarking that allows you to import GPX or LOC files for editing. You can print out cheat sheets to take on the trail, export to KML or HTML, and send the data to your handheld GPS unit.
    Downloads: 0 This Week
  • 19
    Feedisto is your personal newspaper. Your newspaper is collected from RSS feeds, parsed by a Bayesian filter to rate its relevance, and served as a static HTML page or on a webserver to train the filter. Writing plugins is very easy.
    Downloads: 0 This Week
  • 20
    Cute Thermowell
    Cute Thermowell is now part of LibreEngineering suite: http://sourceforge.net/projects/libreeng/
    Downloads: 0 This Week
  • 21
    This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
    Downloads: 0 This Week
  • 22
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week
  • 23
    A Python package to find a repetitive format pattern in HTML pages and extract information from them using this pattern. The idea is that pages containing some kind of list show a repetitive pattern to the human eye (the page format).
    Downloads: 0 This Week
  • 24
    An HTML scraper that uses machine learning frameworks to extract labelled fields from raw HTML. The project also involves the development of a tool to display the semi-structured data generated by the scraper component.
    Downloads: 0 This Week
  • 25
    Starting from various input files, we parse to find a list of addresses, then display them on a map.
    Downloads: 0 This Week