Showing 62 open source projects for "python web crawler"

View related business solutions
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Jupytab

    Jupytab

    Display in Tableau data from Jupyter notebooks

    Jupytab allows you to explore in Tableau data which is generated dynamically by a Jupyter Notebook. You can thus create Tableau data sources in a very flexible way using all the power of Python. This is achieved by having Tableau access data through a web server created by Jupytab.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    istSOS

    istSOS

    Free and Open Source Sensor Observation Service Data Management System

    istSOS is an OGC SOS server implementation written in Python. istSOS allows for managing and dispatch observations from monitoring sensors according to the Sensor Observation Service standard. The project provides also a Graphical user Interface that allows for easing the daily operations and a RESTful Web api for automatizing administration procedures. istSOS is released under the GPL License, and runs on all major platforms (Windows, Linux, Mac OS X), even though tests were conducted under a Linux environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    CloverDX

    CloverDX

    Design, automate, operate and publish data pipelines at scale

    Please, visit www.cloverdx.com for latest product versions. Data integration platform; can be used to transform/map/manipulate data in batch and near-realtime modes. Suppors various input/output formats (CSV,FIXLEN,Excel,XML,JSON,Parquet, Avro,EDI/X12,HL7,COBOL,LOTUS, etc.). Connects to RDBMS/JMS/Kafka/SOAP/Rest/LDAP/S3/HTTP/FTP/ZIP/TAR. CloverDX offers 100+ specialized components which can be further extended by creation of "macros" - subgraphs - and libraries, shareable with 3rd...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    Vaex

    Vaex

    Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python

    Data science solutions, insights, dashboards, machine learning, deployment. We start at 100GB. Vaex is a high-performance Python library for lazy Out-of-Core data frames (similar to Pandas), to visualize and explore big tabular datasets. It calculates statistics such as mean, sum, count, standard deviation etc, on an N-dimensional grid for more than a billion (10^9) samples/rows per second. Visualization is done using histograms, density plots and 3d volume rendering, allowing interactive...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 5
    Clustergrammer

    Clustergrammer

    An interactive heatmap visualization built using D3.js

    Clustergrammer is a visualization library built using D3.js that enables intuitive interaction with high-dimensional data. Clustergrammer was built with biological data in mind. Clustergrammer is a web-based tool for visualizing high-dimensional data (e.g. a matrix) as an interactive and shareable hierarchically clustered heatmap. Clustergrammer's front end (Clustergrammer-JS) is built using D3.js and its back-end (Clustergrammer-PY) is built using Python. Clustergrammer produces highly interactive visualizations that enable intuitive exploration of high-dimensional data and has several biology-specific features (e.g. enrichment analysis, see Biology-Specific Features) to facilitate the exploration of gene-level biological data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    DSTK - DataScience ToolKit is an opensource free software for statistical analysis, data visualization, text analysis, and predictive analytics. Newer version and smaller file size can be found at: https://sourceforge.net/projects/dstk3/ It is designed to be straight forward and easy to use, and familar to SPSS user. While JASP offers more statistical features, DSTK tends to be a broad solution workbench, including text analysis and predictive analytics features. Of course you may specify...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    COV2HTML

    COV2HTML

    A visualization and analysis tool of Bacterial NGS data for Biologists

    COV2HTML provides an easy and 'in home' web interface for biologists that allows coverage visualization of the NGS alignment needed for the analysis. It combines two essential processes: (i) MAP2COV, a tool that converts the huge NGS mapping or coverage files into light specific coverage files which contains genetic elements informations. (ii) COV2HTML, a visualization interface allowing a real-time analysis of data with selected criteria. Thus this interface offers a visualization of NGS...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    slycat

    Web-based data science analysis and visualization platform.

    This is Slycat - a web-based data science analysis and visualization platform, created at Sandia National Laboratories. The goal of the Slycat project is to develop processes, tools and techniques to support data science, particularly analysis of large, high-dimensional data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    webStraktor is a programmable World Wide Web data extraction client. Its purpose is to scrape HTML based content via the HTTP protocol and extract relevant information. webStraktor features a scripting language to facilitate the collection, the extraction and the storage of information available on the web, including images. The scripting language uses elements of the Regular Expression and xPath syntax. The webStraktor scripting language has a small instruction set and its syntax is easy...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Run applications fast and securely in a fully managed environment Icon
    Run applications fast and securely in a fully managed environment

    Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.

    Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
    Try for free
  • 10

    WebChemViewer

    A simple program for sharing molecular structures with associated data

    Sharing lists of molecular structures with associated chemical properties is a common task in computer-aided drug design and medicinal chemistry. WebChem Viewer is a simple, free, open-source program that generates HTML-formatted output that can be viewed in any modern web browser, on any operating system (including mobile), without requiring the installation of additional software. The output can also be easily incorporated into existing web pages. WebChem Viewer is released under the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Flightdeck-UI is a project that utilizes the ideas from the design of aircraft controls and instruments for creating computer user interfaces, such as for monitoring.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Portable Linux

    Portable Linux

    Portable Ubuntu Linux for Scientific Computing

    Released August 22, 2013 Lubuntu Blends: Biochemistry 13.04 (Raring) v5.44 Linux Kernel Image 3.8.0-29 Lubuntu Blends are pre-installed Wubi disk image remixes of Ubuntu and Debian Science meta packages, A custom boot loader allows installations to be copied and automatically booted from most external or USB flash drives. Once up and running, use earlier Lubuntu Remix README instructions here until documentation is updated....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13

    CollectD-CSV Plotter

    Web interface for visualizing CSV data from CollectD monitoring system

    CollectD-CSV Plotter is a web interface for fetching and visualizing data that is saved in CollectD CSV format. An HTML form is used for defining the time interval and choosing the wanted measurements. The plotter is actually a front-end for collectd-csv module (http://pypi.python.org/pypi/collectd-csv/), which it uses for fetching the data. Technologies CollectD-CSV Plotter uses include CGI/Python, HTML5/JavaScript, jQuery, jQuery-UI and Gnuplot.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A Python interface to the gnuplot plotting program.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 15
    Karesansui is an open-source virtualization management application made in Japan. It's smart graphical user interface lowers your management cost, and brings a total management/audit solution for both physical and virtual servers. Full featured RESTful interface allows customizing and integration with other management/billing systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    totaldepth

    Petrophysical data processing and presentation.

    TotalDepth is an Open Source, cross platform, software collection that can process petrophysical data from the oil field such as wireline logs, seismic data and so on. Conventional, proprietary, software for petrophysical data tends to be expensive to licence, restrictive, slow to develop for and tied to expensive hardware. TotalDepth changes all of that. TotalDepth is open and cross-platform, and produces results straight to the bowser. TotalDepth supports such technologies such as...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    A tool that parses SQL Select statements and generates a diagram. The diagram shows parts of the underlying SQL directly in the diagram. For example x=30 , GROUP BY (year), HAVING MIN(age) > 18. It is easy to see cartesian joins and/or loops.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    WebDX is a framework for deploying OpenDX applications over the web.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Data mines the voting record and other actions of Members of the UK Parliament. Extracts information from the parliament website and stores it in a database. Provides tools to analyse the information, producing statistics and tables about the MPs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Python based webservice to manage and display information from some fiber-GISes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    A collection of tools for working with the comparative data analysis ontology including import/export facilities for common phylogenetic file formats, and also a triple-store framework.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    This software enables easy creation and sharing of district maps online.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    SnapLogic is an Open Source Data Integration framework that combines the power of state-of-the-art dynamic programming languages with standard Web interfaces to solve today's most pressing problems in data integration.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    WebBabel is a python web application using OpenBabel to convert files from one format to another. It runs under Windows, Mac or Linux on your desktop, workstation or laptop. It uses the Jmol (or Marvin) viewer to show the structures being converted.
    Downloads: 0 This Week
    Last Update:
    See Project