Showing 68 open source projects for "python web crawler"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Secure remote access solution to your private network, in the cloud or on-prem. Icon
    Secure remote access solution to your private network, in the cloud or on-prem.

    Deliver secure remote access with OpenVPN.

    OpenVPN is here to bring simple, flexible, and cost-effective secure remote access to companies of all sizes, regardless of where their resources are located.
    Get started — no credit card required.
  • 1
    istSOS

    istSOS

    Free and Open Source Sensor Observation Service Data Management System

    istSOS is an OGC SOS server implementation written in Python. istSOS allows for managing and dispatch observations from monitoring sensors according to the Sensor Observation Service standard. The project provides also a Graphical user Interface that allows for easing the daily operations and a RESTful Web api for automatizing administration procedures. istSOS is released under the GPL License, and runs on all major platforms (Windows, Linux, Mac OS X), even though tests were conducted...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    CloverDX

    CloverDX

    Design, automate, operate and publish data pipelines at scale

    .... Simple data manipulation jobs can be created visually. More complex business logic can be implemented using Clover's domain-specific-language CTL, in Java or languages like Python or JavaScript. Through its DataServices functionality, it allows to quickly turn data pipelines into REST API endpoints. The platform allows to easily scale your data job across multiple cores or nodes/machines. Supports Docker/Kubernetes deployments and offers AWS/Azure images in their respective marketplace
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    Vaex

    Vaex

    Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python

    Data science solutions, insights, dashboards, machine learning, deployment. We start at 100GB. Vaex is a high-performance Python library for lazy Out-of-Core data frames (similar to Pandas), to visualize and explore big tabular datasets. It calculates statistics such as mean, sum, count, standard deviation etc, on an N-dimensional grid for more than a billion (10^9) samples/rows per second. Visualization is done using histograms, density plots and 3d volume rendering, allowing interactive...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Clustergrammer

    Clustergrammer

    An interactive heatmap visualization built using D3.js

    Clustergrammer is a visualization library built using D3.js that enables intuitive interaction with high-dimensional data. Clustergrammer was built with biological data in mind. Clustergrammer is a web-based tool for visualizing high-dimensional data (e.g. a matrix) as an interactive and shareable hierarchically clustered heatmap. Clustergrammer's front end (Clustergrammer-JS) is built using D3.js and its back-end (Clustergrammer-PY) is built using Python. Clustergrammer produces highly...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Photo and Video Editing APIs and SDKs Icon
    Photo and Video Editing APIs and SDKs

    Trusted by 150 million+ creators and businesses globally

    Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
    Learn More
  • 5
    PyTom

    PyTom

    http://www.sciencedirect.com/science/article/pii/S1047847711003492

    PyTom is a toolbox developed for interpreting cryo electron tomography data. All steps from reconstruction, localization, alignment and classification are covered with standard and improved methods. Please sign up to our mailing list to keep up with the most recent updates and versions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    scipion-xmipp

    scipion-xmipp

    Image processing framework to integrate EM software packages.

    Scipion is an image processing framework to obtain 3D models of macromolecular complexes using Electron Microscopy (3DEM). It integrates several software packages and presents an unified interface for both biologists and developers. Scipion allows to execute workflows combining different software tools, while taking care of formats and conversions. Additionally, all steps are tracked and can be reproduced later on. Xmipp is a well-known package in the EM image processing. It is integrated...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Vaultier - Team Password Manager

    Vaultier - Team Password Manager

    Linux based collaborative password manager with strong encryption.

    Open source enterprise collaborative password manager and file storage. Download and install on your own servers or let us prepare your installation on dedicated server. Store and share your sensitive data safely and manage users who are allowed to access the information. Vaultier is based on advanced security technology (RSA over AES). All data stored in Vaultier databases are encrypted. Only authorized users with valid keys can read them. Encryption keys always stay on client...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    COV2HTML

    COV2HTML

    A visualization and analysis tool of Bacterial NGS data for Biologists

    COV2HTML provides an easy and 'in home' web interface for biologists that allows coverage visualization of the NGS alignment needed for the analysis. It combines two essential processes: (i) MAP2COV, a tool that converts the huge NGS mapping or coverage files into light specific coverage files which contains genetic elements informations. (ii) COV2HTML, a visualization interface allowing a real-time analysis of data with selected criteria. Thus this interface offers a visualization of NGS...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    slycat

    Web-based data science analysis and visualization platform.

    This is Slycat - a web-based data science analysis and visualization platform, created at Sandia National Laboratories. The goal of the Slycat project is to develop processes, tools and techniques to support data science, particularly analysis of large, high-dimensional data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • No-Nonsense Code-to-Cloud Security for Devs | Aikido Icon
    No-Nonsense Code-to-Cloud Security for Devs | Aikido

    Connect your GitHub, GitLab, Bitbucket, or Azure DevOps account to start scanning your repos for free.

    Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
    Start for Free
  • 10
    webStraktor is a programmable World Wide Web data extraction client. Its purpose is to scrape HTML based content via the HTTP protocol and extract relevant information. webStraktor features a scripting language to facilitate the collection, the extraction and the storage of information available on the web, including images. The scripting language uses elements of the Regular Expression and xPath syntax. The webStraktor scripting language has a small instruction set and its syntax is easy...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    WebChemViewer

    A simple program for sharing molecular structures with associated data

    Sharing lists of molecular structures with associated chemical properties is a common task in computer-aided drug design and medicinal chemistry. WebChem Viewer is a simple, free, open-source program that generates HTML-formatted output that can be viewed in any modern web browser, on any operating system (including mobile), without requiring the installation of additional software. The output can also be easily incorporated into existing web pages. WebChem Viewer is released under the FreeBSD...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Flightdeck-UI is a project that utilizes the ideas from the design of aircraft controls and instruments for creating computer user interfaces, such as for monitoring.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Pumilio
    Pumilio is a web-based sound analysis and archive system for almost any kind of sound file with tools to see the spectrogram of the sound, select regions for further analysis and insertion in a database, filtering, and many other manipulations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Portable Linux

    Portable Linux

    Portable Ubuntu Linux for Scientific Computing

    Released August 22, 2013 Lubuntu Blends: Biochemistry 13.04 (Raring) v5.44 Linux Kernel Image 3.8.0-29 Lubuntu Blends are pre-installed Wubi disk image remixes of Ubuntu and Debian Science meta packages, A custom boot loader allows installations to be copied and automatically booted from most external or USB flash drives. Once up and running, use earlier Lubuntu Remix README instructions here until documentation is updated....
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15

    CollectD-CSV Plotter

    Web interface for visualizing CSV data from CollectD monitoring system

    CollectD-CSV Plotter is a web interface for fetching and visualizing data that is saved in CollectD CSV format. An HTML form is used for defining the time interval and choosing the wanted measurements. The plotter is actually a front-end for collectd-csv module (http://pypi.python.org/pypi/collectd-csv/), which it uses for fetching the data. Technologies CollectD-CSV Plotter uses include CGI/Python, HTML5/JavaScript, jQuery, jQuery-UI and Gnuplot.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A Python interface to the gnuplot plotting program.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 17
    Karesansui is an open-source virtualization management application made in Japan. It's smart graphical user interface lowers your management cost, and brings a total management/audit solution for both physical and virtual servers. Full featured RESTful interface allows customizing and integration with other management/billing systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    totaldepth

    Petrophysical data processing and presentation.

    TotalDepth is an Open Source, cross platform, software collection that can process petrophysical data from the oil field such as wireline logs, seismic data and so on. Conventional, proprietary, software for petrophysical data tends to be expensive to licence, restrictive, slow to develop for and tied to expensive hardware. TotalDepth changes all of that. TotalDepth is open and cross-platform, and produces results straight to the bowser. TotalDepth supports such technologies such as...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A tool that parses SQL Select statements and generates a diagram. The diagram shows parts of the underlying SQL directly in the diagram. For example x=30 , GROUP BY (year), HAVING MIN(age) > 18. It is easy to see cartesian joins and/or loops.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    WebDX is a framework for deploying OpenDX applications over the web.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Data mines the voting record and other actions of Members of the UK Parliament. Extracts information from the parliament website and stores it in a database. Provides tools to analyse the information, producing statistics and tables about the MPs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    DataSync Suite
    DataSync Suite is an open source platform for integrating tools like Zimbra, SugarCRM, and Drupal. The tool is focused on a single sign-on, application data integration, and fast, flexible deployment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Python based webservice to manage and display information from some fiber-GISes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A collection of tools for working with the comparative data analysis ontology including import/export facilities for common phylogenetic file formats, and also a triple-store framework.
    Downloads: 0 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.