Showing 42 open source projects for "python web crawler"

  • 1
    Grafana

    Leading open-source visualization and observability platform

    Grafana Open Source is a leading open-source visualization and observability platform that lets you query, visualize, alert on, and explore your data—regardless of where it’s stored. With support for 100+ data source plugins (such as Prometheus, Loki, Elasticsearch, InfluxDB, SQL/NoSQL databases, OpenTelemetry, and more), you can unify metrics, logs, traces, and other observability signals in one place. Grafana OSS empowers you to build dynamic, reusable dashboards with rich...
    Downloads: 50 This Week
  • 2
    Autoplot

    Autoplot is an interactive browser for data on the web

    Autoplot is an interactive browser for data on the web. Give Autoplot a URL or local file name and it creates a sensible plot of the data. Autoplot allows you to interactively browse data stored in ascii, .cdf, netcdf, and many other formats. Autoplot's source has been moved to GitHub. Thanks to SourceForge for many years of hosting!
    Downloads: 2 This Week
  • 3
    Shynet

    Modern, privacy-friendly, and detailed web analytics

    Modern, privacy-friendly, and detailed web analytics that works without cookies or JS. There are a lot of web analytics tools. Unfortunately, most of them come with the following caveats. They require handing all of your visitors' info to a third-party company. They use cookies to track visitors across sessions, so you need to have those annoying cookie notices. They collect so much personal data that even the NSA is jealous. They are closed source and/or expensive, often with limited data...
    Downloads: 0 This Week
  • 4
    aseryla

    Aseryla code repositories

    This project describes a model of how the semantic human memory represents the information relevant to the objects of the world in text format. It provides a system and a GUI application capable of extracting and managing concepts and relations from English texts. https://aseryla2.sourceforge.io/
    Downloads: 0 This Week
  • 5
    Python4Proteomics Course

    Python course for Proteomics analysis

    Python course (in Spanish) for Proteomics analysis, based mainly on Jupyter Notebooks. For more information, see the readme.md file in the source code tree: https://sourceforge.net/p/lp-csic-uab/p4p/code/ci/default/tree/readme.md
    Downloads: 0 This Week
  • 6
    Xplico

    Xplico is a Network Forensic Analysis Tool (NFAT)

    Xplico is a Network Forensic Analysis Tool (NFAT). The goal of Xplico is to extract the application data contained in an internet traffic capture. For example, from a pcap file Xplico extracts each email (POP, IMAP, and SMTP protocols), all HTTP contents, each VoIP call (SIP, MGCP, MEGACO, RTP), IRC, WhatsApp... Xplico is able to classify more than 140 (application) protocols. Xplico can be used as a sniffer-decoder if used in "live mode" or in conjunction with netsniff-ng. Xplico is used...
    Downloads: 40 This Week
  • 7

    dadosSemiotica

    Collector and manager of semiotic analysis data

    This program is a web application to collect and organize text analysis data. It works with sets of texts, and the analyses are done on sentence-length portions. One of the preprocessing modules is based on CoGroo (A LibreOffice & OpenOffice.org Portuguese Grammar Checker).
    Downloads: 0 This Week
  • 8
    scipion-xmipp

    Image processing framework to integrate EM software packages.

    Scipion is an image processing framework to obtain 3D models of macromolecular complexes using Electron Microscopy (3DEM). It integrates several software packages and presents a unified interface for both biologists and developers. Scipion allows executing workflows that combine different software tools, while taking care of formats and conversions. Additionally, all steps are tracked and can be reproduced later on. Xmipp is a well-known package in EM image processing. It is integrated...
    Downloads: 0 This Week
  • 9

    slycat

    Web-based data science analysis and visualization platform.

    This is Slycat - a web-based data science analysis and visualization platform, created at Sandia National Laboratories. The goal of the Slycat project is to develop processes, tools and techniques to support data science, particularly analysis of large, high-dimensional data.
    Downloads: 0 This Week
  • 10
    A collection of software made by Milos Rancic.
    Downloads: 0 This Week
  • 11
    A web-based interface for the Hadoop MapReduce framework that simplifies the process of writing and running MapReduce jobs. Aimed at introducing parallelism concepts in introductory computer science courses.
    Downloads: 0 This Week
  • 12
    HeWIT helps you create and fill out forms. It tells you if you've missed anything or made any mistakes. You can then send the form over email, upload to a web site, or pass it by memory stick to whoever needs it next.
    Downloads: 0 This Week
  • 13
    This project aims to be an easy-to-use toolkit of algorithms and utilities for semantic data mining. So far all algorithms are implemented as web services and we provide widgets for their use in the Orange4WS data mining platform.
    Downloads: 0 This Week
  • 14
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week
  • 15
    Ex-Crawler
    Ex-Crawler is divided into 3 subprojects (crawler daemon, distributed GUI client, and (web) search engine) which together provide a flexible and powerful search engine supporting distributed computing. More information: http://ex-crawler.sourceforge.net
    Downloads: 1 This Week
  • 16
    Data mines the voting record and other actions of Members of the UK Parliament. Extracts information from the parliament website and stores it in a database. Provides tools to analyse the information, producing statistics and tables about the MPs.
    Downloads: 0 This Week
  • 17
    Python-based web service to manage and display information from some fiber-GISes.
    Downloads: 0 This Week
  • 18
    The NetEpi project is developing open-source, network-enabled tools for epidemiology and public health practice.
    Downloads: 0 This Week
  • 19
    Web-as-corpus tools in Java: a simple crawler (with integration with Nutch and Heritrix), an HTML cleaner to remove boilerplate code, language recognition, and a corpus builder.
    Downloads: 0 This Week
  • 20
    A univariate and multivariate analysis UI. This project is no longer under development. Please use as you wish.
    Downloads: 10 This Week
  • 21
    ConDEnSE (Confidential Data Enabled Statistical Exploration) will be a web-based environment for statistical analysis of confidential data from various database sources, based on Plone and R, and using the Jackknife method of confidentiality protection.
    Downloads: 0 This Week
  • 22
    Crawl a set of files, accumulating information on the temporal and spatial extent of the data in each file, for later search and retrieval.
    Downloads: 0 This Week
  • 23
    This is a prototype implementation of "A Network of Ideas" (ANOI) content system.
    Downloads: 0 This Week
  • 24
    A toolkit for crawling information from web pages by combining different kinds of "actions". Actions are simple operations such as navigating to a specified URL or extracting text from the HTML. A graphical user interface is also available.
    Downloads: 0 This Week
  • 25
    The Stats Jam project is an extension to MediaWiki that allows users to embed database queries and visualisations into their wiki pages.
    Downloads: 0 This Week