Showing 24 open source projects for "python web crawler"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • 1
    ArchiveBox

    ArchiveBox

    Open source self-hosted web archiving

    ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades. Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content. ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data....
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    Rockstor

    Rockstor

    BTRFS based NAS and private cloud storage solution

    ...These Rock-ons, combined with advanced NAS features, turn Rockstor into a private cloud storage solution accessible from anywhere, giving users complete control of cost, ownership, privacy and data security. Rockstor UI is written in Javascript, making it simple to manage everything from your Web browser. The backend is written in Python and exposes RESTful APIs to easily extend functionality!
    Downloads: 34 This Week
    Last Update:
    See Project
  • 3
    transmission_cleanup

    transmission_cleanup

    Clean up of torrent files using the RPC protocal

    This application connects to the tranmission web client using the RPC interface, it allows the user to set the inital download folder for the torrents for sorting into their own folders based on the type of file it is. it also allows scheduling of the cleaning process eithe daily or weekly at a time set by you in the install process. you supply your username and password for the RPC web interface whohc is encrypted by the application and saved to the disk, The application checks if the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    Delayter

    Utility to queue files for deferred deletion, days/weeks/months later

    Full documentation: Download delayterX.Y.html User has files that can probably be deleted later but does not feel comfortable deleting right now. Instruct with simple commands in which the file names and delay time are specified, eg.: Delayter -m 1 -w 2 -d 3 file1 file2 by which file1 and file2 are scheduled for deletion 1 month, 2 weeks and 3 days from now (roughly 47 days). Useful on projects with many temporary junk files that cannot be deleted until a later time at which you might...
    Downloads: 0 This Week
    Last Update:
    See Project
  • RMM Software | Remote Monitoring Platform and Tools Icon
    RMM Software | Remote Monitoring Platform and Tools

    Best-in-class automation, scalability, and single-pane IT management.

    Don’t settle when it comes to managing your clients’ IT infrastructure. Exceed their expectations with ConnectWise RMM, our MSP RMM software that provides proactive tools and NOC services—regardless of device environment. With the number of new vulnerabilities rising each year, smart patching procedures have never been more important. We automatically test and deploy patches when they are viable and restrict patches that are harmful. Get better protection for clients while you spend less time managing endpoints and more time growing your business. It’s tough to locate, afford, and retain quality talent. In fact, 81% of IT leaders say it’s hard to find the recruits they need. Add ConnectWise RMM, NOC services and get the expertise and problem resolution you need to become the advisor your clients demand—without adding headcount.
    Learn More
  • 5
    Configuration Backup (ConfiBack)

    Configuration Backup (ConfiBack)

    Project for backing up network device configuration

    Using this project you can make backup and track changes of configuration of network devices like switches, routers, etc.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    jQuery File Upload

    jQuery File Upload

    File Upload widget with multiple file selection

    jQuery-File-Upload is a mature, full-featured jQuery plugin (often paired with server-side handlers) for handling file uploads from the browser with advanced capabilities. It supports chunked uploads, drag and drop, multiple file selection, progress bars, client-side image resizing, and preview generation. On the server side, artifacts may be processed using compatible back-end scripts in languages like PHP, Ruby, Node.js, or Java, making the plugin cross-platform. Because uploads can be...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    diskover

    diskover

    File system crawler and disk space usage software

    diskover is a file system crawler and disk space usage software that uses Elasticsearch to index your file metadata. diskover crawls and indexes your files on a local computer or remote storage server over network mounts. diskover helps manage your storage by identifying old and unused files and give better insights into data change "hotfiles", file duplication "dupes" and wasted space. It is designed to help deal with managing large amounts of data growth and provide detailed storage...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    angular-filemanager

    angular-filemanager

    JavaScript file manager Material Design folder explorer

    A very smart filemanager to manage your files in the browser developed in AngularJS following Material Design styles by Jonas Sciangula Street. This project provides a web file manager interface, allowing you to create your own backend connector following the connector API. By the way, we provide some example backend connectors in many languages as an example (PHP-FTP, PHP-local, python, etc). Pick files callback for third parties apps. Directory tree navigation. Copy, Move, Rename (Interactive UX). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Cloud Export is a tool to automatically extract your data from web applications and save it to your local file system for backup purposes, but more extensive than Google Takeout. Plans are based on http://www.dataliberation.org.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Manage Your Call Center Without IT Icon
    Manage Your Call Center Without IT

    For Call Centers, Telemarketing, Customer Service, Inbound and outbound call centers

    CallShaper is a call center software and Predictive dialer designed to help reduce costs and increase ROI for Call Centers. CallShaper partners with businesses to maximize contacts, track the performance of agents, manage leads, telemarketing, and sales processes. The drag-and-drop interactive voice response (IVR) editor allows managers to transfer calls to third-party stakeholders and other recipients based on agents' availability, time, or type.
    Learn More
  • 10
    The archive-crawler project is building Heritrix: a flexible, extensible, robust, and scalable web crawler capable of fetching, archiving, and analyzing the full diversity and breadth of internet-accesible content.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 11
    RiverGlass EssentialScanner is an open source web and file system crawler which indexes the text content of discovered files so they can be retrieved and analyzed. It provides simple scanner capabilities as part of larger enterprise search solutions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Sushi, huh? is an aplication for download GNU/Linux packages from another OS or Linux distribution, for an posterior offline installation. Thinked for people that not have conexion to Internet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Backup and restore of files to web mail systems, ftp, sftp. Uses free storage of gmail/hotmail etc. Archives files, splits large files, encrypts and uploads. Requires python (tested with python 2.5)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Arrowbase is a collection of tools for backup persoses. Together they combine a backup system that can be used on more then one Operating system. This makes the project not only widely spread but portable as wel.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A configurable knowledge management framework. It works out of the box, but it's meant mainly as a framework to build complex information retrieval and analysis systems. The 3 major components: Crawler, Analyzer and Indexer can also be used separately.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    XSDB XML is to DATA as HTML is to DOCUMENT. Publish and combine data as easily as HTML format and web browsers publish and view documents. Implementations in Python, javascript, java, C#/.NET.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    A single purpose id3 tagger / file renamer which populates an album of songs with proper album/track information from Amazon Web Services (AWS).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    CAIRN is a modular copy and restore program for the imaging of a computer. It copies every file on a computer and figures out how to recreate it from scratch. It is primarily network oriented but is also flexible enough to boot from any possible method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Agile Author is a framework for developing networked repositories of digital information such as digital libraries and content management systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Lino is a suite of Python packages for IT professionals who offer business applications for small and medium-sized enterprises.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    idyuts is \"I Dare You to Use This Shell\"; a pre-hibernate approach to replacing an ORM written with jython functors into a pure-Java language command pattern. The \"pipeline codegen artifacts\" are simple IoC templates, and trivial to adapt
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    View, track, filter, archive, alert, group, rotate logs through a GUI, CLI, or WebUI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The Local Content Server transforms Plone into a robust Learning Object Repository. It supports IMS content package transport, content preview, and object identification and registration.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Rescuezilla

    Rescuezilla

    The Swiss Army Knife of System Recovery

    Rescuezilla is an easy-to-use disk cloning and imaging application that's fully compatible with Clonezilla — the industry-standard trusted by tens of millions. Yes, Rescuezilla is the Clonezilla GUI (graphical user interface) that you might have been looking for. **See: https://rescuezilla.com/ for download links** **NEW** Weekly rolling release downloads: https://github.com/rescuezilla/rescuezilla/releases Rescuezilla is a fork of Redo Backup and Recovery (now called Redo...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next