Showing 73 open source projects for "python web crawler"

  • 1
    Kinto

    A generic JSON document store with sharing and synchronisation options

    Kinto is a minimalist JSON storage service with synchronization and sharing abilities. It is meant to be easy to use and easy to self-host. Kinto is used at Mozilla and released under the Apache v2 license. It’s hard for frontend developers to respect users' privacy when building applications that work offline, store data remotely and synchronize across devices. Existing solutions either rely on big corporations that crave user data or require a non-trivial amount of time and expertise to...
    Downloads: 2 This Week
  • 2
    Auto Bangumi

    Automated Bangumi episode downloader and organizer with Web UI

    Auto_Bangumi is a fully automated tool for downloading, organizing, and tracking anime (Bangumi) episodes using RSS feeds and download clients like qBittorrent. It offers a modern Web UI for managing subscriptions, custom filtering rules, automatic file renaming, and subtitle matching. Designed for anime fans, it streamlines the process of staying up-to-date with seasonal shows by integrating feed parsing, downloading, and library organization.
    Downloads: 1 This Week
  • 3
    ipwb

    A distributed and persistent archive replay system using IPFS

    InterPlanetary Wayback (ipwb) facilitates permanence and collaboration in web archives by disseminating the contents of WARC files into the IPFS network. IPFS is a peer-to-peer content-addressable file system that inherently allows deduplication and facilitates opt-in replication. ipwb splits the header and payload of WARC response records before disseminating into IPFS to leverage the deduplication, builds a CDXJ index with references to the IPFS hashes returned, and combines the header...
    Downloads: 0 This Week
  • 4
    qBittorrent

    A free and reliable P2P BitTorrent client

    An advanced and multi-platform BitTorrent client with a nice Qt user interface as well as a Web UI for remote control and an integrated search engine. qBittorrent aims to meet the needs of most users while using as little CPU and memory as possible.
    Downloads: 384,837 This Week
  • 5
    elFinder
    elFinder is a file manager for the web, similar to the one you use on your computer. Written in JavaScript using jQuery UI, it just works in any modern browser. Its creation was inspired by the simplicity and convenience of the Finder.app program used in Mac OS X.
    Downloads: 4 This Week
  • 6
    ipfs-css

    Single-purpose css class names and font-face config to IPFS up your UI

    The single-purpose CSS class names and @font-face config to IPFS up your UI. Once you've installed ipfs-css from npm, the CSS and SCSS files and the web-fonts are available from your node_modules/ipfs-css directory. You can import the theme.json file which can be used with a ThemeProvider component. All the CSS atoms are generated from that, so you can be sure you're using the same values. While ipfs.css contains everything you need, if you prefer variables for fonts, colors, and gradients...
    Downloads: 0 This Week
  • 7
    e-Dokyumento

    e-Dokyumento is a web-based Document Management System (DMS)

    e-Dokyumento is an open source, web-based Document Management System (DMS) that automates the basic office document workflow, such as receiving, filing, routing, and approving, through capturing (scanning), digitizing (OCR), storing, tagging, and electronically routing and approving (e-signature) electronic documents. Demo: https://e-dokyumento.herokuapp.com/ https://edokyu.seillig.com/ (refer to Readme.md for the accounts...
    Downloads: 19 This Week
  • 8
    galacteek

    Multi-platform browser for the distributed web

    galacteek is a multi-platform Qt5-based browser and semantic agent for the distributed web. Be sure to install all the gstreamer packages on your system to be able to use the mediaplayer. After opening/mounting the DMG image, hold Control and click on the galacteek icon, and select Open and accept. You probably need to allow the system to install applications from anywhere in the security settings. Docker images are available. They run the full GUI inside a virtual Xorg server (using Xvfb...
    Downloads: 1 This Week
  • 9
    IPFS Publish

    Continuous Delivery of static websites to IPFS

    This tool aims to enable the automatic publishing of static webpages from Git repositories to IPFS. It consists of two parts: a small web server and a management CLI. The web server exposes an endpoint that you use as your Git webhook. When the hook is invoked, it clones your repo, builds it (if needed), adds it to the IPFS node (pinning it if configured), and publishes the new IPFS address under the configured IPNS name.
    Downloads: 0 This Week
  • 10
    Zero Install
    Zero Install is a decentralised cross-distribution software installation system. Create one package that works everywhere! It offers dependency handling and automatic updates, full support for shared libraries, and integration with native package managers.
    Downloads: 20,029 This Week
  • 11
    --- IMPORTANT: This project has been moved to GitHub at https://github.com/clstoulouse/motu-client-python. Download the latest version from the release page https://github.com/clstoulouse/motu-client-python/releases. --- Motu is a highly efficient and robust web server that bridges the gap between heterogeneous data providers and end users. Motu handles, extracts, and transforms huge volumes of oceanographic data without performance collapse. This client enables users to extract and download...
    Downloads: 1 This Week
  • 12
    TxPB

    TemporalX gRPC and Documentation Resources

    TxPB is the main source of information for using TemporalX, and includes protocol buffer definitions, generated code, and documentation. If you are looking for the protocol buffer definitions, check the pb folder. If you want bindings in other languages, feel free to open a GitHub issue. Alternatively, you can generate language bindings independently as long as the languages you are using support the appropriate protocol buffer and gRPC versions.
    Downloads: 0 This Week
  • 13
    Beamology

    Beamology: Semicom Project Management

    Beamology is the leading field of project management for SuperWikia Alpha semicom codesmiths. Its residency programs offer vital resources for SuperWikia vertical environment development and utilities needed for SA-VEC certification. Beamology residents are automatically licensed to access the Transcom system.
    Downloads: 0 This Week
  • 14
    Decentralized Internet

    SDK for building decentralized web and distributed computing projects

    This project was created in order to support a new internet. One that is more open, free, and censorship-resistant in comparison to the old internet. An internet that eventually wouldn't need to rely on telecom towers, an outdated grid, or all these other "old school" forms of tech. We believe P2P compatibility is an important part of the future of the net. Grid Computing also plays a role in having a better means of transferring information in a speedy, more cost-efficient and reliable manner.
    Downloads: 0 This Week
  • 15
    Nyaa.si Batch Torrent Upload

    Batch torrent uploader for Nyaa.si

    Batch upload via desktop to nyaa.si and sukebe.nyaa.si
    Downloads: 0 This Week
  • 16
    CODESH

    Automatic persistent logbooks for Linux shell sessions (bash, tcsh...)

    The CODESH (COllaborative DEvelopment SHell) project provides an automatic persistent logbook for sessions of personal command-line work by recording what is being done and how, both for private use and reuse and for sharing selected parts with collaborators.
    Downloads: 0 This Week
  • 17
    WebDAVSurfer

    64-bit WebDAV client that works with Plone 5, Apache, and more

    GUI WebDAV client for Linux and Windows 10. Includes PROPFIND, PROPPATCH, LOCK, UNLOCK, VERSION-CONTROL, and REPORT. Supports HTTP(S) with Basic Authentication and PKI client and server certificates. Works with Plone, Zope, Apache + mod_dav, PyWebDAV, PyDAV, and Tamino. Uses 64-bit wxPython. Upload files or from Web. Update properties. Tested with Plone 5.04.
    Downloads: 0 This Week
  • 18

    python-webrobot

    download video from various hosters

    Automatically downloads files and video streams from web hosters.
    Downloads: 0 This Week
  • 19
    LeechCraft
    LeechCraft is a cross-platform modular "internet-client" application. It has plugins that allow one to browse the web, chat via instant messaging (like Jabber or IRC), read RSS/Atom feeds, download files via BitTorrent, Direct Connect and other protocols, play media files and much more.
    Downloads: 6 This Week
  • 20
    diwacs

    Diwa is a project collaboration tool.

    Diwa is software meant to simplify sharing rich media and files that are important to the project between team members in the same room. It offers simple solutions, for example exporting URLs and files to another PC via drag and drop; the files are stored in the "Project Folder" and can have a database entry as well. Diwa also supports remote controlling another PC running the DiWaCS software and creating events which save screenshots from all the computers and optionally an image...
    Downloads: 1 This Week
  • 21
    TacoNET

    Cross-platform darknet written in Python

    TacoNET is an open-source, friend-to-friend, cross-platform, encrypted darknet written in Python. Currently in alpha, it's functional, but nowhere near what it will be. There will be GUI weirdness, useless error messages, spelling/grammar mistakes, and of course some features may not work 100% in all cases. This will, of course, improve over time. Project development has stopped here; all further development is at https://github.com/withorwithoutgod/taconet
    Downloads: 0 This Week
  • 22
    Enigmatic CGI pastebin
    A simple CGI script that lets you transfer sensitive ASCII data (passwords, hashes, PEM keys) via HTTPS. SSL must be implemented by the web server. Algorithm: (1) Paste new data into the textarea and submit; you get a link. The link consists of three MD5 hashes derived from a salt, and the file containing your data is named the same way, so the links and files are hard to brute-force. (2) Whenever the link is opened, the script reads the file and prints its content to the browser with the text/plain content type. After that, the script removes the file. So...
    Downloads: 0 This Week
  • 23
    baretorrent

    simple and portable bittorrent client with extensions

    This project has been moved to https://launchpad.net/baretorrent. There will be no more new releases on SourceForge. The web page is at http://baretorrent.org. Old description: This is a portable BitTorrent client with basic features along with support for extensions. The most recent release (18-jan-2013) is 0.4 for Windows, Debian 6.0.6, Ubuntu 12.04 and OSX 10.6.8. Only one extension is available so far: a simple server that is compatible with how the uTorrent WebUI adds torrents...
    Downloads: 2 This Week
  • 24
    openPLM - open source PLM
    Open source PLM system: a product structure management (BOM management) system and an electronic document management or Enterprise Content Management (ECM) system.
    Downloads: 1 This Week
  • 25
    This is for Expa Administration group only
    Downloads: 0 This Week