Showing 741 open source projects for "python web crawler"

  • 1
    V2RayCloudSpider

    V2RSS is an "ecological mining machine" that performs vertical mining on global providers built on the SSPanel-Uim framework. It can generate bottom-up "aggregation collection" tasks for mainstream protocol headers and can self-digest; compared with proxypool, its output is purer, more reliable proxy nodes. It also has powerful production features such as self-discovery and service self-healing.
    Downloads: 0 This Week
  • 2
    Webifier

    A GitHub Action to deploy Notebooks, Markdowns

    Webifier is a stand-alone build tool for converting any repository into a deployable Jekyll website. You can define your pages via YAML files and provide notebooks, Markdown, PDF, and other files for Webifier to render. It uses Python-Markdown, which provides additional control over attributes and other extended functionality. It lets you define and direct how your web pages feel and automatically manages your assets, making it a perfect solution for fast static website development...
    Downloads: 0 This Week
  • 3
    Spyne

    A transport agnostic sync/async RPC library

    Spyne is a Python RPC toolkit that makes it easy to expose online services that have a well-defined API using multiple protocols and transports. It integrates with popular Python web frameworks as well as libraries like SQLAlchemy to keep your code as DRY as possible. Spyne aims to save protocol implementers the hassle of implementing their own remote procedure call API, and application programmers the hassle of jumping through hoops just to expose their services using multiple protocols...
    Downloads: 0 This Week
  • 4
    Raiden Network

    Use Raiden to enrich your dApp with nearly instant, low-fee and scalable payments. It comes with an easy-to-use API and is compatible with the Ethereum ERC20 token standard. Incentivized, decentralized P2P live streaming with micropayments using Raiden, introducing “Proof-of-Stream-Payment”. The Raiden Network is an infrastructure layer on top of the Ethereum Blockchain. While the basic idea is simple, the underlying protocol is quite complex and the implementation non-trivial. Nonetheless...
    Downloads: 0 This Week
  • 5
    Docksal

    Docksal is a tool for defining and managing web development

    Running Docker on macOS and Windows is supported via VirtualBox as well as Docker for Mac / Docker for Windows. Docksal project stacks are pre-loaded with common development tools, which include Composer, PHP Code Sniffer, php-cli, node, npm, ruby, bundler, and python. For PHP, Drupal, and WordPress development, it comes with Drush, Drupal Console, and WP-CLI. There is built-in support for Apache Solr, Varnish, Memcache, Selenium, and Behat. Since services are containerized with Docker...
    Downloads: 0 This Week
  • 6
    PySchool

    Installable / Portable Python Distribution for Everyone.

    PySchool is a free and open-source Python distribution intended primarily for students learning Python and data analysis, but it can also be used by scientists, engineers, and data scientists. It includes more than 150 Python packages (full edition) including numpy, pandas, scipy, sympy, keras, scikit-learn, matplotlib, seaborn, beautifulsoup4...
    Downloads: 816 This Week
  • 7
    Code Cookbook

    Reusable code patterns which you can use as reference or copy

    Reusable code patterns which you can use as reference or copy to your project. Achieve small or large tasks using recipes that contain steps, scripts, and config files.
    Downloads: 0 This Week
  • 8
    Photonix Photo Manager

    A modern, web-based photo management server

    A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms. This project is currently in development and not feature complete for a version 1.0 yet. If you don't mind putting up with broken parts or want to help out, run the Docker image and give it a go. I'd love for other...
    Downloads: 0 This Week
  • 9
    Webware for Python

    The Classic Webware for Python

    Webware for Python is a suite of components for dynamic, server-side web development.
    Downloads: 0 This Week
  • 10
    ReconSpider

    Most Advanced Open Source Intelligence (OSINT) Framework

    ... the capabilities of Wave, Photon and Recon Dog to do a comprehensive enumeration of attack surfaces. Reconnaissance is a mission to obtain information by various detection methods, about the activities and resources of an enemy or potential enemy, or geographic characteristics of a particular area. A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web, typically for the purpose of Web indexing (web spidering).
    Downloads: 11 This Week
  • 11
    FlaskBB

    A classic Forum Software in Python using Flask

    ... can customize every template in FlaskBB. Besides that, you can also install community-made themes. Our goal is to develop a forum system using modern technologies while sticking to well-tested and approved layouts. FlaskBB is a simple and extensible forum software that should be seen as the foundation for building communities. It is written in Python using the web framework Flask. FlaskBB is distributed under the BSD 3-Clause License.
    Downloads: 0 This Week
  • 12
    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have...
    Downloads: 1 This Week
  • 13
    Reminiscence

    Self-Hosted Bookmark And Archive Manager

    Bookmark links and edit their metadata (like title, tags, summary) via the web interface. Archive linked content in HTML, PDF, or full-page PNG format. Automatic archival of links to non-HTML content such as PDF, JPG, and TXT files; that is, bookmarking links to PDF, JPG, etc. via the web interface will automatically save those files on the server. Supports archival of media elements of a web page using third-party download managers. Directory-based categorization of bookmarks. Automatic tagging of HTML links...
    Downloads: 0 This Week
  • 14
    DracOS GNU/Linux Remastered
    DracOS GNU/Linux Remastered (https://github.com/dracos-linux) is an open source Linux operating system from Indonesia, built on the Debian live project and released under the GNU General Public License v3.0. It is a Linux distribution used for security testing (penetration testing). DracOS Linux is armed with hundreds of tools for pentesting, forensics, and reverse engineering. Use a GUI-based...
    Downloads: 20 This Week
  • 15
    restless

    A lightweight REST miniframework for Python

    A lightweight REST mini framework for Python. Works great with Django, Flask, Pyramid & Tornado, but should be useful for many other Python web frameworks. Based on the lessons learned from Tastypie & other REST libraries. Restless tries to be RESTful by default, but flexible enough. The main Resource class has data methods (that you implement) for all the main RESTful actions. It also uses HTTP status codes as correctly as possible. Restless is BYOD (bring your own data) and hence, works...
    Downloads: 0 This Week
  • 16
    ML workspace

    All-in-one web-based IDE specialized for machine learning

    All-in-one web-based development environment for machine learning. The ML workspace is an all-in-one web-based IDE specialized for machine learning and data science. It is simple to deploy and gets you started within minutes to productively build ML solutions on your own machines. This workspace is the ultimate tool for developers, preloaded with a variety of popular data science libraries (e.g., Tensorflow, PyTorch, Keras, Sklearn) and dev tools (e.g., Jupyter, VS Code, Tensorboard) perfectly...
    Downloads: 0 This Week
  • 17
    CEF Python

    Python bindings for the Chromium Embedded Framework (CEF)

    ... use cases for CEF. Use it as a modern HTML5-based rendering engine that can act as a replacement for classic desktop GUI frameworks. Think of it as Electron for Python. Embed a web browser widget in a classic Qt / GTK / wxPython desktop application. Use it for automated testing of web applications with more advanced capabilities than Selenium web browser automation, thanks to CEF's low-level programming APIs.
    Downloads: 2 This Week
  • 18
    The OMLX project is a place for preparing many projects to become open source projects.
    Downloads: 122 This Week
  • 19
    wfCodeEditor

    A simple portable Code Editor for any language

    A simple portable Code Editor for any language. Signed application. View a preview of features here: https://wyntonfranklin.github.io/wfcodeeditor/ Features: embedded snippets for various languages; add your own snippets; add your tasks; wfTutorials right at your fingertips; dark mode; portable; simple; command-line interface; works with various languages (PHP, Java, JavaScript, Python).
    Downloads: 0 This Week
  • 20
    Opyrator

    Turns your machine learning code into microservices with web API

    Instantly turn your Python functions into production-ready microservices. Deploy and access your services via HTTP API or interactive UI. Seamlessly export your services into portable, shareable, and executable files or Docker images. Opyrator builds on open standards - OpenAPI, JSON Schema, and Python type hints - and is powered by FastAPI, Streamlit, and Pydantic. It cuts out all the pain for productizing and sharing your Python code - or anything you can wrap into a single Python function...
    Downloads: 0 This Week
  • 21
    TransPose

    PyTorch Implementation for "TransPose, Keypoint localization

    TransPose is a human pose estimation model based on a CNN feature extractor, a Transformer Encoder, and a prediction head. Given an image, the attention layers built into the Transformer can efficiently capture long-range spatial relationships between keypoints and explain which dependencies the predicted keypoint locations rely on most.
    Downloads: 0 This Week
  • 22
    speedtest-cli

    Command line interface for testing internet bandwidth using speedtest

    ... connection performance metrics like download, upload, latency and packet loss natively without relying on a web browser. Test the internet connection of your Linux desktop, a remote server or even lower-powered devices such as the Raspberry Pi with the Speedtest Server Network. Set up automated scripts to collect connection performance data, including trends over time.
    Downloads: 2 This Week
  • 23
    kamiFaka

    Applicable to all kinds of e-commerce, coupons, forum invitation codes, recharge cards, activation codes, registration codes, Tencent iQiyi points CDK, etc. Supports manual and automatic delivery, and a tiered wholesale model similar to 1688. Stisla UI: the web interface is beautiful. The front end uses Vue 3.0 for millisecond-level response. It integrates Alipay face-to-face payment, WeChat official, Payjs, Hupijiao, YunGouOS, Yipay, Mugglepay, V visa-free and more than a dozen other payment...
    Downloads: 0 This Week
  • 24
    Linkchecker for Markdown

    Python asyncio + aiohttp Markdown *.md URL link checker

    Blazing-fast (10000 Markdown files per second) Python asyncio / aiohttp based simple check of links in Markdown .md files only. This tool is very helpful for large Markdown-based Jekyll and Hugo sites as well as Markdown-based MkDocs documentation projects. It is very fast and simple.
    Downloads: 0 This Week
  • 25
    Responder

    A familiar HTTP Service Framework for Python

    Responder is a web service framework, written for human beings. This gets you an ASGI app, with a production static files server (WhiteNoise) pre-installed, jinja2 templating (without additional imports), and a production web server based on uvloop, serving up requests with gzip compression automatically. A pleasant API, with a single import statement. Class-based views without inheritance. ASGI framework, the future of Python web services. WebSocket support! The ability to mount any ASGI / WSGI...
    Downloads: 0 This Week