28 projects for "python web crawler" with 2 filters applied:

  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Sales CRM and Pipeline Management Software | Pipedrive Icon
    Sales CRM and Pipeline Management Software | Pipedrive

    The easy and effective CRM for closing deals

    Pipedrive’s simple interface empowers salespeople to streamline workflows and unite sales tasks in one workspace. Unlock instant sales insights with Pipedrive’s visual sales pipeline and fine-tune your strategy with robust reporting features and a personalized AI Sales Assistant.
    Try it for free
  • 1
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    Wapiti

    Wapiti

    Wapiti is a web-application vulnerability scanner

    Wapiti is a vulnerability scanner for web applications. It currently search vulnerabilities like XSS, SQL and XPath injections, file inclusions, command execution, XXE injections, CRLF injections, Server Side Request Forgery, Open Redirects... It use the Python 3 programming language.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 3
    Cinemagoer

    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both the IMDb's web server and a local copy of the whole db.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4

    Render 4 Monitoring

    Render Web Services for Viewing/Monitoring and Test Web Applications

    Renders and aggregates other web services into static HTML monitoring output. Also implements simple and dynamic testing of web based applications using mechanize. Includes Basic authentication, and ADFS authentication for web service testing. Custom authentications can be implemented very easily with python programming. Implements PhantomJS rendering for Javascript/HTML5 dynamic web pages. Requires python3
    Downloads: 0 This Week
    Last Update:
    See Project
  • Test your software product anywhere in the world Icon
    Test your software product anywhere in the world

    Get feedback from real people across 190+ countries with the devices, environments, and payment instruments you need for your perfect test.

    Global App Testing is a managed pool of freelancers used by Google, Meta, Microsoft, and other world-beating software companies.
    Try us today.
  • 5
    phoneutria
    A Java Web crawler: multi-threaded, scalable, with high performance, extensible and polite. It can be used to crawl and index any web or enterprise domain and is configurable through a XML configuration file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    cutplace
    Cutplace validates tabular data (CSV, fixed format) according to an interface control document (ICD). The ICD acts as executable specification and can be described using popular spreadsheet applications (Calc, Excel).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    sitecheck

    Modular web site spider for web developers.

    More than just a link checker, sitecheck is a website spider (also known as a crawler) which can assist with SEO by testing an entire site plus both inbound links from search engines and outbound links to other sites for the following issues: looping redirects (HTTP 301/302), broken links (HTTP 404), server errors (HTTP 500), spelling mistakes, low readability scores (using the Flesch Reading Ease test), missing/empty/duplicate meta tags, duplicate content, slow page speed, W3C validation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    z-utalm

    z-utalm

    Unified Test and Logging layer for multiple programming languages

    Modern software systems and application are commonly written in multiple languages, include scripting engines, and are frequently build on multiple specialized frameworks and middleware for a considerable diversity of runtime environments. The latest influencing update in development paradigm is the application of multicore processors. This projects is aimed to unify the required trace and logging output and integrate into debugging environments. The target is to provide general development,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Application Radi-testdir is a light weight test Management tool. Radi supports testdirectory features like configuring the test plan, updating (create/edit) the test results for the test image/build, Backup, User Management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Photo and Video Editing APIs and SDKs Icon
    Photo and Video Editing APIs and SDKs

    Trusted by 150 million+ creators and businesses globally

    Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
    Learn More
  • 10
    Analysis tools for scale test data generated by The Grinder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    ETICS
    ETICS stands for "eInfrastructure for Testing, Integration and Configuration of Software". It provides software professionals with an "out-of-the-box" build and test system, powered with a build and test product repository.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    TestFarm is a client-server Python app that allows monitoring the stability and efficiency of a devel project by performing continuous builds and tests in multiple platforms. The overall idea is based on Tinderbox. With a simpler and extensible approach.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Python test runner, initial plan is a main process dispatching test to sub process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    BuildBot
    Buildbot is a system to automate the compile/test cycle required by most software projects. Buildbot is no longer hosted on SourceForge. See http://buildbot.net for links and documentation, and http://github.com/buildbot for the source.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    "Yet Another Do-It-Yourself (Test) Automation Framework" specification. Use or modify one of the existing software implementations (hosted/referenced here), or implement the framework yourself. Download and read the spec for more details.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Fressia Project is an effort to develop an Open Source general framework for Testing Automation. It's intended for users (testers) that want a simple tool which can be used just out of the box. It's does not requires any complicated configuration stuff
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Browsershots
    Browsershots is a system for capturing screenshots of Web pages in many different browsers. When a user submits a web address on the central web interface, the screenshots are made automatically on distributed computers that are run by volunteers.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 18
    Addanc is a distributed/scalable system for stress/load testing web based applications. Addanc tests focus on the arrival rate of service requests rather than a fixed number of simulated clients.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Automated computer program grading solutions designed for trainings on computer languages, data structures and algorithms. See http://openjudge.net for details.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Libverticale's mission is to provide semantic web services for an artists center.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    collection good tip code as utility packet,esp. Chinses! and design one mail core Blog system. only through mail we can blogging us Blog! and as one opening proj. , let China Python easy share tip code into uniform packet; and enjoy OpenSource...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    An unit testing framework (http://www.xprogramming.com/software.htm) to allow unit testing of Products in a real Zope environment, with all the bells and whistles.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    OpenLC is a set tools designed to facilitate benchmarking and stress testing of a wide variety of information servers (WEB, Email, FTP, LDAP, ...). A XML-RPC API is offered for developers interested in creating clients that query the services provid
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Armstrong is an integrated package for developing web applications in the python web-framework Django.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Gefira is an umbrella project for various opensource-related projects, usually backend or middleware oriented.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.