Search Results for "python web crawler" - Page 31

Showing 3144 open source projects for "python web crawler"

  • 2
    Grab Framework Project

    Web Scraping Framework

    Grab is a Python framework for building web scrapers. With Grab you can build web scrapers of various complexity, from simple 5-line scripts to complex asynchronous website crawlers processing millions of web pages. Grab provides an API for performing network requests and for handling the received content, e.g. interacting with the DOM tree of the HTML document. The single request/response API allows you to build a network request, perform it, and work with the received content. The API is built...
    Downloads: 0 This Week
  • 3
    pyspider

    A powerful Spider(Web Crawler) system in Python

    pyspider is a powerful spider (web crawler) system in Python. Components are connected by a message queue. Every component, including the message queue, runs in its own process or thread and is replaceable. That means that when processing is slow, you can run many instances of the processor and make full use of multiple CPUs, or deploy to multiple machines. This architecture makes pyspider really fast. Since pyspider has various components, you can just run pyspider to start a standalone...
    Downloads: 1 This Week
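The queue-connected design described for pyspider can be illustrated with a minimal standard-library sketch. It does not use pyspider's own API: `run_pipeline`, the worker count, and the placeholder "processing" step are all hypothetical names and choices made for illustration.

```python
import queue
import threading


def run_pipeline(urls, num_processors=3):
    """Send items through a queue to several interchangeable processor workers."""
    tasks = queue.Queue()    # stands in for the message queue between components
    results = queue.Queue()  # collects whatever the processors produce

    def processor():
        while True:
            url = tasks.get()
            if url is None:  # sentinel: no more work for this worker
                break
            # Placeholder processing step (an assumption for this sketch);
            # a real processor would parse a fetched page here.
            results.put((url, len(url)))

    workers = [threading.Thread(target=processor) for _ in range(num_processors)]
    for worker in workers:
        worker.start()
    for url in urls:
        tasks.put(url)
    for _ in workers:
        tasks.put(None)      # one sentinel per worker
    for worker in workers:
        worker.join()

    collected = []
    while not results.empty():
        collected.append(results.get())
    return sorted(collected)
```

Because the workers only talk to the queues, raising `num_processors` adds capacity without changing any other part of the sketch, which is the property the description attributes to the architecture.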
  • 4
    myHouse

    Home monitoring and automation suite now known as eGeoffrey

    PLEASE NOTE: myHouse has been a nice starting point but is difficult to customize, maintain and evolve. For these and many other reasons, I've completely redesigned and re-architected myHouse, also giving it a new name (eGeoffrey). The basic principles are the same and the software is still open source, but it is now supposed to be much simpler to configure (entirely via the web interface), use, customize and extend. If you want to give it a look, check out https://www.egeoffrey.com. I've also put...
    Downloads: 0 This Week
  • 5
    WhatsappToDesktop

    A compiled package of Whatsapp Web version 1.0.

    This is a compiled package of Whatsapp Web version 1.0. It is a cross-platform instant messaging and voice calling application for smartphones. In addition to text messages, users can send videos and documents in PDF format, as well as connect freely over an internet connection ...
    Downloads: 0 This Week
  • 6
    Thexyz Cloud Drive

    Mobile Sync for Files, Email, Calendars, and Contacts

    Thexyz Cloud Drive application allows you to synchronize files on your desktop with your cloud storage space. This gives you instant access to the most recent versions of important files from anywhere and from any device. Need to edit that important document, but don’t have access to your desktop computer? With the editors included in Cloud Drive, you can create, access, and edit documents and spreadsheets right from your browser.
    Downloads: 0 This Week
  • 7

    Resume Match

    A tool to match resumes with job descriptions

    An ultra-light and easy Python-based tool to match your resume against a job description. Companies use software to match the numerous resumes they receive with the job description they have posted. The higher the match percentage, the higher the chances that the resume is actually looked at by the HR teams. This is an attempt to help job seekers in that regard. This is by no means the best or even a powerful tool. But, it works and it might help...
    Downloads: 0 This Week
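To make the idea of a "match percentage" concrete, here is a minimal sketch based on plain keyword overlap. This is an assumption chosen for illustration, not the method Resume Match itself uses; the function name `match_percentage` and the tokenization rule are hypothetical.

```python
import re


def match_percentage(resume: str, job_description: str) -> float:
    """Share of distinct job-description words that also appear in the resume."""
    token = r"[a-z0-9+#]+"  # assumed tokenization: lowercase words, digits, '+' and '#'
    resume_words = set(re.findall(token, resume.lower()))
    job_words = set(re.findall(token, job_description.lower()))
    if not job_words:
        return 0.0  # nothing to match against
    return 100.0 * len(resume_words & job_words) / len(job_words)
```

For example, `match_percentage("Python developer with SQL", "python sql docker aws")` covers two of the four job-description words. A real matcher would likely weight terms and ignore common filler words.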
  • 8

    Face Recognition

    World's simplest facial recognition API for Python & the command line

    Face Recognition is the world's simplest face recognition library. It allows you to recognize and manipulate faces from Python or from the command line using dlib's (a C++ toolkit containing machine learning algorithms and tools) state-of-the-art face recognition built with deep learning. Face Recognition is highly accurate and is able to do a number of things. It can find faces in pictures, manipulate facial features in pictures, identify faces in pictures, and do face recognition...
    Downloads: 13 This Week
  • 9

    Syllog

    Syllog is a system for teaching logic, with anonymous answer logging

    Syllog is a system for teaching various forms of logic, including syllogistic logic. It consists of a student-facing Java Applet that runs in the student's browser, and a teacher-facing, server-based logging facility for recording the answers given by students. The student-facing part simply poses questions, to which the student must provide an answer. The teacher-oriented logging facility then logs the answers anonymously for further investigation by the teacher.
    Downloads: 0 This Week
  • 10
    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java which provides a simple interface for crawling the Web. Using it, you can set up a multi-threaded web crawler in a few minutes. You need to create a crawler class that extends WebCrawler. This class decides which URLs should be crawled and handles the downloaded page. The shouldVisit function decides whether a given URL should be crawled or not. In the project's example, it disallows .css, .js and media files and only allows pages within...
    Downloads: 2 This Week
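The filtering decision described for shouldVisit can be sketched as follows. crawler4j itself is a Java library, so this Python version only illustrates the logic, not its API. The name `should_visit`, the exact extension list, and the `allowed_prefix` parameter are assumptions, since the listing does not state which site the original example is restricted to.

```python
import re
from urllib.parse import urlparse

# Assumed extension list: the listing only says ".css, .js and media files".
EXCLUDED_EXTENSIONS = re.compile(r".*\.(css|js|gif|jpe?g|png|mp3|mp4|zip|gz)$")


def should_visit(url: str, allowed_prefix: str) -> bool:
    """Return True when the URL is a page under allowed_prefix, not a static asset."""
    path = urlparse(url).path.lower()  # ignore query string and fragment
    if EXCLUDED_EXTENSIONS.match(path):
        return False
    return url.lower().startswith(allowed_prefix.lower())
```

A crawler would call such a predicate on every discovered link before queueing it, which keeps stylesheets, scripts and media out of the crawl frontier.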
  • 11
    Mapbox Maps SDK for React Native

    A Mapbox GL React Native module for creating custom maps

    Mapbox is the location data platform for mobile and web applications. We provide building blocks to add location features like maps, search, and navigation into any experience you create. Use our simple and powerful APIs & SDKs and our open-source libraries for interactivity and control. Once you’re signed in, all you need to start building is a Mapbox access token. Use this same short code with all of our interactive mapping libraries, Python and JavaScript SDKs, and directly against our REST...
    Downloads: 0 This Week
  • 12
    Vaex

    Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python

    Data science solutions, insights, dashboards, machine learning, deployment. We start at 100GB. Vaex is a high-performance Python library for lazy Out-of-Core data frames (similar to Pandas), to visualize and explore big tabular datasets. It calculates statistics such as mean, sum, count, standard deviation etc, on an N-dimensional grid for more than a billion (10^9) samples/rows per second. Visualization is done using histograms, density plots and 3d volume rendering, allowing interactive...
    Downloads: 1 This Week
  • 13
    Zoom Website Editor

    Text based Website Editor

    A text/code editor which allows direct editing of web pages over FTP
    Downloads: 0 This Week
  • 14
    Clustergrammer

    An interactive heatmap visualization built using D3.js

    Clustergrammer is a visualization library built using D3.js that enables intuitive interaction with high-dimensional data. Clustergrammer was built with biological data in mind. Clustergrammer is a web-based tool for visualizing high-dimensional data (e.g. a matrix) as an interactive and shareable hierarchically clustered heatmap. Clustergrammer's front end (Clustergrammer-JS) is built using D3.js and its back-end (Clustergrammer-PY) is built using Python. Clustergrammer produces highly...
    Downloads: 2 This Week
  • 15
    Transcrypt

    Python in the Browser

    Lean and mean Python 3.6 to JavaScript compiler. Supports multiple inheritance, operator overloading and Python source level debugging, even of minified Javascript files. Transcrypt code is as fast and compact as its Javascript counterpart, and it is precompiled for page load speed. You can now develop your web applications completely in Python, with full access to any Javascript library.
    Downloads: 0 This Week
  • 16

    fapws3

    My own copy of Fapws3 (Fast Asynchronous Python Web Server)

    This is my copy of Fapws, developed by william.os4y (https://github.com/william-os4y/fapws3)
    Downloads: 0 This Week
  • 17

    dadosSemiotica

    Collector and manager of semiotic analysis data

    This program is a web application to collect and organize text analysis data. It works with sets of texts, and the analyses are done on sentence-length portions. One of the preprocessing modules is based on CoGroo (a LibreOffice & OpenOffice.org Portuguese grammar checker).
    Downloads: 0 This Week
  • 18
    PyTom

    http://www.sciencedirect.com/science/article/pii/S1047847711003492

    PyTom is a toolbox developed for interpreting cryo electron tomography data. All steps from reconstruction, localization, alignment and classification are covered with standard and improved methods. Please sign up to our mailing list to keep up with the most recent updates and versions.
    Downloads: 0 This Week
  • 19

    gdpr

    Tool to maintain GDPR data protection declarations

    Admins often maintain multiple web pages, each of which requires a privacy statement under the EU GDPR. To keep these statements coherent and up to date while avoiding repeated work, this project provides a tool to automatically create the appropriate statement for each page from a single source. The project is currently available in PHP; however, if anyone is willing to provide a version in Python, Perl or another language, it is more than welcome. The project...
    Downloads: 1 This Week
  • 20
    WebSrapper

    Web scraping with a UI

    Hello, I have made this simple web scraping tool so that one can have fun web scraping with ease.
    Downloads: 0 This Week
  • 21

    squirreldns

    Simple way to show your current IP to the world

    A simple way to log in to a Flask server and set your IP address so that others can communicate with you. Also has a basic function to allow failover of nodes.
    Downloads: 0 This Week
  • 22
    OpenSearchServer Search Engine

    An open source search engine with RESTful API and crawlers

    OpenSearchServer is a powerful, enterprise-class search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API, Ruby, Rails, Node.js, PHP, Perl), you will be able to quickly and easily integrate advanced full-text search capabilities in your application: full-text search with basic semantics, join queries, boolean queries, facets and filters, document (PDF, Office, etc.) indexing, web scraping, etc. OpenSearchServer runs on Windows...
    Downloads: 20 This Week
  • 23
    XBRL Framework
    ... the use of services (...) 5. XBRL Similarity Tool implements a process to evaluate similarity between concepts of XBRL (...) 6. XBRL SPED Taxonomy is a data model, in order to create a standard, organized to support to Brazilian Public Digital Bookkeeping System (SPED)(...) 7. easyXBRL is a Python-based XBRL library (...) 8. J-easyXBRL is a Java-based XBRL library (...) 9. Xbrl Parser converts XBRL file from XML to JSON format; https://sourceforge.net/p/xbrlframework/wiki/Home
    Downloads: 0 This Week
  • 24
    Leo is an outlining-oriented project manager and browser. With Leo you can organize programs, web sites, URL's, pictures or any other kind of data. Leo is fully scriptable in Python. Leo is 100% pure Python. It requires the Qt gui toolkit.
    Downloads: 4 This Week
  • 25
    GlassPenny
    Competitive, internet-based stock-picking game. Along with the game, information will be collected and strategies will be tested. Users will develop a broader understanding of capital markets and the asset allocation decisions that drive our society.
    Downloads: 0 This Week