Showing 141 open source projects for "python web crawler"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    istSOS

    istSOS

    Free and Open Source Sensor Observation Service Data Management System

    istSOS is an OGC SOS server implementation written in Python. istSOS allows for managing and dispatch observations from monitoring sensors according to the Sensor Observation Service standard. The project provides also a Graphical user Interface that allows for easing the daily operations and a RESTful Web api for automatizing administration procedures. istSOS is released under the GPL License, and runs on all major platforms (Windows, Linux, Mac OS X), even though tests were conducted...
    Downloads: 66 This Week
    Last Update:
    See Project
  • 2
    X-RAY

    X-RAY

    The next web scraper, see through the <html> noise

    Supports strings, arrays, arrays of objects, and nested object structures. The schema is not tied to the structure of the page you're scraping, allowing you to pull the data in the structure of your choosing. The API is entirely composable, giving you great flexibility in how you scrape each page. Paginate through websites, scraping each page. X-ray also supports a request delay and a pagination limit. Scraped pages can be streamed to a file, so if there's an error on one page, you won't...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    pyindi-client

    Python binding to the libindi library

    ... there are also bindings for node.js, Tcl (incomplete) and PHP (not useful). As application examples you will find a Python Websocket server with which you may build a web application interacting with Indi servers, and a simple PyQt application similar to the Kstars Indi Control Panel (was built as an exercise). Finally there is an equatorial mount 3D simulator written with Freecad and Python, planned to be connected with the PyIndi module. *** The pyindi-client binding has moved to github. ***
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Holarse

    Holarse

    website software for holarse

    HolaCMS 3 Source Code which will power the new Holarse website.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Simple, Secure Domain Registration Icon
    Simple, Secure Domain Registration

    Get your domain at wholesale price. Cloudflare offers simple, secure registration with no markups, plus free DNS, CDN, and SSL integration.

    Register or renew your domain and pay only what we pay. No markups, hidden fees, or surprise add-ons. Choose from over 400 TLDs (.com, .ai, .dev). Every domain is integrated with Cloudflare's industry-leading DNS, CDN, and free SSL to make your site faster and more secure. Simple, secure, at-cost domain registration.
    Sign up for free
  • 5

    survol

    RDF-based framework monitoring business systems activity

    A Python agent and a web interface aiming to help the analysis and investigation of a legacy application. A set of machines, processes, databases, programs etc ... all communicating with each other, manipulating your data, and whose software architecture has become, with time, complicated, difficult to understand, and undocumented. Data are aggregated with an RDF inference engine, creating a global vision of the business information processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    NTK RTMP SERVER

    NTK RTMP SERVER

    Naam Tamilar Web TV Live Streamer

    Naam Tamilar RTMP Server This project updated as open source for future use of Naam Tamilar Political Party. To contribute to the party and in some case if there is any possibilities if i cannot support them for long term. I thought of sharing this source code so in future it may be helpful for the community and party in which other software developers can help them to upgrade. This source is forked from - https://github.com/arut/nginx-rtmp-module and modified with multiple broadcast...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    mediaTUM is free software written in Python for archiving and retrieval of images, documents and other research data. It was originally developed in the framework of the DFG project IntegraTUM and is continuously expanded with new functionalities as required.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    OpenWebSpider
    OpenWebSpider is an Open Source multi-threaded Web Spider (robot, crawler) and search engine with a lot of interesting features!
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Reactive Extensions for JavaScript

    Reactive Extensions for JavaScript

    An API for asynchronous programming with observable streams

    An API for asynchronous programming with observable streams. The Observer pattern done right. ReactiveX is a combination of the best ideas from. The Observer pattern, the Iterator pattern, and functional programming. ReactiveX is everywhere, and it's meant for everything. Available for idiomatic Java, Scala, C#, C++, Clojure, JavaScript, Python, Groovy, JRuby, and others. Embrace ReactiveX's asynchronicity, enabling concurrency and implementation independence. Manipulate UI events and API...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    htmlarea

    htmlarea

    Small, powerful, full featured WYSIWYG editor

    HTMLArea 4 is a browser based WYSIWYG editor that easily replaces the TEXTAREA in your web pages. It is written in JavaScript, and suitable for use in any modern web browser, and any page on your web site. Current version is 4.0-2016-08-29
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Cub Linux

    Cub Linux

    Chromium + Ubuntu = Cub Linux

    The best of Chromium and Ubuntu. Cub Linux is a project to replicate the Chromium OS experience on an Ubuntu Linux base system. Cub Linux is free to download and use forever.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 12
    MultiPlink

    MultiPlink

    [UPDATE 28-10-2015] FINAL MULTIPLINK SSH LIFETIME v2.0 UP-5 "FAREWELL"

    Download Link Utama: http://sourceforge.net/projects/multiplink/files/Final_MultiPlink_SSHLIFETIME_2.0_UP-5_FAREWELL.rar/download Jika di Halaman Download, Link belum tersedia, klik link alternatif : https://www.dropbox.com/s/znptxp2gxhihpax/Final_MultiPlink_SSHLIFETIME_2.0_UP-5_FAREWELL.rar?dl=0 Lalu Download & Install juga Minor...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    PyLucid is a Open Source web content management system written in Python using the Django Framework. fully customizable output, expandable with plugins, i18n, L10n, Revision controled, support many database engines, WSGI conform and many more...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14

    Roxy Fileman

    Web file manager

    Free file browser for CKEditor and TinyMCE. Can be used in any web application. Ready for use with ASP .NET and PHP - all scripts are included. User unterface completely separated from the server side logic. Language independant server side scipts - can be implemented in any programming language - PHP, ASP .NET, Python etc. Latest versions is available at http://www.roxyfileman.com/download
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    ReorJS

    Distributed Computing with JavaScript

    Create your own distributed computer that can distributed javascript based applications to any computer with a web browser, headless browser or node.js installation. For more information and updates please see our website - http://reorjs.com.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Zoozle Search & Download Suchmaschine

    Zoozle Search & Download Suchmaschine

    Zoozle 2008 - 2010 Webpage, Tools and SQL Files

    Download search engine and directory with Rapidshare and Torrent - zoozle Download Suchmaschine All The files that run the World Leading German Download Search Engine in 2010 with 500 000 unique visitors a day - all the tools you need to set up a clone. Code Contains: - PHP Files for zoozle - Perl Crawler for gathering new content to database and all other cool tools i have created https://www.artikelschreiber.com/en/ https://www.unaique.net/en/ https://www.unaique.com/ https...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    GreasySpoon

    GreasySpoon

    An ICAP Services Factory

    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    FlightFeather's goal is "social networking for everyone". This means that anyone should have a chance to run a popular social networking site -- on minimal hardware, and without wasting bandwidth.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    BandwidthD-IPv6

    BandwidthD-IPv6

    Collects data about network useage using libpcap.

    Based off of bandwidthd (http://sourceforge.net/projects/bandwidthd) but updated to support large networks (13,000+ devices). IPv6 support is in the works. Our thanks goes to the original authors of bandwidthd who made this possible.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    pyMantis
    pyMantis is a data-management system for (systems) biology build on the web2py framework. It features: tree based file explorer, relational db table wizzard with automated creation of user interfaces, internal and external access management, wiki, ..
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Open CarPool

    Open source framework for company carpools

    A simple open source framework for carpools within companies reduces traffic in daily commute and seeds ridesharing culture. OPEN CARPOOL SOURCES HAVE MOVED TO https://github.com/open-carpool
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    LeechCraft
    LeechCraft is a cross-platform modular "internet-client" application. It has plugins that allow one to browse the web, chat via instant messaging (like Jabber or IRC), read RSS/Atom feeds, download files via BitTorrent, Direct Connect and other protocols, play media files and much more.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 23
    Framework (scripts, configuration, code) to build free and public services around travel and leisure data. That project makes an extensive use of already existing data sources such as Geonames and dbPedia, and adds some glue around those (eg, links).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    dynamide
    dynamide is a dynamic web application framework for handling the presentation and business layers in a traditional web app. See http://dynamide.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    SuperWikia Alpha Codebase 1.1

    SuperWikia Alpha Codebase 1.1

    SuperWikia: The Runtime/Realtime Environment for Codesmiths

    The SuperWikia Alpha 1.0 application is an IDE specifically developed for 3D and graphic content. Its built upon the Unity foundation and incorporates various libraries from Komodo IDE, wxDevC++, CodeBlocks, QT Creator and other applications. Its Realtime purpose is an IDE (codesmith developer access), its Runtime (U/I) function is as a space building simulator, operating in conjunction with the SuperWikia 'Navigator' user interface to simulate space economics and fleet operations. It...
    Downloads: 0 This Week
    Last Update:
    See Project