Showing 82 open source projects for "python web crawler"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Get the most trusted enterprise browser Icon
    Get the most trusted enterprise browser

    Advanced built-in security helps IT prevent breaches before they happen

    Defend against security incidents with Chrome Enterprise. Create customizable controls, manage extensions and set proactive alerts to keep your data and employees protected without slowing down productivity.
    Download Chrome
  • 1
    Endian Firewall Community
    ... for email traffic (POP and SMTP), content filtering of Web traffic and a "hassle free" VPN solution (based on both OpenVPN and IPsec).
    Leader badge
    Downloads: 329 This Week
    Last Update:
    See Project
  • 2
    ZEG / Zero-Effort-Groupware

    ZEG / Zero-Effort-Groupware

    SOGo Zero-Effort-Groupware

    The ZEG (Zero Effort Groupware) edition of SOGo is intended to provide a complete out-of-the-box testing environment of SOGo, the Open Source messaging and calendaring software.
    Leader badge
    Downloads: 27 This Week
    Last Update:
    See Project
  • 3
    ciwiki

    ciwiki

    Personnal or familly wiki with low ressource requirement.

    Personal lightweight wiki based on DidiWiki. Upgraded to accept text and highlight color, image resize and video (youtube, dailymotion...) embedded. Written in C, doesn't require a lot of RAM. Works fine on Raspbian (Raspberry Pi). Example of Ciwiki running on Raspberry Pi B+ (700MHz, 512MB): http://inphilly.dyn.dhs.org
    Downloads: 18 This Week
    Last Update:
    See Project
  • 4
    WallPaper (alias crawlpaper)
    WallPaper (alias crawlpaper) is a desktop changer (NOT a screensaver) which includes a web crawler for picture download, an audio stream ripper, an audio player, a mini mp3 tag editor,etc. Also included support for .zip and .rar files and an interface to the BerkleyDB code for small databases.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 5
    YehDown

    YehDown

    A video downloader from youtube viemo and all major sites

    A video downloader . The Official home page for the YehDown tool has been published for the new feature updates. : https://Yehdown.yehigo.com The new Yehdown software download the video with improved download speed. The current update has a best user friendly UI. The tool has support for live , in real-time update for new features. Tested on windows 11.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    Cinemagoer

    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both the IMDb's web server and a local copy of the whole db.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Software, information, data sets and documentation for the Web as Corpus community.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Zero Install
    Zero Install is a decentralised cross-distribution software installation system. Create one package that works everywhere! With dependency handling and automatic updates, full support for shared libraries, and integration with native package managers
    Leader badge
    Downloads: 1,947 This Week
    Last Update:
    See Project
  • 9
    NTK RTMP SERVER

    NTK RTMP SERVER

    Naam Tamilar Web TV Live Streamer

    Naam Tamilar RTMP Server This project updated as open source for future use of Naam Tamilar Political Party. To contribute to the party and in some case if there is any possibilities if i cannot support them for long term. I thought of sharing this source code so in future it may be helpful for the community and party in which other software developers can help them to upgrade. This source is forked from - https://github.com/arut/nginx-rtmp-module and modified with multiple broadcast...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Photo and Video Editing APIs and SDKs Icon
    Photo and Video Editing APIs and SDKs

    Trusted by 150 million+ creators and businesses globally

    Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
    Learn More
  • 10
    AlienVault OSSIM

    AlienVault OSSIM

    Open Source SIEM

    OSSIM, AlienVault’s Open Source Security Information and Event Management (SIEM) product, provides event collection, normalization and correlation. For more advanced functionality, AlienVault Unified Security Management (USM) builds on OSSIM with these additional capabilities: * Log management * Advanced threat detection with a continuously updated library of pre-built correlation rules * Actionable threat intelligence updates from AlienVault Labs Security Research Team * Rich...
    Leader badge
    Downloads: 61 This Week
    Last Update:
    See Project
  • 11
    mediaTUM is free software written in Python for archiving and retrieval of images, documents and other research data. It was originally developed in the framework of the DFG project IntegraTUM and is continuously expanded with new functionalities as required.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Proxenet

    Proxenet

    The ONLY hacker friendly proxy for webapp pentests

    Proxenet is a hacker-friendly, DIY web proxy designed for web application penetration testing. Written in C, it allows interaction with higher-level languages like Python and Ruby for on-the-fly modification of HTTP requests and responses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Ganglia

    Scalable, distributed monitoring system for high-performance computing

    Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It is based on a hierarchical design targeted at federations of clusters. Supports clusters up to 2000 nodes in size.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 14
    Cub Linux

    Cub Linux

    Chromium + Ubuntu = Cub Linux

    The best of Chromium and Ubuntu. Cub Linux is a project to replicate the Chromium OS experience on an Ubuntu Linux base system. Cub Linux is free to download and use forever.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 15
    The Burster 3D
    Web browser plug-in for hosting real 3D on the websites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    The goal of this project is to make possible to access Progress database from any external program that can use sockets. The server (broker and agents) are written in Progress 4GL and made use of sockets capabilities of Progress V9.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    BandwidthD-IPv6

    BandwidthD-IPv6

    Collects data about network useage using libpcap.

    Based off of bandwidthd (http://sourceforge.net/projects/bandwidthd) but updated to support large networks (13,000+ devices). IPv6 support is in the works. Our thanks goes to the original authors of bandwidthd who made this possible.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Fast Artificial Neural Network Library is a free open source neural network library, which implements multilayer artificial neural networks in C with support for both fully connected and sparsely connected networks. Cross-platform execution in both fixed and floating point are supported. It includes a framework for easy handling of training data sets. It is easy to use, versatile, well documented, and fast. Bindings to more than 15 programming languages are available. An easy to read...
    Downloads: 57 This Week
    Last Update:
    See Project
  • 19
    Html SymboliZe

    Html SymboliZe

    transcodes between html entities and regular text

    Hsz takes the text you type and turns it into the proper html entities. Hsz is designed to make web developing easier by providing an easy means of looking up html entity codes. (see http://www.w3schools.com/html/html_entities.asp for info about what html entity codes are)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    blog4y

    blog4y

    Easy blogging for you.

    Project goal is to allow you easy post and backup blog articles to most free blogging hosting. Provide Python API, command line utility, Emacs integration and GTK interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Django Live OS

    Django Live OS for building webapps using Django and MongoDB.

    This is Django Live, a live CD based on Debian stable, Squeeze 6/Lenny 5 that enables to setup/host/test Django apps with ease. No worrys of how to install Apache/Python/MySQL/Django..just fill it, shut it, and go on..LAMP made easy :)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Scripts and tools to manage your Server.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 23
    Command line utility to create graphs and plots for web, CGI, paper, slides. Create pie charts, bar graphs, scatterplots, and other kinds of graphic displays from your alphanumeric, date, time, or categorical data. Input data is in .csv or .txt. Output graphics in gif, postscript, eps, png, svg and others. Useful in dynamic web content systems and other unattended uses. Command line, batch mode. For full details see http://ploticus.sourceforge.net Dec 2016: A new compact python API...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 24
    Ming is an SWF ("Flash") file format output library. It is written in C, with wrappers for C++, Python, and PHP, plus rudimentary support for Ruby and Perl.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25

    LinkChecker

    check links in web documents or full websites

    New Homepage: http://wummel.github.io/linkchecker/ Linkchecker features: - recursive and multithreaded checking and site crawling - output in colored or normal text, HTML, SQL, CSV, XML or a sitemap graph in different formats - HTTP/1.1, HTTPS, FTP, mailto:, news:, nntp:, Telnet and local file links support - restrict link checking with regular expression filters for URLs - proxy support -...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.