Showing 140 open source projects for "python web crawler"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Get the most trusted enterprise browser Icon
    Get the most trusted enterprise browser

    Advanced built-in security helps IT prevent breaches before they happen

    Defend against security incidents with Chrome Enterprise. Create customizable controls, manage extensions and set proactive alerts to keep your data and employees protected without slowing down productivity.
    Download Chrome
  • 1
    Redland is a set of object-based, modular and portable C RDF libraries providing RDF APIs for the graph, triple storage (librdf), RDF/XML parsing and serializing (Raptor), SPARQL RDF querying (Rasqal). Language APIs in Perl, PHP, Python, Ruby and others.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    WSFuzzer
    WSFuzzer is a fuzzing penetration testing tool used against HTTP SOAP based web services. It tests numerous aspects (input validation, XML Parser, etc) of the SOAP target. It is only to be used against targets that have granted permission to be teste
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Some tools related to the Music Ontology - including domain-specific Semantic Web crawlers, audio collection management and mapping tools
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    A command-line Python utility to mine information on open source projects using the ohloh web service APIs. The script is enables analysis of open source projects indexed by ohloh by providing source lines of code (SLOC), contributor data and other info.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    A Python GUI tool for rapidshare premium member to download rapidshare links. Features: Resumable. Download rapidshare folder/linklist. Multiple concurrent files download / multiple concurrent parts per file. Proxy (HTTP, SOCKS). Urlcash, anonym.to
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    The suds project is a python soap web services client lib. Suds leverages python meta programming to provide an intuative API for consuming web services. Runtime objectification of types defined in the WSDL is provided without class generation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Lucid Desktop is a Web Desktop/WebOS/Web Operating System built with Dojo. For more info, please visit http://www.lucid-desktop.org/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    PyH
    A powerful python module that lets you output HTML code from within a python script in a very efficient and convenient fashion. Code your web-page like a GUI! Create tags and modify their attributes at anytime during your script. http://pyh/googlecod
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    pyservices
    Making use of our library you can easily deploy and consume services available on the web. PyServices is a pythonic library that provides a default interface to WebServices written in many different protocols. Our objective is describe and implement
    Downloads: 0 This Week
    Last Update:
    See Project
  • Photo and Video Editing APIs and SDKs Icon
    Photo and Video Editing APIs and SDKs

    Trusted by 150 million+ creators and businesses globally

    Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
    Learn More
  • 10
    ZK Light is renamed to ZKuery and moved to http://code.google.com/p/zkuery/. ZK Light is a client-only version of ZK; Support Java, C, PHP, Python...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Spyse is a software framework for building multi-agent systems. It allows Python developers to build distributed intelligent systems of multiple cooperative agents based on FIPA, OWL, SOA and many others. Spyse is designed for ease-of-use and fun.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Methanol is a scriptable multi-purpose web crawling system with an extensible configuration system and speed-optimized architectural design. Methabot is the web crawler of Methanol.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    A simple remote control that should be able to play, pause, stop and skip on DNLA/UPnP devices it finds on the subnet. Currently tested on two machines, Ubuntu 9.10 and Windows XP Home against Rhythmbox with the DLNA/UPnP plugin .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ASI to Smart-M3 SIB synchronization agent
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    An online judge for programming contests. Unmaintained. Please fork and maintain if you wish. Here are some notes for a future maintainer: 1. The backend is not built for large-scale submissions over the web (it works OK for a small contest). The key missing component is a job queue for submissions. If this is implemented, then the backend will be able to scale.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Dare-Dare stands for Document Ajax Reader Extension. It's a full javascript online pdf reader. No flash, no closed sources !!!!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    RSS aggregator written in python, using django. Feeds and feed items can be annotated with tags and other information from services such as twitter & delicious. Allow communities such as teachers to get focused, useful information from the interweb.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Make AsciiDoc part of your literate programming tool set. With eWEB you can weave and tangle literate programs written as AsciiDoc documents, using embedded WEB code snippets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This project provides a fast distributed system for image processing, written in Python. It aim is to be used as service to PHP, Perl and Python application servers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Porcupine is an open-source Python based web application server that provides front-end and back-end revolutionary technologies for building modern data-centric Web 2.0 applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Library to get balance for DSV internet-privider
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Sofa is a CUDA-based reasoner
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The Java Sitemap Parser can parse a website's Sitemap (http://www.sitemaps.org/). This is useful for web crawlers that want to discover URLs from a website that is using the Sitemap Protocol. This project has been incorporated into crawler-commons (https://github.com/crawler-commons/crawler-commons) and is no longer being maintained.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    These are the scripts for putting together your own MindTouch/Zenoss mashup. The mashup places live data from a Zenoss installation in the context of MindTouch wiki pages for taking configuration notes and other possible future mashups.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    SnapLogic is an Open Source Data Integration framework that combines the power of state-of-the-art dynamic programming languages with standard Web interfaces to solve today's most pressing problems in data integration.
    Downloads: 1 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.