Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
Eddyspeak is an orrigional translation content management system. It allows you to translate GNU GetText PO files using a browser. This allows a comunity to work on the same file at the same time and speeds up development. Uses: PHP, MySQL and Python.
pso- Python Service objects is a package that simplifies HTTP handlers:
Built-in sessions. Write once run on modpython, modsnake, NASAPY, fastcgi, CGI. Easy interface to HTTP info. Simple, fast, robust and powerful extendable OO template parser.
Software to organize, interpret, and present course survey results on the web. Requires Python; uses a database and a webserver. Colored rating bars, plots, statistical comparisons, cross-referencing.
Support for the Oasis XRI (Extensible Resource Identifiers) effort. This includes resolvers and client libraries for XRIs in multiple languages and multiple platforms. See http://www.oasis-open.org/committees/xri
niab - Network In A Box.
Create a virtual lab network inside one machine. A lab can include routers, firewalls, clients and servers connected by a network specified by you. [Linux Network Simulator, UML, user mode linux]
PyGCS is a very stripped down MUD-like chat-server written entirely in Python. It has a single "room" and no large database to keep in memory and on disk. PyGCS has no embedded programming language.
HDCIM is an instant messenger for HDC users to exchange info, the IMs will travel encrypted with public key encryptions technologies. The instant messenger will be built for Windows and Linux, but it will be written for maximim portability
OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.
Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
PyWikiServer is a WikiWiki web server written entirely in Python. Data files are encoded using docbook XML. It does not need neither a Web Server nor any conversion/stylesheet tool.
Conflux is a Web-based groupware and file management solution intended to organise and facilitate peoples work at the office. The basic functionality includes a file depository, shared calendars, tasks, contacts, discussions, and an email client.
Pears is a three-pane newsfeed (RSS/RDF/Atom) aggregator which caches downloaded feeds for offline use. It has a clean, uncluttered interface, it's easy to use and works on Windows, Linux and MacOSX. You can extend its functionality with plugins.
Luxor Contrib is a collection of example apps, add-ons, plug-ins, tutorials, FAQs, how-tos and other goodies for the Luxor XUL (XML User Interface Language) toolkit.
PyEximon is a GNOME monitor/manager for the popular MTA, Exim. It includes real-time status graphs and log updates, colored log browsing, hierarchial message lists, as well as a graphical interface to common message functions.
Corbon is a light framework that transforms XML to XHTML pages. It strives to generate standards-compliant XHTML 1.0 code, with a strict separation of content and layout.
pyBlog is yet another blogging software written in python for creating custom blogs. It works in offline mode containing an ftp client for up- and downloading files to any web server. It supports multiple users (bloggers) and blogging via mail (optional)
Code for reference implementations of identity brokers and simple single sign-on (SSO) mechanisms that utilize XDI and link contracts to manage the dataweb.
Pyblish is a web server and application framework written in Python. The main goal for Pyblish is power through simplicity, clear separation of developer roles and easy extensibility.
pyChelsea is a python based, personal, visited, web page indexer, seach engine and interface for the browser/platform of your choice. If you remember a page based on a phrase, pyChelsea is for you.
Note: this project is no longer maintained. Please use gnome-python-extras (http://www.pygtk.org) instead. I apologize for any trouble this might cause, but this is better in the long run. Python bindings for GtkEmbedMozilla.
FreeGee is an integrated application framework for Python and C++. Included modules are: Python (scripting), PostgreSQL (database), wxWidgets/wxWindows (GUI), GnuPG (crypto), omniORB (OO middleware), Apache (Web), and other fine tools (see Home Page).