Educational Python web scraping case collection for many sites
lxSpider is a collection of web scraping examples designed primarily for learning and experimentation with data extraction techniques. It gathers numerous crawler implementations that demonstrate how to collect data from a wide range of websites and online services. It focuses heavily on practical cases that illustrate how different platforms handle requests, authentication parameters, and anti-scraping protections. lxSpider includes examples targeting areas such as e-commerce platforms,...
This is a apache v2.0 authentication module. Based on html form authentication and cookie authentication session. Cookie session are stored in memcache deamon. Can be used has an simple "Single Signe-On" (SSO).
All the code source and the bug tracking has migrated to github:
https://github.com/ZenProjects/Apache-Authmemcookie-Module
All the documentation are here:
https://zenprojects.github.io/Apache-Authmemcookie-Module/
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Framework (scripts, configuration, code) to build free and public services around travel and leisure data. That project makes an extensive use of already existing data sources such as Geonames and dbPedia, and adds some glue around those (eg, links).
Cheetah is a template engine and code generation tool, written in Python. Web development is its principle use, but Cheetah is very flexible and is also being used to generate C++ game code, Java, sql, form emails and even Python code.
JLink lets users author flow charts based on ISO 5807 and IBM standards. Developers can use JLink to add flowcharts to applications, serve a flow chart over the web in PDF or PNG, or dynamically create a flowchart with Javascript, Python or Ruby scripts
Not the greatest thing since Sliced Bread, it IS Sliced Bread. This project is a series of network tools and a network development SDK, with big hopes, and too little time to code it all.
"Web Test Tools" is an open source project which provides tools for different kind of testing such as unit tests (JUnit), and different kinds of tests for applications. Additionaly you can download here good implementation of internet protocols in Java.
Code for reference implementations of identity brokers and simple single sign-on (SSO) mechanisms that utilize XDI and link contracts to manage the dataweb.
2tools is a set of online CMS maintenance tools: just one click to migrate to/from different CMS; safe configuration settings to upgrade your CMS; easy Data movement to/from DBMS, databases or tables; SQL, XML, XSLT, PHP technologies.
All tools are main
The OO component based XML processing framework.The goal is to make the XML developer's life easier.It's going ot achieve that by providing reusable components for all stages of XML processsing with easy language and application binding.
Luxilla is a runtime/browser that turns XUL (XML User Interface Language) into live windows, dialogs, menus, toolbars and more without requiring a single-line of Javacode. Pass on the chrome folder holding your XUL markup and see it come alive.
Frosttie (FROnt-end SchemaTron Text Internet Engine) takes XHTML pages and processes them with various user-definable filters such a W3C's WAI, Section 508 (US) web usability compliance, ad removal, etc. It can be used with zKnowMan.