This project aims to provide a clean API, and a corresponding C++ implementation, for parsing travel-focused requests (e.g., "washington dc beijing monday r/t +aa -ua 1 week 2 adults 1 dog").
Google Mass Search (GMS) is a small script written in Python that retrieves as many URLs as you need from the Google search results for a specified string. It is simple to use, yet fast and powerful.
You can specify a search string, the number of results, an output filename, and some optional fields. GMS retrieves all the required links in a few seconds and saves them to the file. It also eliminates redundant links.
You can also apply filters, such as keeping only links that contain a given string or excluding links that contain one....
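The de-duplication and filtering described above can be sketched in a few lines of Python. This is only an illustration, not GMS's actual code: the function name `filter_links` and the parameters `must_contain` and `must_not_contain` are hypothetical, and the input is assumed to be a plain list of URL strings rather than live search results.

```python
def filter_links(links, must_contain=None, must_not_contain=None):
    """Return links in first-seen order, without duplicates, after filtering.

    All names here are hypothetical; they are not taken from GMS itself.
    """
    seen = set()
    result = []
    for link in links:
        if link in seen:
            continue  # drop redundant links
        seen.add(link)
        if must_contain is not None and must_contain not in link:
            continue  # keep only links containing the given string
        if must_not_contain is not None and must_not_contain in link:
            continue  # drop links containing the excluded string
        result.append(link)
    return result
```

Calling `filter_links(urls, must_contain=".org")` would keep each distinct `.org` link once, in the order it first appeared.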
PqDL is a flexible tool for downloading Pocket Queries that contain more than 500 caches from geocaching.com. Pocket Queries of that size are not sent by e-mail. UNIX USERS: Manual checkout!
"Swish-e is a fast, flexible, and free open source system for indexing collections of Web pages or other files" (http://swish-e.org/ ) This module provides a Python API for this software.
The Semantic Web Peer - Allegra is an innovative library for Semantic Web applications development. It provides a framework for asynchronous network peer programming, a simple stack of Internet standards implementations, and two new network applications:
MedusWiki is a Python Wiki engine intended to be used as a personal knowledge management system. It uses Topic Maps (XTM) to store metadata, so meaningful associations can be created between wiki pages. Zope Page Templates (ZPT) are used to produce HTML.
Graph-based Extraction and Summarization - a generic graph-based summarization framework. Basic functionality is provided - third-party modules can be plugged in.
Document summarization system. Once document content has been added to the system, user queries generate a summary document containing the information available to the system.
A simple app that builds a library of your music, then fetches information from metal-archives, discogs, etc. and shows you what you're still missing (e.g. new releases). Moved to http://github.com/KenjiTakahashi/gayeogi
Jake is a console-based app written in Python and Qt4. Plugins let you do almost anything, for example search Google, translate, view images, or talk with it (as an AI bot). A skinning system also lets you choose how Jake should look.
Wixi is a multi-platform wiki application for the desktop. It is written in Python/wxPython and does anything you might expect from a wiki application. HTML is generated from text by txt2tags, which is simple and easy to learn.
This is an ***old archive*** of tools developed for facilitating the use of Creative Commons licenses and metadata. --- For the most up to date representation of any of the projects listed here, please see: http://creativecommons.org/project/Developer.
Pypes is a framework which allows users to break complex data processing logic down into a series of smaller, less complex tasks. These tasks, referred to as components, can then be connected so that the output of one becomes the input to another.
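The idea of chaining components so that each output becomes the next input can be illustrated with plain Python functions. This sketch does not use the Pypes API; `pipeline` and the three components are hypothetical names chosen for the example.

```python
from functools import reduce


def pipeline(*components):
    """Chain components so each one's output feeds the next one's input."""
    def run(data):
        return reduce(lambda value, component: component(value), components, data)
    return run


# Three small, hypothetical components.
def strip_blank(lines):
    return [line for line in lines if line.strip()]


def lowercase(lines):
    return [line.lower() for line in lines]


def count_words(lines):
    return sum(len(line.split()) for line in lines)


# Connect the components into one processing function.
process = pipeline(strip_blank, lowercase, count_words)
```

Each component stays small and testable on its own, and reordering or swapping components only changes the arguments passed to `pipeline`.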
Nucular Archiving System for creating full text indices for fielded data. Python API, web, and command line interfaces. Fast. Very light weight. Concurrent read/writes with no possible locking issues. No server process. Proximity. Facets. Funny name.
pyTube is a Python-based command-line YouTube search tool. One can search for videos and display them in the default web browser. Requires Python 2.5 and gdata.
A threaded Web graph (Power law random graph) generator written in Python. It can generate a synthetic Web graph of about one million nodes in a few minutes on a desktop machine. It implements a threaded variant of the RMAT algorithm.
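For readers unfamiliar with RMAT (recursive matrix) generation, the following is a minimal single-threaded sketch, not this project's threaded implementation. The function names, the `seed` parameter, and the quadrant probabilities used as defaults are assumptions made for illustration.

```python
import random


def rmat_edge(scale, a=0.57, b=0.19, c=0.19, rng=random):
    """Pick one directed edge in a graph with 2**scale nodes.

    At each of the `scale` levels the adjacency matrix is split into four
    quadrants, and one is chosen with probabilities a, b, c and 1 - a - b - c.
    """
    src = dst = 0
    for _ in range(scale):
        r = rng.random()
        src <<= 1
        dst <<= 1
        if r < a:
            pass            # top-left quadrant: both new bits are 0
        elif r < a + b:
            dst |= 1        # top-right quadrant
        elif r < a + b + c:
            src |= 1        # bottom-left quadrant
        else:
            src |= 1        # bottom-right quadrant
            dst |= 1
    return src, dst


def rmat_graph(scale, num_edges, seed=None):
    """Return a set of `num_edges` distinct (src, dst) pairs."""
    if num_edges > 4 ** scale:
        raise ValueError("more edges requested than the graph can hold")
    rng = random.Random(seed)
    edges = set()
    while len(edges) < num_edges:
        edges.add(rmat_edge(scale, rng=rng))
    return edges
```

Because the first quadrant is favored at every level, low-numbered nodes accumulate far more edges than high-numbered ones, which is what gives the generated graph its skewed degree distribution.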
GOOPY: Python module to search google (with CLI included). With the module you can create python programs that can access google results. The CLI allows you to surf google without leaving bash!
DiskAt is a disk/media catalogue app supporting multiple categories per item, good search, and features that allow it to be used as a movie/DVD/etc. database. Written with PHP/Python/SQLite.
Opener-OAI extracts Dublin Core and LOM records from EduCommons RSS feeds and serves these records as an OAI repository.
It was initially built for the EduCommons site of Open Universiteit Nederland but should work on all EduCommons sites.