Enterprise-Grade Proxies • Built-in IP Rotation • 195 Countries • 20K+ Companies Trust Us
Get unrestricted access to public web data with our ethically-sourced proxy network. Automated session management and advanced unblocking handle the hard parts. Scale from 1 to 1M requests with zero blocks. Built for developers with ready-to-use APIs, serverless functions, and complete documentation. Used by 20,000+ companies including Fortune 500s. SOC2 and GDPR compliant.
Get Started
Free CRM Software With Something for Everyone
216,000+ customers in over 135 countries grow their businesses with HubSpot
Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.
The Wikipedia Miner toolkit provides simplified access to Wikipedia. This open encyclopedia represents a vast, constantly evolving multilingual database of concepts and semantic relations; a promising resource for nlp and related research.
retypar converts photos, drawings etc. into text. The result can be saved as text (.txt, .ans), HTML (.html) or as an image (.jpg). You can influence the conversion by altering certain options (font, font size, scale factor, colour etc.).
otl is a text processor for generating markup from plain text. Much of both the input and output formats can be customized. otl supports structures such as nested ordered lists, headers and footers, and tables.
MarcXimiL is a flexible multi-platform bibliographic similarity analysis framework. Features: deduplication, information monitoring, visual analysis, plagiarism detection. Supported: MARCXML, OAI-PMH2 harvesting, and importation of text MARC.
A simple but powerful bulk file statistics calculator. Given operation(s) and a list of files containing identical text fields and corresponding numerical fields this program calculates and outputs a similar file with the numerical op performed.
CORPSE (CORPus SEarch) is a powerful search engine written in Java. The aim is to provide an efficient implementation of a word level inverted index search with various cool functions that can be used on very large corpora.
Rubyturgy is designed to help students perform textual analysis on publicly available texts. It is intended to mimic Unix text utility functionality, while providing more useful representations of data for students of literature.
Have you ever wanted to dive into the seedy underworld of Mexican Pro Wrestling but never had the opportunity? Your chance is now here. Currently LuchaManager is a barely running text based pet raising game, written in python.
Logprocessor is a tool for analyzing log files in a rigorous and reliable way. You write a command file and the log file can be run though the tool resulting in pass/fail.
Filecmp is a command-line application that gets two filenames as argument and outputs the comparison between them - e.g. if they are the same or not... it may look irrelevant but sometime it's very useful, specially inside scripts.
A small java application which helps CMake users who want to efficiently organize their source tree with the help of Visual Studio's project editor and CMake's "source_group" command.
OSCAR (Open Source Chemistry Analysis Routines) is software for the semantic annotation of chemistry papers. The modules OPSIN (a name to structure converter) and ChemTok (a tokeniser for chemical text) are also available as standalone libraries.
kuechenstation is a text mode client for the xmms2 daemon. Unlike other text
mode client's it's not commandline based, but features all you would expect
from a graphical client: scrolling title, playlist editor, medialib browser
SubtitleGet is a command-line tool written in Python, that I am creating in my spare time. It's main purpose is to search, download and/or upload subtitles using Opensubtitles.org fantastic site.
Document summarization system. By adding document content to system, user queries will generate a summary document containing the available information to the system.
APHID is an easy-to-install, easy-to-use DocBook environment. APHID transforms source documents (text or XML) into multiple output formats (HTML, PDF, HTML Help, etc.). APHID is a derivative work of eDE (http://www.e-novative.de).
Lurker is a mailing list archiver designed for capacity, speed, simplicity, and configurability in that order.
Noteworthy features include: google-style searching on all fields, chronology preserving threads, multilingual, and attachment support.
Proxy Tester is a proxy list tester and then it generates a wpad.dat file you can use for your browser based on working servers. wpad.dat file selects a random proxy server from the list of provided working proxy servers with each connection request.
=DOES NOT WORK ANYMORE AS DSA HAS PUT CAPTCHA= DSA Practical Driving Test Monitor helps you find any available practical driving test slot within specified date range. Runs on Linux/Mac/Windows and automates your manual task of finding the test slot.
We've moved to GitHub!
https://github.com/Karmabunny/pelzini
Code documenter for PHP, JavaScript and C which uses a database to facilitate such features as Search! Written in PHP, and using MySQL, PostgreSQL or SQLite, Pelzini is the
documentation tool of the future!
Powerful line based text editor for many platforms with a very advanced selection system. Currently, it is not usable, it is buggy and barely has any features (very few even basic ones)
It generates a recursive automaton text file to be parsed with the tools included in the librfa (http://fsvieira.com) project, that is able to generate to other formats/languages (C/C++, graphviz, ...)