Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Lightspeed golf course management software
Lightspeed Golf is all-in-one golf course management software to help courses simplify operations, drive revenue and deliver amazing golf experiences.
From tee sheet management, point of sale and payment processing to marketing, automation, reporting and more—Lightspeed is built for the pro shop, restaurant, back office, beverage cart and beyond.
Commandline spell checking script that checks single word. User doesn't have to open file or word processor just to check single word. Simply type: "wordcheck <word>" where <word> is the word you want to check. Uses GNU Aspell.
HTML template library written in C inspired by perl HTML::Template.
Template language has HTML-like tags (tmpl_var, tmpl_if, tmpl_loop, etc.)
Use library to build a variable list and pass it to a template.
Simple text editor for editing multiple language files in parallel. Corresponding translations are automatically highlighted in order to aid reviewing against multiple sources. Supports gettext PO and key = value format. Highly recommended for polyglots.
Ideal for lending professionals who are looking for a feature rich loan management system
Bryt Software is ideal for lending professionals who are looking for a feature rich loan management system that is intuitive and easy to use. We are 100% cloud-based, software as a service. We believe in providing our customers with fair and honest pricing. Our monthly fees are based on your number of users and we have a minimal implementation charge.
binreplace is a commandline tool to search and replace binary sequences in a file. It can be very powerful when used together with grep and sed. It uses prefix tree internally to search. It's efficient even with millions of search terms.
csart - Clever-Search-And-Replace-Text Search, find and replace text in named files or recursive in all directories (-r). If choosen, strings are only replaced if another key-string occures in the line (-w)
Just simple library for get text file information, for checking file text description. for example calculate how much character (with space or without space), line, blank line where we can find in the file text.
Cloud data warehouse to power your data-driven innovation
BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.
BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
Since Azeri Turkish is written in different alphabets throughout the world, This project aims to convert texts between some mainly used alphabets. specifcally, conversion between arbaic and latin alphabets is intended.
RTF2HTML is a name for a cross-platform C++ library (DLL, OCX) and command-line utility, which is intended to convert documents from Rich Text Format (e.g. Word, OO Writer) to HTML. Its features are tiny size, speed, low mem usage and compact output.
oEdtk is an open source project for automated printing processing.
It's a toolkit for building applications that prepare flat file data for massive printing of documents.
CSR is a program to identify, modify and replace code in automatic generated source code. A configuration file controls the behavior and actions of CSR. The configuration file usually contains regular expressions to search and replace the code.
Tubaina is a book generator. Given a text written in afc syntax, a markup language, an html or pdf output is generated. This project has been moved to Github: http://github.com/caelum/tubaina
Flywheel is a multi-purpose script engine. Define once, publish anywhere. Only seven template commands to learn. Full Java classpath access to functions through the Execute command. Iterate through lists with the List command.
The Python scripts for the conversion from the Chinese Pinyin transcription(ISO 7098) to International Phonetic Alphabet(IPA), comprised of a core module for developers and a flexible GUI application for the common end-users on Modern Chinese phonetics.
Zestaw dwóch narzędzi w C do konwersji oraz rozpoznawania standardów kodowania polskich znaków w plikach tekstowych. Obsługują ponad 25 różnych kodowań stworzonych w Polsce na przestrzeni lat. Wersja pod systemy uniksowe oraz DOS/Windows.
Send unique lines to standard out. Remove duplicate lines from anywhere within a text file. Usage example : uniquelines command_history > commmand_history.unique
xhead is a commandline utility for viewing part of a line delimited ascii text file written in C. It is similiar to the linux shell command "head", but it also allows you to skip a certain number of lines in the file before printing.
Like Unix-Tail BUT:
- Runs with or without GUI
- Suspend and resume tailing at runtime
- Can monitor a set of Files
- Print output to a textfield, stdout or file
- Runs in "Grep" mode, too (Read files once)
- (Almost) the same options as Unix-Tail
Enjoy you can now include a beautiful tables in ascii art in your text document, notepad, help files, source code. A TCL script can do a magic ascii Art Tables. The script compatible in windows and linux.
ZML, the Zeitung Markup Language, is a simple CMS for small newspapers. It was specifically designed to publish a student newspaper in print and on the Web. It uses LaTeX and XHTML. So far, it is documented in German only.
JLoom is a JSP like template language for text generation - e.g. source code, HTML, XML. JLoom templates are modular encapsulated. Parameters can be any Java type, even Generics or Varargs. There is a plugin for Eclipse and a commandline tool.