Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Cloud data warehouse to power your data-driven innovation
BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.
BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
Sylli is a universal syllabifier. Developed for Italian, it can easily be adapted to any language that is claimed to respect the SSP. Sylli divides timit, strings, files and directories into syllables.
WordNetLMF converts WordNet (http://wordnet.princeton.edu/) lexicographer files into KYOTO-LMF, the LMF dialect used in the KYOTO project (http://www.kyoto-project.eu/).
Proactively monitor, manage, and support client networks with ConnectWise Automate
Out-of-the-box scripts. Around-the-clock monitoring. Unmatched automation capabilities. Start doing more with less and exceed service delivery expectations.
Vtgrep stands for Visual Tree Grep and is a GUI to tgrep. It allows the user to build graphical representations of tree structures and then translates them into the tgrep syntax. provides search functionality, as well as search and result logging.
CIDIAN is a very simple offline Chinese-English dictionary written in Gambas2. Lookup any character or an entire text.Almost 100000 entries. Based on the CC-CEDICT project.
Coptic - English and Coptic - Czech dictionary related to Crum's coptic dictionary, written in C++, based on MySql, with Qt GUI. Is developed as part of project Marcion, containing only coptic data without study environment.
OO Pinyin Guide is a Java extension for OpenOffice 3 or higher. It enables the user to add pinyin transliteration over Chinese characters inside a text document. This tool can be useful for people learning or teaching Chinese.
Award-Winning Medical Office Software Designed for Your Specialty
Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.
RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.
The Open Translation Engine (OTE) is a web-based translation dictionary manager. The OTE allows a community of users to create and manage one or many translation dictionaries. The OTE is written in PHP and uses a MySQL database.
This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
A lyrical analysis and classification tool focused specifically on rhyming style in rap lyrics. Functions include phonetic transcription, rhyme visualization, and rapper classification.
Connecting Historical Authorities with Links, Contexts and Entities. CHALICE is a historic placename gazetteer for the UK, published as Linked Data and linked to other widely-used sources of placename reference information on the semantic web.
stocleka is a project divided into a UI and a library for cleaning user stories and converting them to arff files (used for Weka). it may be mainly used for research and scientific purposes.
Varamozhi is a free English-Malayalam transliteration library. It can transliterate Malayalam text between Malayalam and English scripts. Varamozhi takes as the input, the mapping between a Malayalam font and a transliteration scheme; outputs functions i
Wikipedia Concept Association Map (WCAM) is new approach for textual knowledge representation and understanding. All concepts and associations are stored in a graph database for better performance and easy distribution.
Shift Word Helper is a scrabble-like games helper, which includes an online dictionary client. Shift Word Helper est un bon assistant pour les jeux de mots tels que le scrabble, avec un dictionnaire en-ligne integré. English & french supported
It's a utility application for updating and integrating translation memories, created by the Autshumato ITE, over a network. Licensed under the TMate OpenSource License and free to download and be used by anyone.
Affisix is a program for automatic recognition of prefixes. It takes large amount of words and according to the user setting it tries to determine which segments of these words are prefixes.
The aim of this project is to develop the basics for the usage of Maya Hieroglyphs on computers. There is quite a lot of stuff to do: Developing encodings for Unicode, fonts and input methods.