Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
This bash script adds several csv files into an existing .ods spreadsheet.
Each csv file creates a sheet.
If there is no input spreadsheet provided, it creates one and inserts all csv files in it.
This is a bash script for Linux for a Cukoo. It plays Cukoo sounds every hour and a Pendulum clock sound every half hour.
Sounds sample are free from http://www.wavlist.com/soundfx/020/ and included.
Next goals are to create a GUI/applet for Cinnamon.
Converts DVB-T MHEG-5 Data to XMLTV for EPG listings.
This project uses a few simple scripts to convert data grabbed from RedButton download (rb-download) into XMLTV format that MythTV uses for program guide listings.
Code repo moved to github.
https://github.com/solorvox/mheg2xmltv
clean-html.sh - bash script to clean HTML.
1) Convert the stored pages in UTF-8.
2) defecating saved pages of extra spaces, tabs,
blank lines, scripts, images, meta-information.
PS: when an <pre> produces limited filtering!
! Not all characters can be transcoding UTF-8. Be careful.
a simple bash script to harvest aleph x server. since some libraries dont have oai-pmh servers installed or configured, it turned out to be an option to harvest aleph x server to get libraries catalog data.