Becape is an open source backup tool aimed at personal/desktop usage. It does incremental backups and stores the backup info in a SQLite database, making it possible to restore the exact state of the backed-up files at a chosen date.
The Splitter Backup System is designed to prepare TAR archives to be burned to DVD media. Since the maximum file size in an ISO9660 file system is 2G, Splitter will traverse a given list of file system nodes and split off TAR files of 2G or less.
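The splitting idea described above can be sketched as a simple greedy pass over the files in traversal order. This is an illustrative sketch only, not Splitter's actual algorithm: the function name `plan_volumes` and its arguments are hypothetical, and TAR header and padding overhead is ignored.

```python
def plan_volumes(files, limit=2 * 1024**3):
    """Group (name, size) pairs into volumes whose total size stays <= limit.

    Hypothetical helper: files are taken in the order given and a new
    volume is started whenever the next file would exceed the limit.
    """
    volumes, current, used = [], [], 0
    for name, size in files:
        if size > limit:
            # A single file larger than the limit cannot fit in any volume.
            raise ValueError(f"{name} is larger than the volume limit")
        if current and used + size > limit:
            volumes.append(current)
            current, used = [], 0
        current.append(name)
        used += size
    if current:
        volumes.append(current)
    return volumes


# Small limit used only to keep the example readable.
print(plan_volumes([("a", 5), ("b", 4), ("c", 3), ("d", 9)], limit=10))
```

Each inner list would then become one TAR archive; a real tool would also need to account for per-file archive overhead when checking the limit.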
BOBUP: backup and recovery for application business objects. Backup usually means backing up files or databases, which is non-intuitive. With BOBUP, you back up and recover granular information (e.g., a specific customer) independently of files or databases.
Tapedispatcher is a service which provides coordinated concurrent access to all tape libraries in an organization. Supported are tape drives and libraries attached locally to a host, and those shared over a fibre channel SAN.
Simple class that allows backing up an entire website. It automatically works around timeout limits by splitting the job into multiple requests, supports update-only runs, and offers many other options.
A small utility written in Java that helps you extract songs from your iPod, by searching inside its hidden folders for the song, artist or album that you want.
Arrowbase is a collection of tools for backup purposes. Together they form a backup system that can be used on more than one operating system. This makes the project not only widespread but portable as well.
MyBackupTcl is software that allows users (or system administrators) to back up their own files easily and automatically. It includes bzip2 compression, GNU PGP, DVD and CD ISO, FTP, e-mail, automatic dumping for MySQL, SVN, Trac, and much more.
Bitcoop is a peer-to-peer backup system enabling the storage of files on remote computers. The size of files depends on the quantity you wish to share with the other peers. This software is intended for server farms that wish to back up data among themselves.
Repairs corrupt files by combining the good bytes from multiple corrupt copies. If you have (downloaded) many copies of the same file, and the corruptions are in different places, it is possible to fix the file.
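One way to picture this kind of repair is a byte-wise majority vote across the copies. The sketch below is an assumption about how such a tool could work, not the project's documented method; `repair_by_majority` is a hypothetical name, and it assumes all copies have the same length and that each position is correct in most copies.

```python
from collections import Counter


def repair_by_majority(copies):
    """Rebuild a file by taking the most common byte at each position.

    Hypothetical helper: `copies` is a list of equal-length bytes objects.
    """
    if not copies:
        raise ValueError("need at least one copy")
    length = len(copies[0])
    if any(len(c) != length for c in copies):
        raise ValueError("all copies must have the same length")
    repaired = bytearray(length)
    for i in range(length):
        # Count how often each byte value appears at position i.
        counts = Counter(c[i] for c in copies)
        repaired[i] = counts.most_common(1)[0][0]
    return bytes(repaired)


original = b"hello backup world"
damaged = [
    b"hellX backup world",  # corrupted at position 4
    b"hello bXckup world",  # corrupted at position 7
    b"hello backup worlX",  # corrupted at position 17
]
print(repair_by_majority(damaged) == original)
```

With only two copies a vote cannot decide which byte is right, so at least three copies with damage in different places are needed for this particular approach.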
SshView is an ssh terminal for the Eclipse IDE. Be advised: The latest release appears to require Eclipse 3.2 (build M20060629-1905) aka the Callisto release. Contact me if you're interested in lending a hand with the project.
A simple Perl script for performing full and incremental system backups, with good control over excluding or including files by type, directory, etc. Incremental backups can be emailed off-site.
Character-based display of NetWorker activity: the tapes currently in use, the current speed on each, and the number of sessions on each. You can observe messages and daemon.log files in "tail -f" style.
UberImaging centrally controls the disk imaging process of many nodes on a network. Clients booted via PXE are served a small initrd image containing udpcast, a client module, and other tools allowing clients to be remotely controlled by a portable GUI.
GTFileExplorer is a file manager. You can copy, paste, cut, delete, and rename files via shortcut, menu, popup menu, and drag & drop. You can also open files with an external program. The main functionality is the synchronization of two directories, with a preview.
Alternate language bindings for the libdar library written by Denis Corbin. The original application, DAR, is a command-line backup tool that uses libdar, a library implemented in C++.
ArcAngel is a simple backup utility for programmers comfortable with XML and RegExp. It creates Zip format archives of specified files, with flexible file-selection rules, and pre- and post-backup task execution.
Archiving/backup framework written in Python. Uses archiving directives within the file system itself in the form of .archmanrc files: Python modules that specify which files and directories to include.
Robbie is a cross-platform framework for backing up data on computers. It uses secure hashes to keep track of files, and when you alter them it sends the new version to your backup store - either locally or remotely.
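Hash-based change tracking of this kind can be sketched as follows. This is a minimal illustration, not Robbie's actual code: SHA-256 is an assumed choice of secure hash, and `file_digest`, `changed_files`, and the in-memory `known` dictionary are hypothetical names.

```python
import hashlib
from pathlib import Path


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def changed_files(root, known):
    """List files under root whose digest differs from the one in `known`.

    `known` maps relative path -> digest and is updated in place, so a
    second call with unchanged files returns an empty list.
    """
    changed = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        rel = str(path.relative_to(root))
        digest = file_digest(path)
        if known.get(rel) != digest:
            known[rel] = digest
            changed.append(rel)
    return changed
```

The files returned by `changed_files` are the ones a backup run would need to send to the backup store; a real tool would persist `known` between runs rather than keep it in memory.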
dumpnet is a collection of bash scripts that allows you to back up multiple servers simultaneously. You can do incremental file system dumps with tar, dump, and rsync, back up databases, and much more. AIDE can also easily be integrated.
CAIRN is a modular copy and restore program for the imaging of a computer. It copies every file on a computer and figures out how to recreate it from scratch. It is primarily network oriented but is also flexible enough to boot from any possible method.