Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight
Lock Down Any Resource, Anywhere, Anytime
CLEAR by Quantum Knight is a FIPS-140-3 validated encryption SDK engineered for enterprises requiring top-tier security. Offering robust post-quantum cryptography, CLEAR secures files, streaming media, databases, and networks with ease across over 30 modern platforms. Its compact design, smaller than a single smartphone image, ensures maximum efficiency and low energy consumption.
A small Python script that allows administrators to place quotas on *nix accounts without much technical knowledge or root access. It is ideal for those who share and/or resell web hosting or other resources.
Yet Another Rsync Backup Utility (YARBU). A robust but powerful snapshot-like rolling backup utility with email notification and straightforward configuration.
Sith'Ari is an disk backup solution, based on hard links.
Daily snapshots are built on the previous, then updated.
Only modified files are stored.
Sith'Ari Backup stands for : Synched Incremental Time Historical Archiver Rapid Imager Backup
Dun and Bradstreet Risk Analytics - Supplier Intelligence
Use an AI-powered solution for supply and compliance teams who want to mitigate costly supplier risks intelligently.
Risk, procurement, and compliance teams across the globe are under pressure to deal with geopolitical and business risks. Third-party risk exposure is impacted by rapidly scaling complexity in domestic and cross-border businesses, along with complicated and diverse regulations. It is extremely important for companies to proactively manage their third-party relationships. An AI-powered solution to mitigate and monitor counterparty risks on a continuous basis, this cutting-edge platform is powered by D&B’s Data Cloud with 520M+ Global Business Records and 2B+ yearly updates for third-party risk insights. With high-risk procurement alerts and multibillion match points, D&B Risk Analytics leverages best-in-class risk data to help drive informed decisions. Perform quick and comprehensive screening, using intelligent workflows. Receive ongoing alerts of key business indicators and disruptions.
Music-MP21 provides an authoring tool to pack an entire music album into a single .m21 file (MPEG-21 file) and a plugin for VLC Media Player to play these files. The .m21 files are compatible with MP4 files.
MonoCommander - monocmd. Target of this project is to develop an opensource "commander"-style file manager written in c#, based on mono, gtk-sharp, glib-sharp. While it should work cross-platform, the primary taget is linux/gnome desktop.
StFA provides the ability that allows you have atomic write, copy, delete, move operations on file system. Just like operate a simple relationship database.
Web Integrator is a Multiplatform web based Integration/replication solution with support for multiple data sources. Web Integrator allows connecting of diverse data sources/consumers located on different platforms and remote locations.
Secure and customizable compute service that lets you create and run virtual machines.
Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a good balance of price and performance. Compute optimized (C2) machines offer high-end vCPU performance for compute-intensive workloads. Memory optimized (M2) machines offer the highest memory and are great for in-memory databases. Accelerator optimized (A2) machines are based on the A100 GPU, for very demanding applications.
Reed-Solomon encoder/decoder for Block Devices. Provides methods for enhanced data recovery from corrupted storage media, along with faster methods for reading the data when no corruption is encountered.
lmftp is a perl program to schedule file transfers via ftp on a onetime or regular basis. lmftp supports compression, local backups, email notifications and automatic retries of failed jobs. It's intended to be used for file distribution or backup jobs
noflushd is a Linux daemon that monitors disk activity and spins down idle disks. It then blocks further writes to the disk to prevent it from spinning up again. Writes are cached and flushed to disk when the next read request triggers a spin-up.
This is a set of programs to manipulate Commodore 64 Datasette tape files. These tools can convert a WAV file to a TAP file and vice versa, or extract PRG/SEQ files from a TAP file. Additional tools are planned to convert PRG/SEQ files to a TAP file.
Doppioni searches for duplicates in your disks. I wrote Doppioni as exercise, after one year without coding in Java. At the moment is just an alpha version. Help would be very apreciated.... want to join this project?
An easy to use application for backing up files to multiple locations, including Amazon's S3. Runs on Windows, OS X and Linux. Bonkey is based on the Eclipse RCP and equires Java. Downloads at http://thebackupmonkey.blogspot.com
A Java Programm to sort JPEG-Images from digital still image cameras. The sorting is done by the capture date/time of the EXIF-Header. Image meta informations from EXIF-Header is stored in a database. Double files will be recognized by MD5-hash compare.
If you don't know how command line works or too lazy to open command prompt and want to create a hard link, use this tool. This only work on Windows 2000 and later with NTFS file system
This application reads the output of Web forms posted on your website (usually via email) and converts them to csv files for importing into a database, or managing in Excel
Defuddle is a data translation engine that supports mapping arbitrary ASCII and binary file formats to a data model defined in XML Schema in a manner similar to, but not compliant with the Data Format Description Language (http://www.ogf.org/dfdl/).
aRSync is "Another RSync" and provides a pure java implementation of the rsync algorithm. This implementation is intended to be ready to use for file transfer, to be easy to extend for further use cases and - most important - to become productive.
RBME is a simple tool to assist in implementing a general backup solution based on rsync and hardlinks with an automated management of the disk space on the backup media. rbme is enterprise-ready and send email reports about everything it does.