Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Free and Open Source HR Software
OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.
Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
The WAW tools provide a more automated approach to web harvesting, based on archival principles, automated process and human decision-making. The model seeks to use archival principles to preserve documents on the web.
Skwish is a fast, simple, lightweight Java library for storing blobs on the file system. It allows multiple concurrent readers and writers, provides all-or-nothing write semantics, and is designed to survive abnormal, unclean shutdown.
Run applications fast and securely in a fully managed environment
Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.
Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
This project consists of a proxy application that supports SMTP and POP3 and archives those emails in rfc 822 format. It does not include any archive but uses existing archives through a plugin infrastructure.
Java Password Tracker. A cross-platform application to safely store away your passwords. The file format is compatible with the original password safe application.
The Weblog Downloader allows users to download and archive their weblog entries in their local drives. Includes login functions to download "private" posts, option to download images, and Java GUI. Currently implemented for Xanga weblog entries.
Optimize every aspect of hiring with Greenhouse Recruiting
Hire for what’s next.
What’s next for many of us is changing. Your company’s ability to hire great talent is as important as ever – so you’ll be ready for whatever’s ahead. Whether you need to scale your team quickly or improve your hiring process, Greenhouse gives you the right technology, know-how and support to take on what’s next.
Backup Easy is a software for automatic backup of folders and files at chosen day and time. A graphical interface provides clear, easy control. Remote backup to FTP, FTP over SSL or SFTP server is supported.
AFRS utilizes the Linux inotify kernel tool to monitor your filesystem for file changes, records or displays those changes and if desired, replicates those changes in near real time to other systems running AFRS using rsync. AFRS can also be used as a li
The GIS Metadata Manager is a simple Java-based desktop application to manage files from geographic information systems (GIS) in a specified directory by grouping files into records, adding metadata and searching the metadata to find files.
*** NO LONGER MAINTAINED ***
A user-friendly, cross-platform, dCache compatible GridFTP client with an intuitive graphical interface, packaged as an Eclipse plug-in and as a standalone application.
Create versioned backups. Combines the best features of a mirror and incremental backups. The last backup is always a mirror, previous versions are available as incremental backups. Uses no special file format. Mainly an AntTask that is easy to integrate.
Open Access is a software for the release, exchange and sharing of contents. The software is conformant to the MPEG standard Open Access application format. It allows to package contents of any type into a single file and to attach additional metadata.
Contineo is a Web-based Document Management System (DMS). Features: Folder organization, document Versioning, Bulk import, import from mailbox. NOTE: this project has been DISMISSED in favor of LogicalDOC http://sourceforge.net/projects/logicaldoc
Photo cop is a small set of command line utilities to help you manage your photos and movies.
It provides :
- multi folder duplicate detection
- update copy to a storage folder
- cdrom data original retrieval
- multi set of folder comparison
FileCopier is a platform-independent utility for copying multiple files and folders to multiple locations with a single click. If an existing file had to be renamed, it provides a mechanism for undoing the copy if so desired.
Encrypti is a multi platform, easy to use encrypting tool for keeping your files secure. This application offers portability of encrypted files between various platforms. Secure your files with Encrypti!
Storesim simulates creation of large amount of random files. Directory structure created is similar to what is done in very large file repositories: based on hashdirs (found in mailservers, NFS servers, files servers, image databases...)
CDnavigator is an application that files your CD in a database. It also deposits information about photos (JPG), music (MP3) and films saved in media. You can also store another metadata as notes, rating, etc. to this data. You can search over all items.
Personalbackup a company-wide solution for backing up all your Windows machines and Samba shares. Personalbackup uses a web frontend for the users and administrators. No client software is needed at all to pull backups of your critical data.