Java exception extractor. This utility parses all given files (either plain text or bzipped) and searches them for exceptions. It then matches the exceptions against grouping rules (regular expressions) and can also group unrecognised exceptions.
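A minimal Python sketch of the workflow described above, not the utility's actual code. The exception pattern, the rule names in `GROUPING_RULES`, and the helpers `open_text` and `group_exceptions` are all hypothetical, chosen only for illustration.

```python
import bz2
import re
from collections import defaultdict

# Assumed pattern: a dotted Java class name ending in Exception or Error,
# followed by the rest of the line (usually the message).
EXCEPTION_RE = re.compile(
    r"\b(?:[a-zA-Z_]\w*\.)+[A-Z]\w*(?:Exception|Error)\b[^\n]*"
)

# Hypothetical grouping rules: group name -> regular expression.
GROUPING_RULES = {
    "null-pointer": re.compile(r"NullPointerException"),
    "io": re.compile(r"java\.io\.\w+Exception"),
}


def open_text(path):
    """Open a plain-text or bzip2-compressed file for reading as text."""
    if path.endswith(".bz2"):
        return bz2.open(path, "rt", encoding="utf-8", errors="replace")
    return open(path, "r", encoding="utf-8", errors="replace")


def group_exceptions(lines):
    """Group exception lines by the first matching rule.

    Lines that match no rule are grouped under their exception class name,
    prefixed with "unrecognised:".
    """
    groups = defaultdict(list)
    for line in lines:
        match = EXCEPTION_RE.search(line)
        if match is None:
            continue  # not an exception line
        text = match.group(0).strip()
        for name, rule in GROUPING_RULES.items():
            if rule.search(text):
                groups[name].append(text)
                break
        else:
            # No rule matched: fall back to grouping by class name.
            class_name = text.split(":", 1)[0].strip()
            groups["unrecognised:" + class_name].append(text)
    return dict(groups)
```

A log file could then be processed with `with open_text("server.log.bz2") as handle: groups = group_exceptions(handle)`, where the file name is a placeholder.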
Entirely source-based system. The goal is to provide an extremely flexible but easy-to-compile Linux-like OS ready for SSI clusters. The installation procedure is network-independent and automated.
PyLine is a shell designed for users who want an easier alternative to cmd. It is in its early stages, so it may not have as many commands. Works on Linux and Windows, and may work on Mac.
Python script that plots access points from an inSSIDer GPX wardrive capture to an HTML file displaying a Google map. Useful if you care about WPA and WPA2 data instead of just plain old WEP...
PyVSS is a Python module that lets you copy files using the Volume Shadow Copy Service (http://en.wikipedia.org/wiki/Shadow_Copy). It is in an early development phase, yet is stable, simple and straightforward to use.
Software for managing portable profiles: you can use GNU/Linux while loading the profiles of Evolution, Firefox, Pidgin (and others) from a flash drive. Includes a way to back them up.
Blaze is an application launcher that distinguishes itself from the others by being able to automate recurrent tasks performed in the file system or in any application on Microsoft Windows.
Here live a few applications that can ease the burden of App-V/SoftGrid sequencing and of maintaining the infrastructure. Dual license. Details at http://sourceforge.net/apps/mediawiki/softgridhelper/index.php?title=Main_Page
FWInit is a simple firewall initialization utility coded in Python. Its configuration scripts are also written in Python and have access to useful functions for manipulating the iptables filters and keeping the same state between invocations.
Mobile Manager is a D-Bus service to manage mobile data devices in Linux-based platforms. Mobile Manager provides an abstraction layer for interoperability with the hardware, making it easy to add support for new devices.
Ascetic is a Python- and Matplotlib-based Solaris monitoring and charting tool. The tool may also be run in an offline mode on any major operating system.
A transparent network messaging system for offloading computations to other, possibly more capable, machines when they are available. The system is intended for cases where the results of a computation aren't needed immediately.
Set of flexible tools for Unix sysadmins that let them store and retrieve useful details about their hosts. The tools can easily be embedded in automation, maintain documentation, provide reports, and act as an ITIL CMDB.
A simple command line regex search [and replace] written in Python. Searches individual files or whole directories, with the option to search recursively into subdirectories.
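The behaviour described above could be sketched in Python roughly as follows. This is an illustration under stated assumptions, not the project's code: the function names `iter_files` and `search`, their parameters, and the choice to treat every file as UTF-8 text are all hypothetical.

```python
import os
import re


def iter_files(path, recursive=False):
    """Yield a single file, or the files in a directory (optionally recursing)."""
    if os.path.isfile(path):
        yield path
        return
    if recursive:
        for root, _dirs, names in os.walk(path):
            for name in sorted(names):
                yield os.path.join(root, name)
    else:
        for name in sorted(os.listdir(path)):
            full = os.path.join(path, name)
            if os.path.isfile(full):
                yield full


def search(pattern, path, recursive=False, replacement=None):
    """Return (filename, line number, line) for every matching line.

    If replacement is given, matching lines are rewritten in place.
    Assumption: files are UTF-8 text; binary files are not handled safely.
    """
    regex = re.compile(pattern)
    hits = []
    for filename in iter_files(path, recursive):
        with open(filename, "r", encoding="utf-8", errors="replace") as handle:
            lines = handle.readlines()
        changed = False
        for number, line in enumerate(lines, start=1):
            if regex.search(line):
                hits.append((filename, number, line.rstrip("\n")))
                if replacement is not None:
                    lines[number - 1] = regex.sub(replacement, line)
                    changed = True
        if changed:
            with open(filename, "w", encoding="utf-8") as handle:
                handle.writelines(lines)
    return hits
```

For example, `search(r"TODO", "src", recursive=True)` would list matching lines under a placeholder `src` directory, and passing `replacement="DONE"` would also rewrite them.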