Contextor is a lightweight, simple-to-use, Java-based library that helps developers and researchers work with the general concept of a resource; for example, resources can be text resources, web resources, images, and videos.
PHP Business Rules is an open-source application layer for PHP. The logic is defined in rules and facts. Like most business rule engines (BREs), it uses the Rete algorithm.
The Tensor Voting Framework (TVF) is a powerful technique for perceptual grouping, manifold learning, etc. It has proved to be a useful tool in the computer vision community. OpenTVF is an open-source implementation of TVF.
Dicodess is a software framework for developing distributed cooperative decision support systems. It helps build DSS for mission-critical decision-making situations happening in dynamic, rapidly changing and often unpredictable distributed environments.
Example-based Modeling (EMO) is a tool to create data models, with examples, using a web interface. You interactively create a web-accessible database of models and samples for those models. A white paper describes the underlying assumptions.
Open data mining platform. Provides common architecture for algorithms of various types. Efficient processing of arbitrarily large volumes of data thanks to data streaming. Weka and Rseslib partially integrated. (www.debellor.org)
A graph persistence framework and associated toolkits based on a navigational query style. A default persistence engine is included for storing graph objects and properties into simple relational tables but the actual storage model is pluggable.
PerfParse. Storage and analysis of binary performance data produced by Nagios. High-quality, accurate graphs of live data from standard Nagios plugins. Permanent history of plugin results with advanced analysis tools.
MLboost: Machine Learning boost library in Python. MLboost's main goal is to speed up any machine learning project by simplifying data preprocessing, feature selection, and data visualisation. Designed by machine learning practitioners to let them do ML.
Cyberinfrastructure Shell (CIShell) is an open-source, community-driven framework/application for the integration and utilization of datasets, algorithms, tools, and computing resources. Algorithms can be integrated using most programming languages.
Ex-Crawler is divided into 3 subprojects (crawler daemon, distributed GUI client, and web search engine) which together provide a flexible and powerful search engine supporting distributed computing. More information: http://ex-crawler.sourceforge.net
Data mines the voting record and other actions of Members of the UK Parliament. Extracts information from the parliament website and stores it in a database. Provides tools to analyse the information, producing statistics and tables about the MPs.
ngram is a module to compute the similarity between two strings. It differs from Python's "difflib.SequenceMatcher" in that it cares more about the size of both strings. ngram is a port and extension of the Perl module called "String::Trigram".
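The general idea of comparing two strings by the n-grams they share can be sketched as follows. This is a minimal illustration and not the ngram module's actual API or scoring formula: the function names `ngrams` and `similarity`, the `$` padding character, and the shared-over-total multiset ratio are all assumptions chosen for the example.

```python
from collections import Counter


def ngrams(text, n=3, pad="$"):
    """Split text into overlapping n-grams, padding both ends (hypothetical helper)."""
    padded = pad * (n - 1) + text + pad * (n - 1)
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def similarity(a, b, n=3):
    """Return shared n-grams divided by total distinct-by-count n-grams (0.0 to 1.0)."""
    counts_a = Counter(ngrams(a, n))
    counts_b = Counter(ngrams(b, n))
    shared = sum((counts_a & counts_b).values())  # multiset intersection
    total = sum((counts_a | counts_b).values())   # multiset union
    return shared / total if total else 1.0


if __name__ == "__main__":
    print(similarity("night", "nacht"))
    print(similarity("abc", "abc"))
```

Because the denominator grows with the n-grams of both strings, a short string matched against a much longer one scores low even when the short string is fully contained in the long one.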
WARNING: rgdal is only available for download from CRAN; this repository is no longer used for development. NEVER use the very out-of-date file bundles for download from this site! See http://cran.r-project.org/web/packages/rgdal/index.html.
PyramidWorkshop is an image-compression system based on the scientific work of John Robinson (inventor of BTPC) on image compression with adaptive linear predictors; it is on average more efficient than PNG/JPEG2000.
SvnCrawler is a tool for analysis of Subversion (SVN) repositories. The project includes different plug-ins for generating CSV reports, RSS feeds, and more. Creating a new plug-in takes only a few lines of bash. More info: http://svncrawler.sourceforge.net
With the "xix" library, GATE functionality is available in XQuery (via an MXQuery extension). OpenCalais invocation is supported, too. Source code at http://sgv-jenkins-01.ethz.ch/job/xixlib/ws/. See "Show project details" for instructions.
Transana is a cross-platform tool for the transcription and qualitative analysis of audio and video data. It also provides the ability to identify and organize analytically interesting portions of videos, as well as attach keywords to those video clips.
ViSBARD (Visual System for Browsing, Analysis, and Retrieval of Data) is an interactive visualization and analysis tool for space physics data. It provides an integrated 3-D/2-D environment to analyze measurements across many spacecraft and MHD models.