Small tools to convert binary data formats from spectrometers and oscilloscopes. Translators are now included for: * Stanford SR430 * LeCroy oscilloscope
DICOM Tag Slayer lets you view, modify, export, and find differences in DICOM-format files. The program is multiplatform and written in Python with PyQt4 and PyDICOM; it provides both a GUI and a CLI for easier script integration.
HTTP functional and non-functional (load and performance) toolkit based on Jython/Grinder (http://grinder.sf.net) ...includes capabilities to support: SOA services, REST, JSON/XML encoding, AES and WS security ... and a stub to collect requests
BibteXML is a bibliography schema for XML that expresses the content model of BibTeX – the bibliographic system for use with LaTeX. Stylesheets and conversion tools are provided.
This is a CSV (Comma Separated Values) to ARFF file format converter script written in Python. Just 'Save As' your CSV file as 'test.csv' and this script will convert it to 'test.arff'!
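A minimal sketch of what such a conversion involves, not the project's actual script: the function name `csv_to_arff`, the `relation` parameter, and the type-guessing rule (a column is NUMERIC if every value parses as a float, otherwise nominal) are all assumptions made for illustration. Quoting of values that contain spaces or commas is deliberately left out.

```python
import csv
import io


def csv_to_arff(csv_text, relation="test"):
    """Convert CSV text (first row is the header) into an ARFF string.

    Hypothetical helper: the type inference below is a simplification.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]

    def is_number(value):
        try:
            float(value)
            return True
        except ValueError:
            return False

    lines = [f"@RELATION {relation}", ""]
    for col, name in enumerate(header):
        values = [row[col] for row in data]
        if all(is_number(v) for v in values):
            lines.append(f"@ATTRIBUTE {name} NUMERIC")
        else:
            # Nominal attribute: list the distinct values in sorted order.
            nominal = ",".join(sorted(set(values)))
            lines.append(f"@ATTRIBUTE {name} {{{nominal}}}")

    lines += ["", "@DATA"]
    lines += [",".join(row) for row in data]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(csv_to_arff("x,label\n1.5,a\n2,b\n"))
```

To mirror the 'test.csv' to 'test.arff' workflow, the text could be read from and written to files with the standard `open()` call.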
The purpose of this project is to define, design, and code a model-sharing system for the RepRap replicating machines, so that end users and designers can easily find, create, modify, or build object models.
The London Datastore (http://data.london.gov.uk) was created by the Greater London Authority (GLA) as an innovation towards freeing London’s data. This SourceForge project will be used to open-source our development efforts surrounding data formats.
Serialization + persistence: in a few lines of code, compress and annotate Python objects into SQLite, then later retrieve them chronologically by keywords without any SQL. Most useful "standard" module for a database to store schema-less data.
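The idea can be sketched with the standard library alone. This is an illustration under stated assumptions, not the project's API: the helper names `open_store`, `save`, and `find`, the table layout, and the use of `pickle` plus `zlib` are all hypothetical choices. Note that `pickle` should only be used to load data you trust.

```python
import pickle
import sqlite3
import time
import zlib


def open_store(path=":memory:"):
    """Open (or create) a store; the schema here is an assumption."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS objects ("
        "id INTEGER PRIMARY KEY AUTOINCREMENT, "
        "created REAL, notes TEXT, payload BLOB)"
    )
    return conn


def save(conn, obj, notes=""):
    """Pickle and compress obj, then store it with keyword notes."""
    payload = zlib.compress(pickle.dumps(obj))
    conn.execute(
        "INSERT INTO objects (created, notes, payload) VALUES (?, ?, ?)",
        (time.time(), notes, payload),
    )
    conn.commit()


def find(conn, keyword):
    """Return stored objects whose notes contain keyword, oldest first."""
    cur = conn.execute(
        "SELECT payload FROM objects WHERE notes LIKE ? ORDER BY id",
        (f"%{keyword}%",),
    )
    return [pickle.loads(zlib.decompress(row[0])) for row in cur]


if __name__ == "__main__":
    store = open_store()
    save(store, {"threshold": 0.5}, notes="config experiment-1")
    save(store, [1, 2, 3], notes="raw numbers")
    print(find(store, "config"))
```

The caller never writes SQL; the helpers hide it, and ordering by the autoincrement `id` gives insertion (chronological) order.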
nexml is an XML format that represents phylogenetic data (trees, character sequence alignments, sets of OTUs, substitution models). The format is inspired by the commonly used flat-file NEXUS format, but is more robust and easier to process.
ESML is an interchange technology that utilizes external metadata to allow applications to plug and play seamlessly with Earth Science specific datasets in heterogeneous formats.
An Inkscape extension for editable (digital) timing diagrams in publication quality, based on a very simple state-oriented language for signal description.
A Python package to find repetitive format patterns in HTML pages and extract information from them using these patterns. The idea is that pages containing some kind of list will show a repetitive pattern to the human eye (the page format).
Redland is a set of object-based, modular and portable C RDF libraries providing RDF APIs for the graph, triple storage (librdf), RDF/XML parsing and serializing (Raptor), and SPARQL RDF querying (Rasqal). Language APIs in Perl, PHP, Python, Ruby and others.
Now here: https://github.com/plastex/plastex
plasTeX is a Python-based LaTeX document processing framework. It gives DOM-like access to a LaTeX document, as well as the ability to generate multiple output formats (e.g. HTML, DocBook, tBook, etc.).
The Virtual USB Analyzer is a graphical tool for browsing traces of captured USB data. It supports logs generated by Ellisys hardware analyzers, VMware's software analyzer, and Linux's usbmon.
C4me aims to provide a convenient way of editing XML files (and, in the distant future, even more modding-related files) for modifications of Sid Meier's Civilization 4. It is still in its infancy and not really usable yet - join and help change that!
html2wordml is a Python application for converting HTML pages to WordML (Microsoft Word XML) documents. The application can be used to create a new WordML document or to merge content into an existing template.
1. Create an object-oriented Python script that can represent mathematical concepts and their properties. 2. Represent all numeric values exactly. 3. Provide a variety of formats to export or embed representations of the mathematical concepts.
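Goals 2 and 3 can be illustrated with Python's standard `fractions` module. This is only a sketch under assumptions: the `to_latex` helper is hypothetical and is not taken from the project, which may represent and export values quite differently.

```python
from fractions import Fraction


def to_latex(value):
    """Render an exact rational as a LaTeX string (hypothetical helper)."""
    if value.denominator == 1:
        return str(value.numerator)
    sign = "-" if value < 0 else ""
    return f"{sign}\\frac{{{abs(value.numerator)}}}{{{value.denominator}}}"


# Exact arithmetic: no floating-point rounding is involved.
total = Fraction(1, 3) + Fraction(1, 6)

if __name__ == "__main__":
    print(total)            # exact rational value
    print(to_latex(total))  # LaTeX export of the same value
```

Because `Fraction` stores an integer numerator and denominator, sums and products stay exact, and each export format only needs to read those two integers.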