This is a CSV (Comma-Separated Values) to ARFF file format converter script written in Python. Save your CSV file as 'test.csv' and the script will convert it to 'test.arff'.
The purpose of this project is to define, design, and code a model-sharing system for the RepRap replication machines, so that end users and designers can easily find, create, modify, or build object models.
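As an illustration of what such a conversion involves (not the project's actual code), here is a minimal sketch. It assumes the first CSV row is a header, treats a column as NUMERIC only when every value parses as a float, and otherwise emits a nominal attribute listing the distinct values. The function name `csv_to_arff` and the default relation name are hypothetical. Quoting of names or values that contain spaces and handling of missing values are left out.

```python
import csv
import io


def csv_to_arff(csv_text, relation="test"):
    """Convert CSV text (first row = header) into ARFF text."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]

    def is_numeric(value):
        try:
            float(value)
            return True
        except ValueError:
            return False

    lines = [f"@RELATION {relation}", ""]
    for index, name in enumerate(header):
        column = [row[index] for row in data]
        if column and all(is_numeric(v) for v in column):
            lines.append(f"@ATTRIBUTE {name} NUMERIC")
        else:
            # Nominal attribute: distinct values in first-seen order.
            values = list(dict.fromkeys(column))
            lines.append(f"@ATTRIBUTE {name} {{{','.join(values)}}}")
    lines += ["", "@DATA"]
    lines += [",".join(row) for row in data]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(csv_to_arff("width,label\n1.5,yes\n2.0,no\n"))
```

Reading 'test.csv' from disk and writing 'test.arff' would wrap this function with two ordinary `open()` calls.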
nexml is an XML format that represents phylogenetic data (trees, character sequence alignments, sets of OTUs, substitution models). The format is inspired by the commonly used flat-file NEXUS format, but is more robust and easier to process.
DOP Software's mission is to streamline the processes of waste and recycling businesses by providing them with dynamic, comprehensive software and services that increase productivity and quality of performance.
ESML is an interchange technology that utilizes external metadata to allow applications to plug and play seamlessly with Earth Science specific datasets in heterogeneous formats.
An Inkscape extension for editable (digital) timing diagrams in publication quality, based on a very simple state-oriented language for signal description.
A Python package that finds repetitive formatting patterns in HTML pages and uses them to extract information. The idea is that a page containing some kind of list shows a repetitive pattern that is visible to the human eye (the page format).
Redland is a set of object-based, modular, and portable C RDF libraries providing RDF APIs for the graph and triple storage (librdf), RDF/XML parsing and serializing (Raptor), and SPARQL RDF querying (Rasqal). Language APIs are available for Perl, PHP, Python, Ruby, and others.
html2wordml is a Python application for converting HTML pages to a WordML (Microsoft Word XML) document. The application can be used to create a new WordML document or to merge content into an existing template.
1. Create an object-oriented Python script that can represent mathematical concepts and their properties. 2. Represent all numeric values exactly. 3. Provide a variety of formats to export or embed representations of the mathematical concepts.
A powerful Python module that lets you output HTML code from within a Python script in a very efficient and convenient fashion. Code your web page like a GUI! Create tags and modify their attributes at any time during your script. http://pyh/googlecod
This Python script takes an exported WordPress XML file and outputs a single HTML document containing all posts in order of entry, along with a table of contents broken down by category. CSS tags are added for easy formatting.
CSVChecker is a Python utility designed to check CSV files for field length and format. Checks are configured per field through a YAML configuration file.
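To make the idea of per-field length and format checks concrete, here is a small sketch. It is not CSVChecker's own code or configuration schema: the rule keys `max_length` and `pattern`, the function `check_csv`, and the sample field names are all hypothetical. A plain dictionary stands in for the YAML file so the example needs only the standard library.

```python
import csv
import io
import re

# Hypothetical per-field rules; a real tool would load these from YAML.
RULES = {
    "id": {"max_length": 4, "pattern": r"\d+"},
    "name": {"max_length": 10},
}


def check_csv(csv_text, rules):
    """Return a list of (line_number, field, problem) tuples."""
    errors = []
    reader = csv.DictReader(io.StringIO(csv_text))
    # Data starts on line 2; this assumes no field spans several lines.
    for line_no, row in enumerate(reader, start=2):
        for field, rule in rules.items():
            value = row.get(field) or ""
            max_length = rule.get("max_length")
            if max_length is not None and len(value) > max_length:
                errors.append((line_no, field, "too long"))
            pattern = rule.get("pattern")
            if pattern and not re.fullmatch(pattern, value):
                errors.append((line_no, field, "bad format"))
    return errors


if __name__ == "__main__":
    sample = "id,name\n12,Ann\nabc,Bartholomew-Long\n"
    for problem in check_csv(sample, RULES):
        print(problem)
```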
Parse, analyze and -- most importantly -- use COBOL data definitions. This gives you access to COBOL data from Python programs. Write data analyzers, one-time data conversion utilities and Python programs that are part of COBOL systems. Really.
Notice: Citation Style Language (CSL) development has moved to https://github.com/citation-style-language/ (only the xbiblio-devel mailing list is still in use). For up to date information on CSL, visit the project home at CitationStyles.org.
This Python 3.1 tool manipulates the coordinate system of CNC G-code for machining or engraving. It can flip X, Y, or Z coordinates, mirror X, Y, or Z coordinates, flip or mirror both X and Y coordinates, or insert Z motions into a G-code file that has none.
This is ETL software that loads data from DBF/XBase files into MySQL. The utility has a command-line interface and is designed to work without user interaction.
An HTML scraper that uses machine learning frameworks to extract labelled fields from raw HTML. The project also involves the development of a tool to display the semi-structured data generated by the scraper component.
SDict Viewer is a viewer for dictionaries in the open format developed by AXMASoft (free dictionaries are available for download at http://sdict.com). The primary goal of the project is to provide a usable dictionary app for Nokia Internet Tablets running Maemo.
SuperWillow is a music generation program. Artists have many influences, accumulated over the years by listening to countless pieces of music, and SuperWillow reflects this principle.
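One of these operations can be sketched as follows, on the assumption that changing an axis's orientation amounts to negating that axis's coordinate words. The description does not say how the tool distinguishes "flip" from "mirror", so this covers only plain negation. The function name `mirror_axis` is hypothetical, and only `;` comments are skipped (parenthesized comments are not handled).

```python
import re


def mirror_axis(gcode_line, axis="X"):
    """Negate every coordinate word for `axis` in one G-code line."""
    pattern = re.compile(rf"({axis})([-+]?\d*\.?\d+)", re.IGNORECASE)

    def negate(match):
        letter, number = match.group(1), match.group(2)
        if number.startswith("-"):
            return letter + number[1:]
        return letter + "-" + number.lstrip("+")

    # Leave anything after a ';' comment marker untouched.
    code, sep, comment = gcode_line.partition(";")
    return pattern.sub(negate, code) + sep + comment


if __name__ == "__main__":
    for line in ["G1 X10.5 Y-2 F300", "G0 X-3 Z1 ; move to X start"]:
        print(mirror_axis(line, "X"))
```

Applying the function once per axis, for example `mirror_axis(mirror_axis(line, "X"), "Y")`, handles both X and Y coordinates.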