Latent Semantic Analysis extension for PHP. This PHP extension is meant for small-scale LSA experiments such as providing related content, document mapping, or tuning LSA parameters in a loop.
RDF-DocMan is a document manager based on a Sesame (RDF repository) backend. Documents are stored in the filesystem and their metadata in a Sesame repository.
It was developed for the porQual web content generator (also on sf.net).
The Vodoo/Stream project lets users define transducers dedicated to document analysis. Such transducers describe how fragments are matched and transformed. A document can be an XML fragment, free text, or something else, depending on extensions.
T-Rex (Trainable Relation Extraction) is a highly configurable machine learning-based Information Extraction from Text framework, which includes tools for document classification, entity extraction and relation extraction.
Irudiko is a library written in C++ for generating Locality Sensitive Hashing sketches from any textual or web document. Designed mainly to work with HTML pages, it also has optimization support for English or Italian documents.
Graphist uses PHP's GD library to produce data plots, in real time, served up as standard images for consumption by web pages (though such images could be saved for use in other document types).
Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display how difficult it is to comprehend, using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.
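For readers unfamiliar with the two scores named above, here is a minimal Python sketch of the standard published formulas. It is not taken from Flesh itself: the function names, the vowel-group syllable heuristic, and the punctuation-based sentence splitting are all assumptions made for illustration, and real tools usually count syllables more carefully.

```python
import re
from typing import Tuple


def count_syllables(word: str) -> int:
    """Rough heuristic (an assumption): one syllable per vowel group, minimum one."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def readability(text: str) -> Tuple[float, float]:
    """Return (Flesch Reading Ease, Flesch-Kincaid Grade Level) for English text."""
    # Naive splitting on ., ! and ? (an assumption; abbreviations will confuse it).
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)

    # Standard published coefficients for both scores.
    reading_ease = 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word
    grade_level = 0.39 * words_per_sentence + 11.8 * syllables_per_word - 15.59
    return reading_ease, grade_level


if __name__ == "__main__":
    ease, grade = readability("The cat sat. The dog ran.")
    print(f"Reading ease: {ease:.2f}, grade level: {grade:.2f}")
```

A higher Reading Ease score means easier text, while the Grade Level approximates a US school grade, so very simple text can score below zero.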
Qualiweb aims to provide semantic web metrics for modeling a website's visitors' needs according to a given taxonomy or document classification. The web metrics provided by Qualiweb indicate how successful each of the website's topics has been.
vyasa is a digital library application that incorporates the functions of digital asset and document management systems. It facilitates information retrieval and knowledge discovery by providing comprehensive metadata generation and semantic analysis.
Phoenix is an information extraction engine written in Java.
Controlled by rules (declared in XML), it extracts information from any XML document (unstructured XHTML/OpenOffice documents). Supports XPath, additional conditions and top-down decomposit
teiPublisher is an XML repository management system for creating TEI document repositories. The software components are
XML analysis tools for Ontology Development
Create/Delete/Edit backup tools
Search page customizations and result display
XSL styleshe
Loganalyzer for the Windows XP Firewall and the Linux iptables firewall. Generates a nice HTML document with statistics from all the packets captured by the firewall. The program is written in Python and has an (optional) graphical interface.
The SimpleRDF/XSL template simplifies RDF/XML sources as much as possible to allow easy processing. The SimpleRDF/PHP5 parser takes advantage of SimpleRDF/XSL. It has an extremely simple API. You can parse any RDF/XML compatible document (incl. RSS) and much more...
JUDGE (Java Utility for Document Genre Eduction) features automatic classification and clustering of documents, optionally as a webservice.
The program is written entirely in Java and makes use of the Weka machine learning toolkit.
ARDAT (Automated Requirements Document Analysis Tool) is a Java-based standalone application. It analyzes requirements documents to aid in the approval/rejection process of HP-style project request specifications.
Mojo Webstats is a simple web statistics tool. It uses JavaScript, PHP and MySQL to process web statistics. Includes: 1) date/time, 2) document location and title, 3) IP address and hostname, 4) browser version, 5) visitor resolution, and 6) referrer.
Unstructured text is no match for Litersta - see further details here: https://litersta.com
Working with text now becomes effortless when paired with Litersta textual analytics software.
Unlike database fields, which are easily queried, text contains unstructured data that must be parsed for key objects that can be transformed into powerful metrics.
Litersta - textual analytics - software leverages statistical algorithms to programmatically locate, and extract, overall document...
A set of classes for Natural Language Processing in PHP for:
1. Part-of-speech tagging - Brill, n-gram, HMM
2. Princeton WordNet querying and access
3. Document summarization
4. Document classification - EM, Bayes
5. Stemming - Porter, Lancaster
Comparer is a document comparer program. The initial task of the comparer is to find the likeness of all documents given to it by generating a graph of all usable connections between documents.
This browser-based tool is a flexible solution for documenting both logical and physical database schema designs. It supports simple version tracking concepts to document schema changes in varying stages of planning and implementation.