Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors.
Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
Search engine and data mining applications and ClueWeb datasets.
The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning-to-rank library, the ClueWeb09 and ClueWeb12 datasets, and the Sifaka data mining application.
Linha do Texto is a web-based game for classifying semiotic text content from user input data, with 1 to 4 parameters predefined in each section. It can be used to discuss discrete and continuous semantic categories in scientific and educational fields.
DSTK - DataScience ToolKit is free, open-source software for statistical analysis, data visualization, text analysis, and predictive analytics. A newer version with a smaller file size can be found at: https://sourceforge.net/projects/dstk3/
It is designed to be straightforward and easy to use, and familiar to SPSS users. While JASP offers more statistical features, DSTK tends to be a broader workbench, including text analysis and predictive analytics features. ...
Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging, and metadata functions to provide a new user experience for desktop navigation and document management.
Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
Like Unix-Tail BUT:
- Runs with or without GUI
- Suspend and resume tailing at runtime
- Can monitor a set of Files
- Print output to a textfield, stdout or file
- Runs in "Grep" mode, too (Read files once)
- (Almost) the same options as Unix-Tail
hypKNOWsys aims at developing a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing).
This project intends to create an indexing search engine for knowledge management. The primary objective is to apply an information retrieval core and to implement knowledge discovery techniques such as data mining and text mining algorithms.
SENTENSA Knowledge Miner is a platform-independent tool for searching any text. SENTENSA uses robust methods of indexing and searching text, drawing on more than 20 years of experience in information retrieval.
(Almost) everything a scholar in the Humanities needs (polytonic Greek fonts, stylistic and metrical analysis tools, search engines on TLG and PHI) on a single Linux Live CD, ready to use anywhere, at home or at university, without installation.
The application will be able to provide further information about the location of a host by analyzing the sender's IP address. It works like other localizer software and provides different types of visualisation (map, text).
The "Universal Content Evaluation and Categorisation Software" is a program for analysing the content of a website or, more generally, of a text. The text is arranged into dozens of categories, permitting more efficient web searches and information processing.
ReturnMyPicture is an HTML content generation and processing program written in Java. The implementation is a stock market forum providing users with HTML form input elements to annotate technical analysis charts and to post these with a text message.
This project is a web-based text log file processor written in Java for the analysis of log files produced by the MMORPG Vanguard: Saga of Heroes. It will expand to include other uses, including other games and security log analysis.
A tool to extract the information in any log file into a database. It has no inherent knowledge of particular log formats, but by using built-in rules and asking a few simple questions it should be able to parse any type of text log intelligently.
What is a domain-specific language intended for constructing web services on top of ordinary web pages, or otherwise automating web-related tasks.
Using a powerful pattern-matching sublanguage, What strives to be for XML/HTML what Perl is for text analysis.