28 projects for "text analysis" with 2 filters applied:

  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Browserless

    Browserless

    The headless Chrome/Chromium driver on top of Puppeteer

    Browserless is an open-source headless browser automation library and service built on top of Puppeteer that simplifies the process of running and scaling Chromium-based browser tasks in production environments. It provides a high-level API for interacting with headless Chrome, allowing developers to perform operations such as generating PDFs, capturing screenshots, extracting text or HTML, and automating web navigation. The project is designed to act as a production-ready abstraction layer...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Weibo Crawler

    Weibo Crawler

    Python crawler for collecting and downloading Sina Weibo user data

    weibo-crawler is a Python-based data collection tool designed to retrieve information from Sina Weibo user accounts. It automates the process of gathering posts, user profile details, and engagement metrics from one or more target accounts. weibo-crawler can extract comprehensive information about users, including profile attributes such as nickname, follower count, following count, and account metadata. It also captures detailed data about each post, including the content, publishing time,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3

    dhcpd-pools

    ISC dhcpd leases usage analysis

    This is dhcpd-pools ISC dhcp shared network and pool range usage analysis. Purpose of command is to count usage ratio of each IP range and shared network pool which ISC dhcpd is in control of. Users of the command are most likely ISPs and other organizations that have large IP space. Program is written C. Design goal is to get analysis done quickly where there is lots of data. On cheap laptop the speed of analysis is roughly 100k leases per second. Number of ranges, or shared networks,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    The Lemur Project

    The Lemur Project

    Search engine and data mining applications and ClueWeb datasets.

    The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
    Downloads: 62 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    TreeGraph

    TreeGraph

    Information Manager(split/analyze/compare/combine).

    For Homepage, Blog, Family Tree, Database, C#|hjt|js|chm Editor. Convert hjt2xml, (c#)cs2xml, chm2xml, js2xml, xml2cs, xml2js, xml2hjt, cs2hjt, hjt2cs, cs2chm, hjt2chm. IE/Opera/Firefox/PocketPC supported.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    mzitu

    mzitu

    Python crawler that downloads image galleries and analyzes titles

    ...It focuses on automating the collection of large sets of images by programmatically parsing page content and iterating through gallery entries. mzitu also includes a simple analysis script that processes downloaded folder names to generate statistics and visualizations. Using text segmentation and frequency analysis, the project can create a word cloud representing common keywords found in the dataset. This makes the repository both a scraping example and a small data analysis experiment built around the collected content. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Hunspell is a spell checker and morphological analyzer library and program designed for languages with rich morphology and complex compounding or character encoding. Hunspell interfaces: Curses, Ispell compatible pipe interface, OpenOffice.org UNO module
    Leader badge
    Downloads: 308 This Week
    Last Update:
    See Project
  • 9
    SCAN
    SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging and metadata functions to provide new user experience of desktop navigation and document management.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Honeyd2MySQL

    Honeyd2MySQL

    Utility to move honeyd.log entries to a MySQL database.

    This is yet another simple piece of software that extracts all the basic stats from honeyd’s text-based log files and inserts them in a MySQL database. Then you can run some queries and of course visualize the data if you want to. Many things are hardcoded or dead simple, but it does the job. The file is a modified version of “honeyd_importer” perl script originally writen by Joshua Gimer and shared through “honeypots” mailing list.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Like Unix-Tail BUT: - Runs with or without GUI - Suspend and resume tailing at runtime - Can monitor a set of Files - Print output to a textfield, stdout or file - Runs in "Grep" mode, too (Read files once) - (Almost) the same options as Unix-Tail
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    hypKNOWsys aims at developing a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    This project intends to create an indexing search engine, for knowledge management. The primary object is to apply an information retrieval core. And implement a knowledge data discovery theory such as data mining algorithm, text mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    SENTENSA Knowledge Miner is a platform independent tool for searching any text. SENTENSA uses robust methods of indexing and searching text, leveraging on experience from more than 20 years of information retrieval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    (Almost) all a scholar in the Humanities needs (polytonic Greek fonts, stylistic and metrical analysis tools, search engines on TLG and PHI) concentrated in only one Linux Live CD, ready to use everywhere at home or at University, without installation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    csvplus is a collection of Perl-based utilities for querying csv files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    A multiplatform visual implementation of the Unix utility grep
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    The application will be able to provide further information about the location of a host by analyzing the senders IP address. It works like other localizer software and provides different types of visualisation (map, text).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    The "Universal Content Evaluation and Categorisation Software" is a program for analysing a website’s, or more generally, a text’s content. The text is arranged in dozens of categories, permitting more efficient web searches and information processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    GoldSeeker is a small formatted data extraction application. It can parse informations from a text, html or other file, and export it in a database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Integradata is a plugable, rules-based, declarative data validation system written in Java
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    A statistics generator (html, text, charts) for distributed.net personal proxy servers log files written in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    ReturnMyPicture is an HTML content generation and processing program written in Java. The implementation is a stock market forum providing users with HTML form input elements to annotate technical analysis charts and to post these with a text message.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Gokstad will be a basic crawler and text analysis engine. Its current scope is to download news webpages and do simple text analysis on top of it. The name "Gokstad" comes from a sea worthy, clinker-built ship, constructed largely of oak by the vikings
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB