Showing 50 open source projects for "content analysis"

View related business solutions
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    SEO GEO Content Engine

    SEO GEO Content Engine

    Professional SEO and GEO content workflows for brands, SaaS teams

    SEO GEO Content Engine is a comprehensive system designed to combine content generation, optimization, and analysis into a single AI-driven pipeline. It builds on tools like GEO Content Writer and SEO GEO Audit to create an end-to-end workflow for producing and refining search-optimized content. The system automates the process of identifying opportunities, generating content, and validating its effectiveness based on modern search criteria. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    SEO Machine

    SEO Machine

    A specialized Claude Code workspace for creating long-form

    SEO Machine is an AI-powered content production system built as a structured workspace for generating long-form, SEO-optimized blog content through automated workflows. It integrates research, writing, analysis, and optimization into a single pipeline, allowing users to produce high-quality articles tailored to search engine performance. The system uses specialized commands and agents to perform tasks such as keyword research, competitor analysis, content drafting, and optimization. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    SEO GEO Audit

    SEO GEO Audit

    Professional SEO and GEO audit workflows for brands, SaaS teams

    SEO GEO Audit is an AI-driven analysis tool designed to evaluate how well websites are optimized for both traditional search engines and emerging generative AI search systems. It focuses on identifying gaps in content, structure, and metadata that affect visibility in geographic and semantic search contexts. The system analyzes websites against modern search patterns, including how content is interpreted by AI-powered search engines and retrieval systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    sperm

    sperm

    Collection of reverse engineering articles curated for learning

    sperm is a curated repository that gathers a collection of notable articles related to reverse engineering and software analysis. It primarily acts as a knowledge archive where previously published technical posts are compiled and organized for easier access and long-term reference. These articles originate from multiple technical communities and platforms and are exported into Markdown format to maintain a consistent and readable structure. sperm focuses on educational material that explores reverse engineering techniques, mobile analysis, and related technical discussions. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Weibo Crawler

    Weibo Crawler

    Python crawler for collecting and downloading Sina Weibo user data

    ...It automates the process of gathering posts, user profile details, and engagement metrics from one or more target accounts. weibo-crawler can extract comprehensive information about users, including profile attributes such as nickname, follower count, following count, and account metadata. It also captures detailed data about each post, including the content, publishing time, topics, mentions, likes, reposts, and comments. In addition to textual data, the project can download original media from posts, such as images, videos, and Live Photo content. Collected data can be exported to structured formats such as CSV or JSON or stored in databases for further analysis and research. It supports incremental crawling so users can periodically collect only newly published posts, making it useful for ongoing monitoring or dataset updates.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    watercrawl

    watercrawl

    AI-ready web crawler that extracts and structures website content

    WaterCrawl is an open source web crawling and data extraction platform designed to transform website content into structured data suitable for machine learning and AI workflows. It enables developers and researchers to crawl web pages, extract meaningful information, and convert it into formats that are easier to process and analyze. It provides a modern crawling system that can automatically navigate links, control crawl depth, and collect content from targeted sections of a website....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    BrowserOS

    BrowserOS

    Agentic browser; privacy-first alternative to ChatGPT Atlas

    BrowserOS is an open-source, agentic web browser built on a Chromium base that integrates AI agents directly into the browsing experience. Rather than just doing standard browsing, it places AI intelligence at the core: you can connect your own API keys (for e.g., OpenAI, Anthropic, Google Gemini) or run local models (via e.g., Ollama) so that your browsing data and automation stay on your machine — privacy and control are emphasized throughout. The interface remains familiar to users of...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 8
    Spider

    Spider

    High-performance Rust web crawler and scraper for large-scale data

    ...It focuses on speed, concurrency, and reliability by using asynchronous and multi-threaded processing to handle large volumes of web pages. It can rapidly crawl websites to collect links, retrieve page content, and extract structured information from HTML documents. Spider can operate concurrently across many pages, allowing it to gather large datasets in a short period of time. Spider also provides mechanisms for subscribing to crawl events so developers can process page data such as URLs, status codes, or HTML content as it is discovered. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Gemma 4 Browser Assistant

    Gemma 4 Browser Assistant

    On-device AI agent Chrome extension powered by Transformers.js

    Gemma 4 Browser Assistant is an open-source browser extension that embeds an AI assistant directly into the browsing experience, powered by on-device machine learning models. It uses Transformers.js and Gemma models to run inference locally in the browser, eliminating the need for external servers and preserving user privacy. The extension includes a side panel interface that allows users to interact with the AI while browsing, enabling tasks such as summarizing pages and answering...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Ansvisor

    Ansvisor

    Open-source Answer Engine Optimization platform

    Ansvisor is an open-source AI visibility and answer engine optimization platform built to help brands understand how they appear in AI-generated answers. It tracks mentions across platforms such as ChatGPT, Google AI Overview, Google AI Mode, Gemini, Perplexity, Copilot, Grok, and Claude. The platform helps teams measure visibility scores, compare competitors, analyze prompt performance, and identify which sources AI systems cite alongside their brand. It also includes AI traffic analytics,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    eslint-plugin-jsx-a11y

    eslint-plugin-jsx-a11y

    Static AST checker for a11y rules on JSX elements

    Static AST checker for accessibility rules on JSX elements. This plugin does a static evaluation of the JSX to spot accessibility issues in React apps. Because it only catches errors in static code, use it in combination with axe-core/react to test the accessibility of the rendered DOM. Consider these tools just as one step of a larger a11y testing process and always test your apps with assistive technology. If you installed ESLint globally (using the -g flag in npm, or the global prefix in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    SiteOne Crawler (desktop app)

    SiteOne Crawler (desktop app)

    A free, feature-rich web analyzer and exporter/cloner you will love!

    A free in-depth website analyzer providing audits of security, performance, SEO, accessibility and other technical aspects. Available as a desktop application for Windows/macOS/Linux and as a CLI tool for advanced users and CI/CD processes. It also includes an offline web page exporter (website clone, mirror).
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Catbird Linux

    Catbird Linux

    Linux for content creation, web scraping, coding, and data analysis.

    Catbird Linux is a USB pluggable Live Linux operating system built for media creation, web scraping, and software coding. It is the daily driver you want for retrieving data, making videos or podcasts, and making software tools to automate the repetitive tasks. It is ready for work in Python, Lua, and Go languages, with numerous packages for web scraping or downloading data via API calls. Using Catbird Linux, it is possible to accomplish in depth stock market analysis, track weather...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 14
    Linha do Texto is a web based game for classificate semiotic text content from user input data with 1 to 4 parameters predifined in each section. It can be used to discuss discrete and continuous semantic categories in a scientific and educational fied.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    lxspider

    lxspider

    Educational Python web scraping case collection for many sites

    ...It focuses heavily on practical cases that illustrate how different platforms handle requests, authentication parameters, and anti-scraping protections. lxSpider includes examples targeting areas such as e-commerce platforms, social media services, content sites, research databases, and information portals. Many of the cases explore techniques related to request analysis, signature generation, and reverse engineering that are often needed when interacting with modern web applications. It also provides supplementary materials and tools used in crawling workflows, such as debugging utilities and reverse-engineering aids.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    RED HAWK

    RED HAWK

    All-in-one reconnaissance and vulnerability scanning toolkit for sites

    RED HAWK is an open source command-line security tool designed for information gathering, vulnerability scanning, and web reconnaissance tasks. It combines multiple scanning and analysis capabilities into a single toolkit to help security researchers and penetration testers quickly analyze a target website. It can collect a wide range of information about domains, servers, and web applications, including network details, hosting configuration, and content management system detection. It also provides vulnerability scanning features that help identify potential issues such as error-based SQL injection vulnerabilities and sensitive file exposure. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Web Security Audit

    Web Security Audit

    Passively audits the security posture on current page for your browser

    ...Assume that the tool is to be used on non-malicious websites, currently not under attack or compromised. Add-on wants to report security misconfigurations, or failure to use best security practices. - Add-on tries to analysis the commonly vulnerable setting of servers: lack of use of security-relevant headers, including: - strict-transport-security - x-xss-protection - content-security-policy - x-frame-options - x-content-type-options It is available for Opera Beta and Developer browser - https://bit.ly/2TvvFw4
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    ECommerceCrawlers

    ECommerceCrawlers

    Collection of Python ecommerce and website crawler examples projects

    ...Developers can study the individual project documentation to understand the analysis process.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    mzitu

    mzitu

    Python crawler that downloads image galleries and analyzes titles

    ...It demonstrates how to build a scraper that navigates gallery pages, retrieves image links, and saves the images locally in a structured directory layout. It focuses on automating the collection of large sets of images by programmatically parsing page content and iterating through gallery entries. mzitu also includes a simple analysis script that processes downloaded folder names to generate statistics and visualizations. Using text segmentation and frequency analysis, the project can create a word cloud representing common keywords found in the dataset. This makes the repository both a scraping example and a small data analysis experiment built around the collected content. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    WeChatSogou

    WeChatSogou

    Python library to crawl and retrieve data from WeChat accounts

    ...Developers can integrate the library into scripts or larger data collection systems to automate gathering content from public accounts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Plagiarism Checker Pro

    Plagiarism Checker Pro

    Plagiarism Checker for Wordpress

    Plagiarism Checker Pro plugin checks for copied or duplicate content on Wordpress platform. Plagiarism Checker PRO provides plagiarized percentage for the copied content using an unique set of algorithm. For time being supports English language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Perl Web Scraping Project

    Perl Web Scraping Project

    Perl Web Scraping Project

    ...It is a form of copying, in which specific data is gathered and copied from the web, typically into a central local database or spreadsheet, for later retrieval or analysis. Web scraping a web page involves fetching it and extracting from it.[1][2] Fetching is the downloading of a page (which a browser does when you view the page). Therefore, web crawling is a main component of web scraping, to fetch pages for later processing. Once fetched, then extraction can take place. The content of a page may be parsed, searched, reformatted, its data copied into a spreadsheet, and so on.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Log Monitoring - Monitor/Check Log Files

    Log File Monitoring - Check Log Files, Analyze, Alert on any UNIX Log

    Monitoring log files is mandatory in all UNIX environments. LoGrobot does this for you efficiently. It analyzes, graphs and alerts on system log files, application log files, database log files, custom log files...basically any log file. Benefits: Automatically scans log files for errors or user specified patterns Shows the offending log entries in the alerts generated on a monitored log file Shows latest size of a log file at the time of the most recent log check Shows total...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Reconcile: Web Credibility Evaluation

    Reconcile: Web Credibility Evaluation

    Robust online credibility evaluation of web content

    The goal of the project is the creation of new mechanisms for supporting users in the evaluation of Web content credibility and improvement of the quality of Web content credibility evaluation. Our society increasingly relies on the vast amount of information available on the Internet. As a consequence, the issue of judging or evaluating the credibility of this information becomes of crucial importance. Thus, algorithms and mechanisms for the assessment of credibility of Web content can...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    LogDistiller is a logfile merge and sort tool. Log content is classified according to rules configured in an XML file. Classification results go into reports, which are published : simply stored in a file, sent by mail, or even added to a news feed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo