Showing 202 open source projects for "linux file parser"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    ...Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender : https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    LXR Cross Referencer
    A general purpose source code indexer and cross-referencer that provides web-based browsing of source code with links to the definition and usage of any identifier. Supports multiple languages. Up-to-date information in http://lxr.sourceforge.net **CAUTION!** Releases are now issued on Codeberg due to legal reasons. See https://codeberg.org/ajlittoz/CB_LXRsource/releases
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    ftserver-android

    ftserver-android

    Self-hosted search engine with web service to share discoveries with

    Full Text Search Engine for Android Mobile, Windows Desktop, Linux Server. You can use the KeyWord to find relative WebSites, dig in important information, search answers. It has a web server inside, use it to share discoveries with people. App's Source Codes included, can be freely distributed over the internet in an unchanged or changed form. Check the file size after downloaded the Android APK.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Easyspider - Distributed Web Crawler

    Easyspider - Distributed Web Crawler

    Easy Spider is a distributed Perl Web Crawler Project from 2006

    Easy Spider is a distributed Perl Web Crawler Project from 2006. It features code from crawling webpages, distributing it to a server and generating xml files from it. The client site can be any computer (Windows or Linux) and the Server stores all data. Websites that use EasySpider Crawling for Article Writing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Streamline Azure Security with Palo Alto Networks VM-Series Icon
    Streamline Azure Security with Palo Alto Networks VM-Series

    Centrally manage physical and virtualized firewalls with Panorama

    Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.
    Learn more
  • 5
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    ...Extracts Information from Web by parsing millions of pages. Store data into Derby OR MySQL Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby Database - Written in Java Cross Platform See also Free Email Sender in this link: https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    X-RAY

    X-RAY

    The next web scraper, see through the <html> noise

    Supports strings, arrays, arrays of objects, and nested object structures. The schema is not tied to the structure of the page you're scraping, allowing you to pull the data in the structure of your choosing. The API is entirely composable, giving you great flexibility in how you scrape each page. Paginate through websites, scraping each page. X-ray also supports a request delay and a pagination limit. Scraped pages can be streamed to a file, so if there's an error on one page, you won't...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    ...OpenSearchServer runs on Windows and Linux/Unix/BSD.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 8
    The MangaStream Downloader is an open source application written in Java for managing and downloading manga from the site mangastream.com and mangafox.me. It is written under the GNU-GPL license and uses an open source HTML parser - TagSoup. Follow the project page on Facebook for updates: https://www.facebook.com/MangastreamDownloader
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    YouSeer is an open source search engine framework, which was built on top of other open source components. It’s part of the general SeerSuite framework. YouSeer utilizes Hereitrix as a crawler and solr as an indexing system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (file systems, web sites, mail boxes, ...) and the file formats (documents, images, ...) occurring in these systems.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Movie Central Lobby

    Movie Central Lobby

    MovieCentral of sid's with Extra Features

    This is same project of Sid's MovieCentral >>>https://sourceforge.net/projects/moviecentral/ Added Upcoming, Currently playing movie details Added movies filter based on year, genre, rating Also added gallery style movies display Changed video player to vlc player Powered by tmdb
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Meta Tag Generator. Allows you to research SEO keywords generate proper compliant meta tags and output them to a HTML or text file for insertion into a finished web project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    That project aims at providing a clean API, and the corresponding C++ implementation, for parsing travel-focused requests (e.g., "washington dc beijing monday r/t +aa -ua 1 week 2 adults 1 dog").
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 15
    getNya

    getNya

    Simple application for downloading pictures from Zerochan.net

    Simple java application for downloading high-quality pictures from Zerochan.net. You can find images by size or a tag. It's simple. And flat. All you need to do: download .jar file and run it with Oracle JVM (or any another JVM supporting image decoding)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    Robots.io

    Robots.txt parsing library

    Robots.io is a Java library designed to make parsing a websites 'robots.txt' file easy. The RobotsParser class provides all the functionality to use robots.io. Domains passed to RobotsParser are normalised to always end in a forward slash. Disallowed Paths returned will never begin with a forward slash. This is so that URL's can easily be constructed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ChanDown - 4chan Image Downloader

    ChanDown - 4chan Image Downloader

    Auto Rescanning - Search Terms - Regularly Updated With New Features

    ========== NOTE: (AS OF 11/05/2015) 4chan html structure has changed, full images are downloaded as well as the thumbnail. Fix coming shortly (after my exams are over) to stop the thumbnails from downloading. ========== This is the first release of my 4chan image downloader. This downloader packs loads of great features such as the search ability. Check the features section and be sure to let me know if you want a feature added. Coming Soon: - Wiki, explaining in depth how to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    imgal is a simple image gallery and file browser written in one php file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    icfs

    Indexed cloud file system

    icfs provides a way for mapping multiple URLs, referencing various web-enabled artifacts, into a single file system presentation under Linux. URLs can reference static pages, REST calls, or web objects in an object store.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 21
    MovieGrabber

    MovieGrabber

    MovieGrabber is a fully automated way of downloading

    MovieGrabber has now moved to GitHub! https://github.com/binhex/moviegrabber
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    mindCMS

    mindCMS

    Small, fast and flexible Content Management System for PHP / MySQL

    Small, fast and flexible Content Management System - CMS for PHP / MySQL A very small, fast, compact and flexible Content Management System (CMS) for PHP Webservers using a reasonable amount of functions. Easily maintain your web pages and online files in any webbrowser.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Create one search engine, get your Clickbank and Amazon affiliates search engine today. Completely open script, put your Amazon and Clickbank affiliate and start running, and Google Adsense if you want! Introduction: Get it run in 5 minutes. 1 - An easy install (just upload all files to ftp) 2 - Change the Clickbank and Amazon Affiliates ID in index.php file 3 - Grant write permissions to words.txt text file. Demo: www.i-need-to-buy.com Requirements: PHP...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    MineSoft Datamine System

    MineSoft Datamine System

    PHP application for datamining

    Application for datamining. Use for good not evil. this isnt totally practical if you are targetting MASS ammounts of websites. its not a bot. each url has to be entered by hand.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    SEO & SEM - Marketing Text Writer

    SEO & SEM - Marketing Text Writer

    Open Source SEO & SEM Text Creation Tools for free Article Writer

    Open Source Tool for Search Engine Optimization (SEO & SEM) used for automatic content processing. These SEO Content Genrators and Article Writers based on Text...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB