Showing 228 open source projects for "linux file parser"

  • 1
    Web Spider, Web Crawler, Email Extractor

    Free tool that extracts emails, phone numbers, and custom text from the web using Java regex

    ...Extracts information from the web by parsing millions of pages. Data is stored in a Derby database and is not lost when the spider is force-closed. - Free web spider, parser, extractor, crawler - Extraction of emails, phones, and custom text from the web - Export to Excel file - Data saved into Derby and MySQL databases - Written in Java, cross-platform. See also the free email sender: https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application: https://www.microsoft.com/openjdk
    Downloads: 2 This Week
  • 2
    LXR Cross Referencer
    A general-purpose source code indexer and cross-referencer that provides web-based browsing of source code with links to the definition and usage of any identifier. Supports multiple languages. Up-to-date information is available at http://lxr.sourceforge.net **CAUTION!** Releases are now issued on Codeberg for legal reasons. See https://codeberg.org/ajlittoz/CB_LXRsource/releases
    Downloads: 7 This Week
  • 3
    ftserver-android

    Self-hosted search engine with web service to share discoveries with

    Full-text search engine for Android mobile, Windows desktop, and Linux server. Use keywords to find related websites, dig into important information, and search for answers. It has a built-in web server that you can use to share discoveries with other people. The app's source code is included and can be freely distributed over the internet in unchanged or modified form. Check the file size after downloading the Android APK.
    Downloads: 0 This Week
  • 4
    Easyspider - Distributed Web Crawler

    Easy Spider is a distributed Perl Web Crawler Project from 2006

    Easy Spider is a distributed Perl Web Crawler Project from 2006. It features code for crawling web pages, distributing the results to a server, and generating XML files from them. The client side can be any computer (Windows or Linux), and the server stores all data. Websites that use EasySpider Crawling for Article Writing...
    Downloads: 0 This Week
  • 5
    Swagbucks Web Search Bot

    This is an automatic Swagbucks search automator.

    It lets you automate searches by supplying search terms in any appropriately formatted .txt file; the program reads the file and opens each search link on your computer. You must log in to Swagbucks for this to work. With that in mind, the title speaks for itself. To find new releases, check the version folder for each operating system here. To get archived releases (ill-advised), check out the GitHub:...
    Downloads: 0 This Week
  • 6
    Web Spider, Web Crawler, Email Extractor

    Free tool that extracts emails, phone numbers, and custom text from the web using Java regex

    ...Extracts information from the web by parsing millions of pages. Data is stored in a Derby or MySQL database and is not lost when the spider is force-closed. - Free web spider, parser, extractor, crawler - Extraction of emails, phones, and custom text from the web - Export to Excel file - Data saved into Derby database - Written in Java, cross-platform. See also the free email sender: https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application: https://www.microsoft.com/openjdk
    Downloads: 1 This Week
  • 7
    X-RAY

    The next web scraper, see through the <html> noise

    Supports strings, arrays, arrays of objects, and nested object structures. The schema is not tied to the structure of the page you're scraping, allowing you to pull the data in the structure of your choosing. The API is entirely composable, giving you great flexibility in how you scrape each page. Paginate through websites, scraping each page. X-ray also supports a request delay and a pagination limit. Scraped pages can be streamed to a file, so if there's an error on one page, you won't...
    Downloads: 2 This Week
  • 8
    OpenSearchServer Search Engine

    An open source search engine with a RESTful API and crawlers

    ...OpenSearchServer runs on Windows and Linux/Unix/BSD.
    Downloads: 18 This Week
  • 9
    The MangaStream Downloader is an open source application written in Java for managing and downloading manga from the sites mangastream.com and mangafox.me. It is released under the GNU GPL license and uses an open source HTML parser, TagSoup. Follow the project page on Facebook for updates: https://www.facebook.com/MangastreamDownloader
    Downloads: 1 This Week
  • 10
    YouSeer is an open source search engine framework built on top of other open source components. It is part of the general SeerSuite framework. YouSeer uses Heritrix as its crawler and Solr as its indexing system.
    Downloads: 0 This Week
  • 11
    Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (file systems, web sites, mail boxes, ...) and the file formats (documents, images, ...) occurring in these systems.
    Downloads: 3 This Week
  • 12
    Movie Central Lobby

    Sid's MovieCentral with extra features

    This is the same project as Sid's MovieCentral (https://sourceforge.net/projects/moviecentral/), with these changes: added details for upcoming and currently playing movies; added movie filters by year, genre, and rating; added a gallery-style movie display; changed the video player to VLC. Powered by TMDb.
    Downloads: 0 This Week
  • 13
    DBTV *abandoned*

    Version 3.x

    DBTV V3 - Your television episode and series torrent search and downloader! Note: This is just a spare-time, zero-profit project. If anyone is interested in donating so that I can spend more time fixing things sooner, visit: https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=FV2KMDSJXMSAN DBTV is a TV torrent downloader! It scans your TV show folder and lists your shows, then looks up and downloads torrents for newer (or older) TV shows. (Use a torrent program...
    Downloads: 0 This Week
  • 14
    Meta Tag Generator. Lets you research SEO keywords, generate compliant meta tags, and output them to an HTML or text file for insertion into a finished web project.
    Downloads: 0 This Week
  • 15
    This project aims to provide a clean API, and the corresponding C++ implementation, for parsing travel-focused requests (e.g., "washington dc beijing monday r/t +aa -ua 1 week 2 adults 1 dog").
    Downloads: 0 This Week
  • 16
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 7 This Week
  • 17
    getNya

    Simple application for downloading pictures from Zerochan.net

    Simple Java application for downloading high-quality pictures from Zerochan.net. You can find images by size or by tag. It's simple. And flat. All you need to do is download the .jar file and run it with the Oracle JVM (or any other JVM that supports image decoding).
    Downloads: 0 This Week
  • 18

    Robots.io

    Robots.txt parsing library

    Robots.io is a Java library designed to make parsing a website's 'robots.txt' file easy. The RobotsParser class provides all the functionality needed to use robots.io. Domains passed to RobotsParser are normalised to always end in a forward slash. Disallowed paths returned will never begin with a forward slash, so that URLs can easily be constructed by concatenation.
    Downloads: 0 This Week
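The slash convention described for Robots.io can be illustrated with a minimal sketch. This is written in Python for brevity and is not the library's Java API: the function names `normalise_domain`, `normalise_path`, and `build_url` are hypothetical, and the only behaviour assumed is what the description states (domains end in a forward slash, disallowed paths never begin with one).

```python
def normalise_domain(domain: str) -> str:
    """Return the domain with exactly one trailing forward slash (hypothetical helper)."""
    return domain.rstrip("/") + "/"


def normalise_path(path: str) -> str:
    """Return a disallowed path with any leading forward slashes removed (hypothetical helper)."""
    return path.lstrip("/")


def build_url(domain: str, path: str) -> str:
    """Join the two normalised parts by plain concatenation."""
    return normalise_domain(domain) + normalise_path(path)


if __name__ == "__main__":
    # Both spellings of the inputs yield the same URL, with no doubled or missing slash.
    print(build_url("http://example.com", "/private/"))
    print(build_url("http://example.com/", "private/"))
```

Because each side owns exactly one slash, callers can concatenate the two strings without checking for a doubled or missing separator.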
  • 19
    ChanDown - 4chan Image Downloader

    Auto Rescanning - Search Terms - Regularly Updated With New Features

    ========== NOTE (as of 11/05/2015): the 4chan HTML structure has changed, so full images are downloaded along with their thumbnails. A fix to stop the thumbnails from downloading is coming shortly (after my exams are over). ========== This is the first release of my 4chan image downloader. This downloader packs loads of great features, such as search. Check the features section and be sure to let me know if you want a feature added. Coming Soon: - Wiki, explaining in depth how to...
    Downloads: 1 This Week
  • 20
    Multi Whois Client
    Multi Whois is a small whois domain name search program capable of looking up domains in bulk via a wordlist file, or specific domains provided by the user. It's a good way to search for domains that are available to buy. The wordlist format consists of one domain name per line, with or without the TLD. Examples without a TLD: google, yahoo, foobar (note: you must select a TLD from the dropdown box). Examples with a TLD: google.com, yahoo.net, foobar.guru
    Downloads: 1 This Week
  • 21
    DeSR is a multilingual statistical dependency parser. It produces dependency parse trees for natural language sentences using a parsing model learned from annotated corpora.
    Downloads: 0 This Week
  • 22
    imgal is a simple image gallery and file browser written in a single PHP file.
    Downloads: 0 This Week
  • 23
    Regain is a Java search engine based on Jakarta Lucene. It indexes and searches files in many formats (HTML, XML, doc(x), xls(x), ppt(x), oo, PDF, RTF, mp3, mp4, Java). A tag library makes it easier to integrate search results into your JSP-based web pages.
    Downloads: 8 This Week
  • 24
    MovieGrabber

    MovieGrabber is a fully automated way of downloading

    MovieGrabber has now moved to GitHub! https://github.com/binhex/moviegrabber
    Downloads: 0 This Week
  • 25
    mindCMS

    Small, fast and flexible Content Management System for PHP / MySQL

    A very small, fast, compact, and flexible Content Management System (CMS) for PHP web servers with MySQL, offering a reasonable set of functions. Easily maintain your web pages and online files in any web browser.
    Downloads: 1 This Week