Showing 90 open source projects for "linux file parser"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Ex-Crawler
    Ex-Crawler is divided into 3 subprojects (Crawler Daemon, distributed gui Client, (web) search engine) which together provide a flexible and powerful search engine supporting distributed computing. More informations: http://ex-crawler.sourceforge.net
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    With DoCASU, Alfresco users have a simplified and easy to use solution to access, search and manage documents. DoCASU is a Rich Internet Application (RIA) based on Alfresco Web Scripts and ExtJS. Find all details on: http://code.optaros.com/trac/docasu
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Provide a robust and efficient implementation of n-gram based classifiers to Java. N-Gram algorithms have shown to be surprisingly good at tasks like guessing the language/encoding from an arbitrary text file. And there are many more applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    nxs crawler is a program to crawl the internet. The program generates random ip numbers and attempts to connect to the hosts. If the host will answer, the result will be saved in a xml file. After than the crawler will disconnect... Additionally you can
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    This is an ***old archive*** of tools developed for facilitating the use of Creative Commons licenses and metadata. --- For the most up to date representation of any of the projects listed here, please see: http://creativecommons.org/project/Developer.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Fire.now is a Firefox plugin that automatically adds your documents to the WhereIsNow latest version discovery service. Everytime you upload a document somewhere, Fire.now integrates the WhereIsNow keys into the file and add it's url to WhereIsNow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Narrows search result produced by popular Internet search engines, allowing to put extra filtering conditions, as certain words presented, certain words excluded, and so on.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    A torrent search engine plugin for the Azureus/Vuze bittorrent platform.
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9
    Javen library is a framework for developing C++ application simply, with similar API to Java library. Hawk search engine is a software platform that used to build Vertical Search Product more easily for the Moderate Company or End Users.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Streamline Azure Security with Palo Alto Networks VM-Series Icon
    Streamline Azure Security with Palo Alto Networks VM-Series

    Centrally manage physical and virtualized firewalls with Panorama

    Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.
    Learn more
  • 10
    RSS EXTRACTOR is a java library for generating RSS newsfeeds considering the RSS web feeds from multiple websites. It extracts the best of newsfeed entries and a produces a RSS file which is a fusion of newsfeed entries from several websites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    A FUSE-based file system that represents Linked Open Data sets as hierarchical file systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A tool which allows you to download all erotic images and videos hosted on popular image hosting sites tagged with a given tag without clicking you through the web interface. ,,java bayimg yourtag'' is enought.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Command line application written in Java useful for automation of downloading process and filtering contents of downloaded files. jDownloader uses simple script file to configure downloading and filtering processes.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 14
    InfoCrawler allows you to crawl and index various types of documents, accessing data from various resources: Intranets, public WEB sites, local or remote file systems. For product information please see our website at http://www.infocrawler.org/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A Java library which allows to parse the latest freely available RDF files available at DMOZ (Open Directory Project) and inserts them into any JDBC compliant relational database (i.e. MySQL, PostgreSQL and others to come like Oracle, MS Access, SQLite).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Hyper Estraier is a full-text search system. It works as with Google, but based on peer-to-peer architecture. Using Hyper Estraier, we can construct a large-scaled search engine with cheap computers.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    JxtASK is a P2P system that is aimed to search, download and share academic content hosted on websites that will join the JxtASK community. Joining is simple: siteadmins must generate(even automatically)a XML catalog which describes the files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Lan search engine that indexes files on FTP and SMB servers and allows to search and browse them through web interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Google(™) meets the Matrix. Red Piranha combines Lucene (Searching Ability), XML-RDF (ability to learn), Tomcat (for P2P Power) and Spring (Ease of use) to not only let you find anything, anywhere, but to actually understand what you are looking for.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    O3-XDS stands for "Open Three (O3) - Cross Enterprise Document Sharing". O3-XDS is an Open Source and IHE compliant XDS Registry and Repository.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Sharehound is a network file systems indexer and searcher written in Java. Currently supports SMB file shares (i.e. MS Windows-based shares) and FTP resources. Web UI is used for search and crawl monitoring. RSS feed is provided for search results.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    The Informa library provides a convenient Java API for handling news channels and metadata about them. Different syntax formats (RSS 0.91, 1.0, 2.0 and Atom 0.3, 1.0) for feeds are supported. Also support for channel information descriptions (OPML) avail
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    SCAM is a development environment for building metadata stores for RDF and the Semantic Web. SCAM is built upon international technology standards and metadata standards. Such as RDF, Dublin Core, IEEE/LOM and IMS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 25
    jGetFile is a command-line scriptable recursive file downloader for the web. Where other downloaders fail, jGetFile succeeds in downloading the files you want with simplicity and ease of use.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB