Showing 30 open source projects for "big file split java"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    PDFsam

    PDFsam

    PDFsam, a desktop application to split, merge, mix, rotate PDF files

    PDFsam Basic is our free and open-source desktop application to split, merge, extract pages, rotate and mix PDF files. PDFsam Visual is a powerful tool to visually compose PDF files, reorder pages, delete pages, split, merge, rotate, encrypt, decrypt, extract text, convert to grayscale, crop PDF files. PDFsam Basic is written using JavaFX. Since version 4 it is released as a self-contained application and bundles a jlinked JDK while version 3 requires a Java Runtime Environment 8 with JavaFx...
    Downloads: 138 This Week
    Last Update:
    See Project
  • 2
    Stirling-PDF

    Stirling-PDF

    Web application that allows you to perform operations on PDF files

    Stirling PDF is a powerful, locally hosted web-based PDF manipulation tool offering a wide range of editing, conversion, and utility features. It allows users to merge, split, compress, convert, OCR, and perform other operations on PDF files directly from a browser without uploading data to third-party servers. The tool is privacy-conscious, self-hostable via Docker, and built with modularity in mind to allow future expansion and integration.
    Downloads: 40 This Week
    Last Update:
    See Project
  • 3
    Apache HBase

    Apache HBase

    Get random, realtime read/write access to your Big Data

    Use Apache HBase™ when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables, billions of rows X millions of columns, atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable. A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, Apache...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    Apache Hudi

    Apache Hudi

    Upserts, Deletes And Incremental Processing on Big Data

    Apache Hudi (pronounced Hoodie) stands for Hadoop Upserts Deletes and Incrementals. Hudi manages the storage of large analytical datasets on DFS (Cloud stores, HDFS or any Hadoop FileSystem compatible storage). Apache Hudi is a transactional data lake platform that brings database and data warehouse capabilities to the data lake. Hudi reimagines slow old-school batch data processing with a powerful new incremental processing framework for low latency minute-level analytics. Hudi provides...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    ElasticJob

    ElasticJob

    Distributed scheduled job framework

    ElasticJob is a distributed scheduling solution consisting of two separate projects, ElasticJob-Lite and ElasticJob-Cloud. ElasticJob-Lite is a lightweight, decentralized solution that provides distributed task sharding services. ElasticJob-Cloud uses Mesos to manage and isolate resources. It uses a unified job API for each project. Developers only need code one time and can deploy at will. Support job sharding and high availability in distributed system. Scale out for throughput and...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    Parkiet

    Parkiet

    Parquet format file GUI editor

    Parquet file viewer and editor written in Java and SWT. It uses Apache Avro library for reading and writing edited parquet files. Only Parquet files with simple data type columns are supported.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Stirling-PDF

    Stirling-PDF

    #1 Locally hosted web application that allows you to work on PDFs

    This is a robust, locally hosted web-based PDF manipulation tool using Docker. It enables you to carry out various operations on PDF files, including splitting, merging, converting, reorganizing, adding images, rotating, compressing, and more. This locally hosted web application has evolved to encompass a comprehensive set of features, addressing all your PDF requirements. Stirling PDF does not initiate any outbound calls for record-keeping or tracking purposes. All files and PDFs...
    Leader badge
    Downloads: 137 This Week
    Last Update:
    See Project
  • 8
    Floreant POS

    Floreant POS

    Opensource Enterprise Grade Restaurant POS

    Enterprise Grade Point of Sale application for QSR, Casual Dine-In, Fine Dine-In, Cafe and Retail. Established in 2008 Floreant POS has powered 30,000 restaurants in 25 countries around the world. It can run on single or multiple terminals and manage new order, visualize tables, manage kitchen and produce advanced reports. Floreant POS is initiated and maintained by OROCUBE LLC, USA (https://orocube.com) and released under MRPL 1.2, a variant of Mozilla Public License 1.1. ...
    Leader badge
    Downloads: 110 This Week
    Last Update:
    See Project
  • 9
    Yaoqiang BPMN Editor

    Yaoqiang BPMN Editor

    an Open Source BPMN 2.0 / DMN 1.1 Modeler

    Yaoqiang BPMN Editor is a graphical editor for business process diagrams, compliant with OMG specifications (BPMN 2.0 / DMN 1.1).
    Leader badge
    Downloads: 45 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 10
    MyBox

    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contain packages need not java env nor installation. Jar packages need Java 16 or higher.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    MoneyCSV

    MoneyCSV

    Personal finance program based on CSV text files (no closed database)

    This personal finance program allows you track all personal accounts including investment accounts. Money CSV provides reoccurring bills, budget tracking, report generation, net worth tracking and custom transaction import rules. The program supports importing of downloaded transaction files (OFX, QFX and QIF). Money CSV supports stock price updates if you get a key for the Yahoo Finance (YH Finance) API from RapidAPI.com. The intro level API key is free, and you get 500 downloads per month...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    Advanced Trigonometry Calculator

    Advanced Trigonometry Calculator

    Precision Trigonometry: Advanced Calculator for Complex Math

    Advanced Trigonometry Calculator is equipped with a user-friendly interface that allows for easy input of problems and instant computation. Professionals such as engineers who need to perform advanced trigonometric calculations in their work will find this tool extremely useful. ATC Online Alpha: https://advantrigoncalc.sourceforge.io/atc/ More info by clicking below: https://advantrigoncalc.sourceforge.io/ Advanced Trigonometry Calculator was only and always only developed by...
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    SmartPOS /Point of Sale Web with ERP

    SmartPOS /Point of Sale Web with ERP

    SmartPOS (Includes->SmartERP distro of Idempiere ERP) is 100% Web

    SmartPOS 7.1 is a complete ERP+POS (Idempiere 7.1 Distro) , taking all the power of an ERP, but creating an intuitive , agile and easy to learn Point of Sale (POS) 100% Web . SmartPOS has been designed to work as part of the ERP, and support complex business environments, where you can manage different products per organization, different organizations per client, different warehouses per organization, and for each warehouse private config for your POS, price lists, taxes,...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 15

    HSRA

    Hadoop spliced read aligner for RNA-seq data

    HSRA is a MapReduce-based parallel tool for mapping reads from RNA sequencing (RNA-seq) experiments. RNA-seq analyses typically begin by mapping reads to a reference genome in order to determine the location from which the reads were originated, which is a very time-consuming step. This tool allows bioinformatics researchers to efficiently distribute their mapping tasks over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, which is a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    apache spark data pipeline osDQ

    apache spark data pipeline osDQ

    osDQ dedicated to create apache spark based data pipeline using JSON

    This is an offshoot project of open source data quality (osDQ) project https://sourceforge.net/projects/dataquality/ This sub project will create apache spark based data pipeline where JSON based metadata (file) will be used to run data processing , data pipeline , data quality and data preparation and data modeling features for big data. This uses java API of apache spark. It can run in local mode also. Get json example at https://github.com/arrahtech/osdq-spark How to run Unzip the zip file Windows : java -cp .\lib\*;osdq-spark-0.0.1.jar org.arrah.framework.spark.run.TransformRunner -c ....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    nervalreports

    nervalreports

    A lightweight report creation Java library

    Nerval Reports is a lightweight report creation library, focused on minimal computational costs. Ideally, report creation should iterate only once through its data and minimize memory allocation and processor's use, but also restrict its dependencies only to what your specific use needs. Instead of the highly expensive way of well-known engines like Jasper Reports, where performance and data reiteration is a big and set aside problem (and also is the report design as a non-programming task),...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    ...DSTK is written in C#, Java and Python to interface with R, NLTK, and Weka. It can be expanded with plugins using R Scripts. We have also created plugins for more statistical functions, and Big Data Analytics with Microsoft Azure HDInsights (Spark Server) with Livy. License: R, RStudio, NLTK, SciPy, SKLearn, MatPlotLib, Weka, ... each has their own licenses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Hypersocket VPN

    Hypersocket VPN

    A simple yet powerful open source VPN solution

    Hypersocket is a streamlined VPN solution built for all types of remote access scenarios. Powerful yet simple to use, Hypersocket uses a single installed client to provide access to any number of network resources. Its powerful granular access means you can have fine grained control over your entire network and remote users. Accessed through a secure HTTPS port, the Hypersocket client provides access to any number of host/port combinations all effortlessly assigned server-side by the...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Universal Java Matrix Package

    Universal Java Matrix Package

    sparse and dense matrix, linear algebra, visualization, big data

    The Universal Java Matrix Package (UJMP) is an open source Java library which provides sparse and dense matrix classes, as well as a large number of calculations for linear algebra such as matrix multiplication or matrix inverse. Operations such as mean, correlation, standard deviation, replacement of missing values or the calculation of mutual information are supported, too. The Universal Java Matrix Package provides various visualization methods, import and export filters for a large...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Flamingo Project

    Flamingo Project

    Workflow Designer, Hive Editor, Pig Editor, File System Browser

    Flamingo is a open-source Big Data Platform that combine a Ajax Rich Web Interface + Workflow Engine + Workflow Designer + MapReduce + Hive Editor + Pig Editor. 1. Easy Tool for big data 2. Use comfortable in Hadoop EcoSystem projects 3. Based GPL V3 License Supporting Pig IDE, Hive IDE, HDFS Browser, Scheduler, Hadoop Job Monitoring, Workflow Engine, Workflow Designer, MapReduce.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    odt2braille
    odt2braille is a Braille extension to OpenOffice.org Writer. odt2braille enables authors to print documents to a Braille embosser and to export documents as Braille files. The Braille output is well-formatted and highly customizable.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 23
    JPDF Tools
    JPDF Tools is a GUI java program built on the JPDF Export library. Its main aim is to create pdf files by inserting texts, images or tables. Users can also merge PDF files, split PDF files, merge images into PDF files and soon convert from and to PDF files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Sheetster Web Spreadsheet

    Sheetster Web Spreadsheet

    an embeddable spreadsheet for your web application

    Sheetster is a web-based spreadsheet editor and viewer. It can import and export the XLS and XLSX file formats and nearly the full range of Excel formula functions are supported. Most basic features are implemented as well as some of the more advanced ones. The user interface is written entirely in JavaScript and its server-side API is pure Java based on the OpenXLS spreadsheet engine. Neither the client nor the server needs Excel or any browser plugins. Sheetster is primarily intended...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    JPDF Export
    JPDF Export is a java library built on the famous iText library. It provides simple functions that can be used to build complex pdf files. It also provides simple classes to merge, split and convert pdf files
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB