Showing 1485 open source projects for "data"

  • 1
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text processing technologies to easily extract useful data from arbitrary web pages.
    Downloads: 8 This Week
  • 2
    A Simple CSV Parser for Java under a commercial-friendly Apache 2.0 license
    Downloads: 90 This Week
  • 3
    PDF Split and Merge

    Split and merge PDF files on any platform

    Split and merge PDF files with PDFsam, an easy-to-use desktop tool with graphical, command line and web interface.
    Downloads: 192 This Week
  • 4
    Game Extractor

    Open and edit the archive files used in over 5200 games.

    Reads and writes archives used in many popular games. Great for mod development and game translations. Supports over 5200 games, with more games added through small plugins. Also shows file previews and converts them to different formats.
    Downloads: 346 This Week
  • 5
    CsvJdbc - CSV file JDBC driver
    A read-only JDBC driver for Java that uses Comma Separated Value (CSV) files as database tables. Ideal for writing data import programs and conversion programs.
    Downloads: 34 This Week
  • 6

    RecordEditor

    Editor for fixed-width, CSV, and existing XML files.

    The RecordEditor is a data file editor for flat files (delimited and fixed field position). It supports Unix, PC, and legacy (e.g. mainframe) file formats, both text and binary. The editor uses a record-layout description to format the files, which makes it well suited to fixed-width (text or binary) files, COBOL data files, mainframe files, and complicated CSV files.
    Downloads: 40 This Week
  • 7
    Google2SRT

    Download, save and convert multiple subtitles from YouTube videos

    Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type the video's URL, and Google2SRT will do the rest.
    Downloads: 50 This Week
  • 8

    JRecord

    Read Cobol data files in Java

    Provides Java record-based I/O routines for fixed-width (including Text, Mainframe, Cobol and Binary) and delimited flat files via a Record Layout (Cobol, CSV or XML). The source is now available at https://github.com/bmTas/JRecord. Projects using JRecord include: https://github.com/thospfuller/rcoboldi (Cobol files in R), https://github.com/tmalaska/CopybookInputFormat (Cobol files in Hadoop), https://github.com/gss2002/copybook_formatter, https://github.com/gss2002/ftp2hdfs...
    Downloads: 19 This Week
  • 9
    Systems Biology Markup Language (SBML)

    A file format for exchanging computational models in systems biology

    The Systems Biology Markup Language (SBML) is an XML-based description language for representing computational models in systems biology. Visit the project web site to learn more.
    Downloads: 19 This Week
  • 10
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Downloads: 20 This Week
  • 11
    Grassroots DICOM

    Cross-platform DICOM implementation

    Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. It supports RAW, JPEG, JPEG 2000, JPEG-LS, RLE and deflated transfer syntax. It comes with a super fast scanner implementation to quickly scan hundreds of DICOM files. It supports SCU network operations (C-ECHO, C-FIND, C-STORE, C-MOVE). PS 3.3 & 3.6 are distributed as XML files. It also provides PS 3.15 certificates and a password-based mechanism to anonymize and de-identify DICOM datasets.
    Downloads: 110 This Week
  • 12

    MPXJ

    Multi platform library to read and write schedule data

    MPXJ is an open source file handling library for Java, .Net and Ruby to read and write Microsoft Project MPX and MSPDI XML files, Planner files, Primavera XER files and databases, Asta Powerproject files and databases, Asta Easyplan files, Phoenix Project Manager files, FastTrack FTS files, GanttProject GAN files, TurboProject PEP files, ConceptDraw PROJECT CDPX, CPDZ and CPDTZ files, Synchro Scheduler SP files and MPP, MPT, and MPD files for Project 98, 2000, 2002, 2003, 2007, 2010, 2013,...
    Downloads: 15 This Week
  • 13

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite that converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without loss of information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. ...
    Downloads: 3 This Week
  • 14
    x3d

    X3D is the open-standard format for 3D graphics scenes on the Web.

    Extensible 3D (X3D) Graphics is a royalty-free International Standard for real-time interactive 3D graphics on the Web, providing unsurpassed interoperability for 3D communications on the Web. This project includes source for example X3D scene libraries and multiple X3D codebases produced by Web3D Consortium members. All open-source contributions are welcome.
    Downloads: 22 This Week
  • 15

    jgraphml

    A Java library to write and read graphml diagrams

    A Java library to write and read graphml diagrams. This library is compatible with the popular yEd tool.
    Downloads: 0 This Week
  • 16
    jPicEdt

    Another drawing editor for LaTeX with PSTricks & TikZ

    jPicEdt is an extensible, internationalized, vector-based drawing editor for LaTeX and related packages (TikZ, PSTricks,...), written in Java. It is also a library of reusable high-level graphic primitives.
    Downloads: 6 This Week
  • 17
    ant4docbook

    ANT4DOCBOOK is an ANT task for DOCBOOK

    ANT4DOCBOOK is an ANT task for DOCBOOK, a semantic markup language for technical documentation.
    Downloads: 0 This Week
  • 18
    MRCube

    Meta-Model Management based on RDFs Revision Reflection

    MRCube is a graphical editing tool for RDF-based content, developed for managing the relationship between RDF and RDFS content.
    Downloads: 0 This Week
  • 19

    toPDF

    Online service for PDF conversion (to PDF)

    A simple online service for PDF conversion. This project is a simple library and also a web application. It offers a REST service and a simple upload service for synchronous conversion. This library/application doesn't contain conversion libraries because it's a wrapper for existing tools. toPDF currently supports the open source tool PDF Creator (http://www.pdfforge.org) and the commercial solution, easy PDF, from BCL (http://www.pdfonline.com/easypdf/sdk/).
    Downloads: 0 This Week
  • 20
    Library of Java code to interact with the data-recording formats (RPFITS and PSRFITS) used at the Australia Telescope National Facility (ATNF). It also provides tools for extracting metadata from the files for supply to Virtual Observatory instances.
    Downloads: 0 This Week
  • 21
    PdfInspector1

    Application to inspect text and images of PDF books.

    This application lets you open and browse PDF books, inspect character codes, and locate images. It may serve as an example of basic PDFBox usage. Compatible with JDK 17.
    Downloads: 0 This Week
  • 22

    EasyML

    EasyML serialization library, to and from XML, similar to Gson

    EasyML converts Java objects into XML and back again, without the need for annotations or other types of configuration. EasyML offers extensive support for JDK classes and also supports customization through user settings, user extensions, or through the Java Serialization API. EasyML provides support for: - reading and writing XML text and/or org.w3c.dom documents - Java Collections framework - Java Serialization framework - Multi-threading - Java Generics and functional...
    Downloads: 0 This Week
  • 23

    xmlj

    XMLJ is a Java XML Editor and validator project.
    Downloads: 0 This Week
  • 24
    A free Java library and extensible framework for mapping, parsing and unmarshalling CSV file contents into Plain Old Java Objects. It provides XDoclet support for easy and painless mapping of CSV records to Java objects. The library requires J2SE 1.5.
    Downloads: 0 This Week
  • 25
    Prowide Core

    Java framework for SWIFT [tm] MT (FIN) messaging

    Prowide Core (previously known as WIFE) is an open source Java framework for managing SWIFT MT messages. The library's key features are a comprehensive Java model for all MT message types, along with a straightforward parsing and building API. The project has been active since 2006, is production ready, and is commercially supported. For detailed information on the support package options, please check: https://www.prowidesoftware.com/services SWIFT is a trademark of S.W.I.F.T. SCRL....
    Downloads: 2 This Week