Download Latest Version v1.0.0-Beta2_ On our way to 1.0! source code.tar.gz (81.8 MB)
Email in envelope

Get an email when there's a new version of Kotlin Dataframe

Home / v0.15.0
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2024-12-09 9.5 kB
v0.15_ Experimental new CSV-, and Geographic integrations and many other fixes source code.tar.gz 2024-12-09 76.8 MB
v0.15_ Experimental new CSV-, and Geographic integrations and many other fixes source code.zip 2024-12-09 78.1 MB
Totals: 3 Items   154.9 MB 0

This release contains several new features, tons of fixes and two new exciting experimental new integrations:

  • Experimental new CSV parser based on Deephaven-CSV. See below for more information.
  • Experimental new GeoDataFrame class for working with geographical data (from GeoJson/Shapefile) and plotting it with Kandy. See below for more information.
  • Full BigInteger support: Just like we support the BigDecimal numbers, DataFrame now also supports BigInteger in parsing, converting, statistics, column arithmetics, etc.
  • Custom SQL DataBase registration
  • Improved parsing: Parsing and converting String columns to other types is now faster. We also introduce the new experimental ParserOptions.useFastDoubleParser setting which uses FastDoubleParser for faster and more flexible Double parsing.
  • We continue improving our Compiler Plugin with every release. See below for more information.
  • See this notebook for some more information about the changes.

New Experimental CSV integration

DataFrame's CSV parsing has been based on Apache Commons CSV from the beginning. While this has been sufficient for most applications, it had some issues like running out of memory, performance, and our API lacking in clarity, documentation, and completeness.

For DataFrame 0.15, we introduce a new separate package org.jetbrains.kotlinx:dataframe-csv which tries to solve all these issues at once. It's based on Deephaven-CSV which makes it faster and more memory efficient. And since we built it from the ground up, we made sure the API was complete, predictable, and documented carefully.

To try it yourself, explicitly add the dependency org.jetbrains.kotlinx:dataframe-csv to your project. In notebooks you can add enableExperimentalCsv=true to the %use-magic, like %use dataframe(enableExperimentalCsv=true). Use the new DataFrame.readCsv()/DataFrame.readTsv()/DataFrame.readDelim() functions over the old DataFrame.readCSV() ones.

We happily await your feedback!

New Experimental Geo integration

Kandy v0.8 introduces geo-plotting which allows you to visualize geospatial/geographical data using the awesome Kandy DSL. To make working with this geographical data (from GeoJson/Shapefile) easier, we happily accepted the GeoDataFrame PR from the Kandy team.

To try it yourself, explicitly add the dependency org.jetbrains.kotlinx:dataframe-geo to your project or notebook (with the repository maven("https://repo.osgeo.org/repository/release")) and use GeoDataFrame.readGeoJson() or GeoDataFrame.readShapeFile() to get started!

Features

Compiler Plugin

Fixes

Docs and Examples

New Contributors

Full Changelog: https://github.com/Kotlin/dataframe/compare/v0.14.2...v0.15.0

Source: README.md, updated 2024-12-09