SplitDataset

A Julia package for reading and writing datasets that are split blockwise across multiple HDF5 files. It is the Julia analogue of SplitDataset in Python.

It provides an H5SplitDataset type that can be treated as a DiskArray.

To load an existing dataset, use the constructor that takes a folder:

H5SplitDataset(folder::String) # opens a SplitDataset from a folder containing h5 and json files

To write a whole array to disk, use the following constructor:

H5SplitDataset(
    folder,
    a::AbstractArray{T,N},
    block_size::NTuple{N,Int},
)
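
To make the role of block_size concrete, here is a minimal sketch in plain Julia of how an array might be partitioned into blocks. The helper block_ranges is hypothetical and not part of this package, and the treatment of a shorter final block along each dimension is an assumption, since the on-disk layout is not described here.

```julia
# Hypothetical helper, not part of SplitDataset: list the index ranges of
# every block when an array of size `dims` is cut into `block_size` tiles.
function block_ranges(dims::NTuple{N,Int}, block_size::NTuple{N,Int}) where {N}
    # Per-dimension ranges; the last block along a dimension may be shorter
    # (an assumption, the package may handle edges differently).
    per_dim = ntuple(N) do d
        [s:min(s + block_size[d] - 1, dims[d]) for s in 1:block_size[d]:dims[d]]
    end
    # The Cartesian product of the per-dimension ranges gives one tuple per block.
    return vec(collect(Iterators.product(per_dim...)))
end

blocks = block_ranges((100, 60), (50, 20))
println(length(blocks))   # 6
println(first(blocks))    # (1:50, 1:20)
```

In this sketch a 100×60 array with a block size of (50, 20) yields 2 × 3 = 6 blocks, each of which would presumably correspond to one HDF5 file.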

Writing parts of arrays that are not block-aligned is currently not supported. This package is an interim solution until Zarr supports all the features we need.
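
As an illustration of what block-aligned means, the following sketch checks whether a set of index ranges starts and ends on block boundaries. The function is_block_aligned is a hypothetical helper and not part of this package, and the rule it applies (1-based blocks, with the end of the array counting as a boundary) is an assumption about how alignment would be defined.

```julia
# Hypothetical helper, not part of SplitDataset: true when every range starts
# on a block boundary and ends on one (or at the end of the array).
function is_block_aligned(
    ranges::NTuple{N,UnitRange{Int}},
    dims::NTuple{N,Int},
    block_size::NTuple{N,Int},
) where {N}
    all(1:N) do d
        r, b = ranges[d], block_size[d]
        starts_on_boundary = (first(r) - 1) % b == 0
        ends_on_boundary = last(r) % b == 0 || last(r) == dims[d]
        starts_on_boundary && ends_on_boundary
    end
end

println(is_block_aligned((1:50, 21:40), (100, 60), (50, 20)))  # true
println(is_block_aligned((1:50, 10:40), (100, 60), (50, 20)))  # false
```

Under this definition, a selection such as (1:50, 10:40) cuts through the first block along the second dimension, so it would fall under the unsupported case.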