Author: | The Blosc development team |
---|---|
Contact: | [email protected] |
Github: | https://github.com/Blosc/python-blosc2 |
Actions: | |
PyPi: | |
NumFOCUS: | |
Code of Conduct: |
Python-Blosc2 is a high-performance compressed ndarray library with a flexible compute engine. It uses the C-Blosc2 library as the compression backend. C-Blosc2 is the next generation of Blosc, an award-winning library that has been around for more than a decade, and that is been used by many projects, including PyTables or Zarr.
Python-Blosc2 is Python wrapper that exposes the C-Blosc2 API, plus an integrated compute engine. This allows to perform complex calculations on compressed data in a way that operands do not need to be in-memory, but can be stored on disk or on the network. This makes possible to work with data no matter how large it is, and that can be stored in a distributed fashion.
Most importantly, Python-Blosc2 uses the C-Blosc2 simple and open format for storing compressed data, making it easy to integrate with other systems and tools.
You can find more introductory info about Python-Blosc2 at:
https://www.blosc.org/python-blosc2/getting_started/overview.html
Blosc2 now provides Python wheels for the major OS (Win, Mac and Linux) and platforms.
You can install the binary packages from PyPi using pip
:
pip install blosc2 --upgrade
For conda users, you can install the package from the conda-forge channel:
conda install -c conda-forge python-blosc2
The documentation is available here:
https://blosc.org/python-blosc2/python-blosc2.html
Additionally, you can find some examples at:
https://github.com/Blosc/python-blosc2/tree/main/examples
Finally, we taught a tutorial at the PyData Global 2024 that you can find at: https://github.com/Blosc/Python-Blosc2-3.0-tutorial. There you will find different Jupyter notebook that explains the main features of Python-Blosc2.
This software is licensed under a 3-Clause BSD license. A copy of the python-blosc2 license can be found in LICENSE.txt.
Discussion about this package is welcome at:
https://github.com/Blosc/python-blosc2/discussions
Stay informed about the latest developments by following us in Mastodon, Bluesky or LinkedIn.
Blosc2 is supported by the NumFOCUS foundation, the LEAPS-INNOV project and ironArray SLU, among many other donors. This allowed the following people have contributed in an important way to the core development of the Blosc2 library:
- Francesc Alted
- Marta Iborra
- Aleix Alcacer
- Oscar Guiñón
- Juan David Ibáñez
- Ivan Vilata i Balaguer
- Oumaima Ech.Chdig
In addition, other people have participated to the project in different aspects:
- Jan Sellner, contributed the mmap support for NDArray/SChunk objects.
- Dimitri Papadopoulos, contributed a large bunch of improvements to the in many aspects of the project. His attention to detail is remarkable.
- And many others that have contributed with bug reports, suggestions and improvements.
You can cite our work on the various libraries under the Blosc umbrella as follows:
@ONLINE{blosc,
author = {{Blosc Development Team}},
title = "{A fast, compressed and persistent data store library}",
year = {2009-2025},
note = {https://blosc.org}
}
If you find Blosc useful and want to support its development, please consider making a donation via the NumFOCUS organization, which is a non-profit that supports many open-source projects. Thank you!
Compress Better, Compute Bigger