-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Define a data logging/ingestion format and spec #41
Comments
One thing that came to mind is why use the |
@bruno-f-cruz This makes it easier when searching for chunks of the same device across epoch folders, as what happens in the Aeon data formats. I want to keep pushing for this, as I think it is an important use case to keep compatibility for, even though it may not be used in 90% of cases. |
I guess my question is whether it should be part of the spec or not. From the Python interface point of view it doesn't appear to add much. I wonder if we can find a way that the interface works as long as the pattern is '*_' or if there is an advantage of introducing this dependency and locking the spec to it. To be clear: I am not against folding it in, just wonder if we really need to add it! |
@bruno-f-cruz Picking the outstanding issues from this spec:
Do we still need this now that
If we agree changing the spec to use only register numbers then it should be fully compatible.
If we adopt the proposal in harp-tech/device.behavior#21 then we will have a trivial way to store the metadata at acquisition time. Proposed solutions
I think this last default is fine. There are questions of compatibility for projects like Aeon who want to go for multi-chunking of data and have possibly slightly different naming conventions for file layouts. This is fine because the standard folder structure is optional, i.e. it is always possible for projects and APIs to pass the data file path directly, so I don't think we necessarily need to worry too much about this as long as there is a reasonable way forward. Assuming there is nothing else missing I think we are close to having a complete proposal for the data logging spec format that we could port into the issue description above, and discuss in the next Harp club meeting. |
Summary
One of the goals of the harp-ecossytem is to define data format and specifications to allow users to log their data in a stable and shareable format.
Current Implementations
At the Allen
The current implementation at the Allen follows the following pattern: https://allenneuraldynamics.github.io/Bonsai.AllenNeuralDynamics/articles/core-logging.html#harp-data
Essentially, all messages from a single device and GroupedBy Register and save in their respective binary file. The name of the binary file current follows the convention <DeviceName__RegisterName.bin>.
e.g.:
This has a few problems:
Possible solutions
-
<UserGivenName>.harp / <DeviceName>_<RegisterNumber>.bin
The text was updated successfully, but these errors were encountered: