Insufficient rules on custom information to allow data merging #137

Ichoran · 2017-03-09T22:54:24Z

We don't have a precise specification for custom data. This is a problem if we want to be able to merge and split time series--what do you do with custom data?

I propose we include the following specification for custom fields in the data section:

If the data is arrayed, each value for a custom key must be either an array whose length equals the number of timepoints, or a single value that is assumed to apply to every timepoint.
If the data is not arrayed, there are no restrictions on values.
When data is split by time, the custom values that are arrays are split at the same indices.
When data is merged, arrays are concatenated. Constant values are kept as a single constant if they are the same, and are expanded to one value per timepoint if they differ. If a key is present for some timepoints and not others, the missing timepoints are filled in with JSON null.

This way the custom JSON data behaves the same way as the time series numeric data. (In particular, like the origin data where you can set a single origin for an arrayed time series.)
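To make the proposed rules concrete, here is a minimal Python sketch of splitting and merging the custom values of arrayed records. It is not taken from any existing reader: the function names (`split_custom`, `merge_custom`, and the helpers) are hypothetical, JSON null is represented as Python `None`, and two behaviours are my own assumptions rather than part of the proposal: constants are copied to both halves on a split, and a key that is an array in one record but a constant in the other is expanded to one value per timepoint.

```python
from typing import Any, Dict, List, Tuple

Custom = Dict[str, Any]


def is_per_timepoint(value: Any, n: int) -> bool:
    # Treat a list whose length equals the number of timepoints as per-timepoint data.
    # Note: a constant that happens to be a list of length n is ambiguous here.
    return isinstance(value, list) and len(value) == n


def expand(custom: Custom, key: str, n: int) -> List[Any]:
    # Return one value per timepoint; missing keys become None (JSON null).
    if key not in custom:
        return [None] * n
    value = custom[key]
    return list(value) if is_per_timepoint(value, n) else [value] * n


def split_custom(custom: Custom, n: int, index: int) -> Tuple[Custom, Custom]:
    # Split arrays at the same index as the time series.
    # Assumption: constants are copied unchanged to both halves.
    left: Custom = {}
    right: Custom = {}
    for key, value in custom.items():
        if is_per_timepoint(value, n):
            left[key], right[key] = value[:index], value[index:]
        else:
            left[key] = right[key] = value
    return left, right


def merge_custom(a: Custom, n_a: int, b: Custom, n_b: int) -> Custom:
    # Merge custom values of two records with n_a and n_b timepoints.
    merged: Custom = {}
    keys = list(a) + [k for k in b if k not in a]
    for key in keys:
        both_constant = (
            key in a and key in b
            and not is_per_timepoint(a[key], n_a)
            and not is_per_timepoint(b[key], n_b)
        )
        if both_constant and a[key] == b[key]:
            merged[key] = a[key]  # identical constants stay a single constant
        else:
            # Arrays are concatenated; differing or missing values are expanded.
            merged[key] = expand(a, key, n_a) + expand(b, key, n_b)
    return merged


first = {"temp": [20, 21], "strain": "N2", "food": "OP50"}
second = {"temp": [22], "strain": "N2", "food": "none", "note": "x"}
merged = merge_custom(first, 2, second, 1)
halves = split_custom({"temp": [20, 21, 22], "strain": "N2"}, 3, 1)
```

In this example `temp` is concatenated, `strain` stays a single constant, `food` is expanded because the constants differ, and `note` is padded with `None` for the timepoints where it was absent.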

Ichoran · 2017-03-16T23:24:20Z

I have tried to write something along these lines in the documentation in #146 but it's not yet implemented by readers.

MichaelCurrie · 2017-03-20T15:35:53Z

I really like this idea. For now the Python parser just drops any custom fields as soon as the file is read, leaving it to other more specialized readers to handle the custom fields.

Being able to merge them in a way that makes sense would make the readers more useful for labs, and would mean they wouldn't have to specialize the readers at all; they could just deal with the custom fields they are interested in once the object is in memory.

So now all that's left is implementing it!

Ichoran · 2017-06-05T23:16:10Z

This is fully implemented in Scala (save for bugs) in #152.

MichaelCurrie added this to the python_1.2.0 milestone Mar 20, 2017
