Proposal for timestamp type #209

frsyuki · 2015-12-22T07:17:30Z

Follows up #130 and #207.
See #207 for discussion.

ludocode · 2015-12-22T07:29:13Z

spec.md

+* Timestamp 32 format can represent a timestamp in [1970-01-01 00:00:00 UTC, 2106-02-07 06:28:16 UTC) range. Nanoseconds part is 0.
+* Timestamp 64 format can represent a timestamp in [1970-01-01 00:00:00.000000000 UTC, 2514-05-30 01:53:04.000000000 UTC) range.
+* Timestamp 96 format can represent a timestamp in [-584554047284-02-23 16:59:44 UTC, 584554051223-11-09 07:00:16.000000000 UTC) range.
+* In timestamp 64 and timestamp 96 formats, nanoseconds must not be larger than 999999999.
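As a rough check of the exclusive upper bounds quoted in the diff, the sketch below adds 2^32 and 2^34 seconds to the Unix epoch. The helper name `exclusive_upper_bound` is made up for illustration. The timestamp 96 range is left out because it falls outside what Python's `datetime` can represent.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)

def exclusive_upper_bound(seconds_bits: int) -> datetime:
    """First instant that no longer fits in an unsigned seconds field of this width."""
    return EPOCH + timedelta(seconds=2 ** seconds_bits)

# timestamp 32 carries a 32-bit seconds field, timestamp 64 a 34-bit one.
print(exclusive_upper_bound(32))
print(exclusive_upper_bound(34))
```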


Interesting. This line seems to be the first rule regarding "canonical" representation of data when multiple representations would otherwise be available. None of the other types have such restrictions. For example a uint16 doesn't say the value must be at least 256 (since if it's less than that, it could have been represented by a uint8.)

Not that I'm complaining. I do think being strict about representation is a good thing, and I'd love it if the spec were strict everywhere else whenever multiple representations are available. UTF-8 forbids overlong sequences after all. (Although looking at the Protocol Buffers spec, it doesn't explicitly forbid overlong varints. I'm not sure why. Maybe it's unimportant.)
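To make the point about multiple representations concrete, here is a minimal sketch with a toy decoder (`decode_uint` is a hypothetical helper, not part of any MessagePack library) that accepts three different encodings of the value 5. It also shows the contrast with UTF-8, where Python's built-in decoder rejects an overlong sequence.

```python
import struct

def decode_uint(data: bytes) -> int:
    """Decode one MessagePack unsigned integer (positive fixint, uint 8, uint 16 only)."""
    tag = data[0]
    if tag <= 0x7F:   # positive fixint: the tag byte is the value
        return tag
    if tag == 0xCC:   # uint 8
        return data[1]
    if tag == 0xCD:   # uint 16, big-endian
        return struct.unpack(">H", data[1:3])[0]
    raise ValueError(f"unsupported tag 0x{tag:02x}")

# Three valid encodings of the same value: nothing forces the shortest form.
encodings = [b"\x05", b"\xcc\x05", b"\xcd\x00\x05"]
decoded = [decode_uint(e) for e in encodings]

# UTF-8, by contrast, forbids overlong forms: 0xC0 0xAF is an overlong "/".
try:
    b"\xc0\xaf".decode("utf-8")
    overlong_rejected = False
except UnicodeDecodeError:
    overlong_rejected = True
```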

frsyuki · 2015-12-22

I added this restriction specifically because deserialization becomes unexpectedly complicated if the nanoseconds part includes a carry-over, and following this restriction doesn't make serialization code complicated.
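The trade-off described in this reply might look like the following sketch: the serializer folds any carry into the seconds once, so a deserializer only needs a range check. The function names `normalize` and `check_nanoseconds` are assumptions for illustration, not taken from the spec or any library.

```python
from typing import Tuple

NANOS_PER_SECOND = 1_000_000_000

def normalize(seconds: int, nanoseconds: int) -> Tuple[int, int]:
    """Serializer side: fold whole seconds out of the nanoseconds part."""
    carry, nanos = divmod(nanoseconds, NANOS_PER_SECOND)
    return seconds + carry, nanos

def check_nanoseconds(nanoseconds: int) -> int:
    """Deserializer side: with the restriction in place, a range check is enough."""
    if not 0 <= nanoseconds <= 999_999_999:
        raise ValueError("nanoseconds must not be larger than 999999999")
    return nanoseconds

seconds, nanos = normalize(10, 1_500_000_000)  # 1.5 s worth of nanoseconds
check_nanoseconds(nanos)
```

Because `divmod` floors, a negative nanoseconds input borrows from the seconds rather than producing a negative remainder.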

vmihailenco · 2016-02-24T15:01:36Z

This proposal looks good and I would like to implement it for https://github.com/vmihailenco/msgpack. Any chance it is going to be merged/changed soon?

tagomoris · 2016-02-24T17:23:20Z

@vmihailenco The msgpack organization doesn't have a Go implementation, and msgpack.org points to ugorji/go/codec. AFAIK, the msgpack organization doesn't want to increase the number of repositories under it.
Of course, you can implement anything in your own repository, but that's off topic for this pull request.

vmihailenco · 2016-02-25T07:29:13Z

I am asking whether this proposal is going to be merged. The reference to the repository is just proof that I am really interested in it.

tagomoris · 2016-02-25T18:05:17Z

I got it. Sorry for the misunderstanding.

tagomoris · 2016-02-25T18:08:45Z

IMO, we can merge this new spec because there are no objections...

drewnoakes · 2016-11-17T13:57:53Z

+1 for merging this. Looks solid and very useful.

It'd be useful to know the process, if any, for evolving the spec. Publishing such details would build confidence in using the MsgPack format.

tagomoris · 2016-11-17T14:02:18Z

@frsyuki ?

potterdai · 2017-04-25T00:24:04Z

Will this be merged?

methane · 2017-04-25T00:43:39Z

spec.md

+    timestamp 64 stores the number of seconds and nanoseconds that have elapsed since 1970-01-01 00:00:00 UTC
+    in 2 32-bit unsigned integers:
+    +--------+--------+--------+--------+--------+--------+--------+--------+--------+--------+
+    |  0xd7  |   -1   |nanoseconds in 30-bit unsigned int|   seconds in 34-bit unsigned int   |


This is not "2 32-bit unsigned integers".

frsyuki · 2017-04-25

Thank you! Fixed.
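The diagram in the diff above shows a 30-bit nanoseconds field and a 34-bit seconds field sharing one 64-bit big-endian word, which is why "2 32-bit unsigned integers" did not fit. A minimal sketch of that packing, with hypothetical helper names `pack_timestamp64` and `unpack_timestamp64`:

```python
import struct

SECONDS_MASK = (1 << 34) - 1  # low 34 bits hold the seconds

def pack_timestamp64(seconds: int, nanoseconds: int) -> bytes:
    """Encode as fixext 8 (0xd7), ext type -1, then nanoseconds << 34 | seconds."""
    if not 0 <= seconds <= SECONDS_MASK:
        raise ValueError("seconds do not fit in 34 bits")
    if not 0 <= nanoseconds <= 999_999_999:
        raise ValueError("nanoseconds must not be larger than 999999999")
    return struct.pack(">BbQ", 0xD7, -1, (nanoseconds << 34) | seconds)

def unpack_timestamp64(buf: bytes):
    """Return (seconds, nanoseconds) from a 10-byte timestamp 64 encoding."""
    tag, ext_type, data64 = struct.unpack(">BbQ", buf)
    if tag != 0xD7 or ext_type != -1:
        raise ValueError("not a timestamp 64")
    return data64 & SECONDS_MASK, data64 >> 34

encoded = pack_timestamp64(1, 2)
```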

tagomoris · 2017-08-10T05:56:35Z

🎉

ludocode · 2017-09-27T05:19:46Z

When de-serializing these new timestamps, does it make sense to consider an extension of type -1 with unrecognized length an error?

I'm currently implementing this in MPack and I'm trying to decide how to handle this. The spec doesn't explicitly say that such data should be considered invalid, but the pseudocode in the spec does say "error" if the length is not 4, 8 or 12. So I'm guessing raising an error is the expected behaviour, and judging by the implementors linking to this issue so far, this seems to be the consensus.
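A strict decoder along the lines of the behaviour described here might look like the sketch below. It dispatches on the payload length and raises on anything other than 4, 8 or 12. The field layouts follow the merged spec as I understand it. The name `decode_timestamp_strict` and the extra nanoseconds range check are assumptions of this sketch rather than a copy of the spec's pseudocode.

```python
import struct

def decode_timestamp_strict(payload: bytes):
    """Return (seconds, nanoseconds) for an ext type -1 payload, or raise ValueError."""
    if len(payload) == 4:
        # timestamp 32: unsigned 32-bit seconds, nanoseconds implied to be 0
        (seconds,) = struct.unpack(">I", payload)
        return seconds, 0
    if len(payload) == 8:
        # timestamp 64: upper 30 bits nanoseconds, lower 34 bits seconds
        (data64,) = struct.unpack(">Q", payload)
        seconds, nanos = data64 & 0x3FFFFFFFF, data64 >> 34
    elif len(payload) == 12:
        # timestamp 96: unsigned 32-bit nanoseconds, then signed 64-bit seconds
        nanos, seconds = struct.unpack(">Iq", payload)
    else:
        raise ValueError(f"unrecognized timestamp length {len(payload)}")
    if nanos > 999_999_999:
        raise ValueError("nanoseconds out of range")
    return seconds, nanos
```

With this approach, an extension of type -1 and length 7 that older parsers passed through untouched now fails to decode.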

It seems to me that this is a bit of a problem though because data that was previously considered valid, like say an extension of type -1 and length 7, is now considered invalid and will flag errors in the newest parsers. Although negative extension types were reserved, this still feels like a backwards-incompatible tightening of the rules. It seems to contradict the new description of extensions which is that it is the application (not the library) that gets to decide whether to treat it as opaque or reject it as invalid.

I had considered reporting an extension of type -1 as a timestamp if it's parseable (i.e. the length is 4, 8 or 12 and the nanoseconds are in bounds), and otherwise just reporting it as an opaque extension object like any other instead of raising an error. Extension functions (like accessing the raw data) would still be available for extensions of type -1 regardless of length and regardless of whether it's recognized as a timestamp. This way if someone for whatever reason was using -1, this change would not break their code or data.
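The lenient alternative considered in this paragraph could be sketched as follows. Every name here (`try_parse_timestamp`, `decode_ext`, the tuple shapes) is hypothetical and is not MPack's actual API. The idea is only that an unparseable type -1 extension falls back to an opaque extension instead of an error.

```python
import struct
from typing import Optional, Tuple

TIMESTAMP_EXT_TYPE = -1

def try_parse_timestamp(payload: bytes) -> Optional[Tuple[int, int]]:
    """Return (seconds, nanoseconds), or None if the payload is not a known timestamp."""
    if len(payload) == 4:
        return struct.unpack(">I", payload)[0], 0
    if len(payload) == 8:
        (data64,) = struct.unpack(">Q", payload)
        seconds, nanos = data64 & 0x3FFFFFFFF, data64 >> 34
    elif len(payload) == 12:
        nanos, seconds = struct.unpack(">Iq", payload)
    else:
        return None  # unknown length: not an error, just not a timestamp
    return (seconds, nanos) if nanos <= 999_999_999 else None

def decode_ext(ext_type: int, payload: bytes):
    """Report a timestamp when parseable, otherwise an opaque extension."""
    if ext_type == TIMESTAMP_EXT_TYPE:
        parsed = try_parse_timestamp(payload)
        if parsed is not None:
            return ("timestamp", parsed)
    return ("ext", ext_type, payload)  # raw data stays accessible
```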

I suppose we can just say that nobody should have been using an extension of type -1 in the first place. So maybe it doesn't matter and I'm worrying over nothing. Still, as far as I know most libraries do not differentiate between reserved extensions and application-defined extensions (at least not through anything more than documentation.) So there's nothing stopping a user from accidentally choosing a reserved type and having it break later when libraries start rejecting their data.

For this reason I'm starting to think that maybe libraries and the spec should completely differentiate between reserved extensions and application-defined extensions. I may decide to handle reserved extensions as an entirely separate type within MPack, that way if users use them, it will be obvious that they may break in the future and they're on their own.
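One way a library might separate the two kinds of extension at the type level is sketched below. The class and function names are invented for illustration and do not describe MPack. The sketch relies only on the rule that extension types are signed 8-bit values, with negative ones reserved for predefined types.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ApplicationExt:
    """Extension with an application-defined type (0 to 127)."""
    type: int
    data: bytes

@dataclass(frozen=True)
class ReservedExt:
    """Extension with a reserved type (-128 to -1); its meaning may change in future specs."""
    type: int
    data: bytes

def wrap_ext(ext_type: int, data: bytes):
    """Classify a decoded extension so that reserved types are visibly distinct."""
    if not -128 <= ext_type <= 127:
        raise ValueError("extension type must fit in a signed 8-bit integer")
    if ext_type < 0:
        return ReservedExt(ext_type, data)
    return ApplicationExt(ext_type, data)
```

A caller that pattern-matches on `ReservedExt` is then making an explicit choice to depend on a type the spec may redefine.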

Something else to keep in mind with raising errors on unknown lengths is that this prevents us from inserting new lengths for timestamp in the future. For example lots of people wanted timezones, so I had previously suggested that we could extend this later to add one or two bytes to each length to specify it. This is actually not possible because it won't be backwards compatible. Existing libraries will report a parsing error (probably discarding the whole message) when they encounter a timestamp with the new lengths. So this means that this definition of timestamps is frozen forever: no other lengths can be added, and any change or addition will have to be introduced as a different extension type.

tagomoris · 2017-09-27T06:33:04Z

@ludocode This looks like an issue with the merged spec. Could you create another issue for it?
We can't follow discussions under closed pull requests.

Proposal for timestamp type

65c12af

ludocode reviewed Dec 22, 2015

frsyuki mentioned this pull request Dec 22, 2015

Proposal to add Timestamp predefined ext type: -1 #207

Closed

ludocode mentioned this pull request Feb 16, 2016

enum mpack_type_t only exposes only biggest int and uint type ludocode/mpack#35

Closed

vmihailenco mentioned this pull request Feb 24, 2016

decode into time.Time when interface{} provided vmihailenco/msgpack#70

Closed

frsyuki mentioned this pull request May 16, 2016

server side workflow log filtering treasure-data/digdag#93

Closed

methane reviewed Apr 25, 2017

fixed typo in timestamp type

fcfc08a

This was referenced May 11, 2017

implement time extension vmihailenco/msgpack#132

Merged

Weird issue vmihailenco/msgpack#39

Closed

potterdai mentioned this pull request May 14, 2017

Implement Timestamp extension type lexmag/msgpax#39

Merged

neuecc mentioned this pull request Jul 16, 2017

DateTime Kind is UTC but it should be unspecified MessagePack-CSharp/MessagePack-CSharp#83

Closed

fixed typo in timestamp pseudo code

f617975

frsyuki merged commit 40e3d3d into master Aug 10, 2017

frsyuki deleted the extension-timestamp branch August 10, 2017 05:42

frsyuki mentioned this pull request Aug 10, 2017

[WIP] Add add support for Timestamp type msgpack/msgpack-java#431

Closed

yfakariya mentioned this pull request Aug 11, 2017

Any plans to ship 1.0.0 anytime soon? msgpack/msgpack-cli#246

Open

kawanet mentioned this pull request Dec 5, 2017

Test cases for timestamp extension type -1 kawanet/msgpack-test-suite#1

Open

3Hren mentioned this pull request Aug 30, 2018

Relax assertion on negative extension types 3Hren/msgpack-rust#174

Closed
