Opentelemetry-specification: TraceID & SpanID are specified as "byte arrays" but retrieval is unspecified

Created on 24 Jul 2020 · 9Comments · Source: open-telemetry/opentelemetry-specification

There is an inconsistency in how opentelemetry-js and opentelemetry-ruby, opentelemetry-go, and so forth handle the TraceID and SpanID fields on Span.

Most of OTel's language implementations (e.g. ruby, go, python) return a byte array (or integral representation), others (like js) return a hex stringified representation of the byte array. My understanding is that the hex string representation should be used for serialization/deserialization but not for the internal representation returned by Span.TraceID()

api p1 required-for-ga trace

Source

lizthegrey

Most helpful comment

I don't think we should specify this. In different languages, different things may be more efficient.

Oberon00 on 28 Jul 2020

👍2

All 9 comments

A third alternative (currently used in the rust implementation) is a new opaque type that doesn't couple the ids to their underlying data representation.

jtescher on 24 Jul 2020

This was brought up earlier by @flarna for JS as a performance optimization. After some benchmarks showed only a very minor improvement, the effort was dropped due to lack of support and the effort required. https://github.com/open-telemetry/opentelemetry-js/issues/698

One complicating factor in JS is that the Buffer class typically used for "byte arrays" in JS is not available in browsers.

dyladan on 24 Jul 2020

In Java we're exploring switching from opaque types to strings since it's very common for the strings to be needed anyways so that keeps things a little simpler.

https://github.com/open-telemetry/opentelemetry-java/pull/1374

anuraaga on 25 Jul 2020

from the spec sig mtg, triaged this as P1, assigning initially to @lizthegrey since it sounds like she's working on this in golang sig

andrewhsu on 28 Jul 2020

I don't think we should specify this. In different languages, different things may be more efficient.

Oberon00 on 28 Jul 2020

👍2

I think if the goal is purely efficiency, leaving unspecified makes sense. I'd consider whether there is a UX advantage to specifying retrieval as strings. I think in practice, users see IDs as strings in their trace console, logs, etc. So a user calling something like getTraceId() returning a String seems very intuitive.

anuraaga on 29 Jul 2020

I think we can recommend having a convenience getter with a string representation, regardless of how the IDs are stored internally. But languages should document if that one is doing an "expensive" conversion or not, and offer cheaper alternatives if applicable.

Oberon00 on 29 Jul 2020

👍1