
KeyValueList and log attributes with duplicate keys as undefined behavior #533

Closed
wants to merge 9 commits

Conversation

pellared
Member

@pellared pellared commented Mar 12, 2024

Towards open-telemetry/opentelemetry-specification#3931

Related to open-telemetry/opentelemetry-specification#3938

I think this is not a breaking change, as it does not change anything in the encoding. Receivers already have to handle/validate key-values with duplicate keys.

Implementing de-duplication decreases performance. More:

For applications instrumenting their code, performance may be more important than even losing log records that contain duplicates (which are very unlikely).

The OTLP exporters deduplicate key-value pairs before sending the data to satisfy receivers that require unique keys. Alternatively, the SDKs can provide a processor that deduplicates key-value pairs.

@austinlparker
Member

While this might not be an explicit breaking change, it seems like it will certainly be an implicit one. Is there no other way to address this?

Member

@dmitryax dmitryax left a comment


As @mx-psi mentioned, the OTel Collector is built on top of the key-uniqueness requirement. Any key duplication is treated as UB, and the result of processing such data is undetermined.

AFAIR the initial implementation of the proto used a map. It was then changed to the key/value list for performance reasons.

I'm strongly against this change since it can lead to further adoption of this practice.

@pellared
Member Author

pellared commented Mar 12, 2024

It was then changed to the key/value list for performance reasons.

This proposal is also motivated by performance improvements.

the OTel Collector is built on top of the key uniqueness requirement

Is this something that could be changed in the future if needed? In my opinion, it is not necessary for now.

@dmitryax
Member

This proposal is also motivated by performance improvements

I think the data consistency and deterministic processing across the OpenTelemetry project are more important than the performance degradation of one instrumentation library.

@pellared
Member Author

pellared commented Mar 12, 2024

I think the data consistency and deterministic processing across the OpenTelemetry project are more important than the performance degradation of one instrumentation library.

It is not only a performance degradation of one instrumentation library, but of a whole signal's (e.g. logs) processing pipeline.

EDIT: The issue is valid even when no duplication actually exists; the SDK still has to ensure that there are no duplicates. For instance, in the Go Logs API we pass attributes as an array (and an optional slice) to decrease the number of heap allocations. The SDK would need to use a map to remove duplicated attributes.
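To illustrate the overhead described above, here is a minimal Go sketch (the `KeyValue` type and function name are illustrative, not taken from the OTel Go SDK) showing that even just *checking* for duplicates allocates a map on every call, including the common case where the attributes are already unique:

```go
package main

import "fmt"

// KeyValue stands in for a log attribute; the type is illustrative.
type KeyValue struct {
	Key   string
	Value string
}

// hasDuplicateKeys reports whether any key occurs more than once.
// The map is allocated on every call, even when the attributes turn
// out to be unique -- this is the per-record overhead discussed above.
func hasDuplicateKeys(attrs []KeyValue) bool {
	seen := make(map[string]struct{}, len(attrs))
	for _, kv := range attrs {
		if _, dup := seen[kv.Key]; dup {
			return true
		}
		seen[kv.Key] = struct{}{}
	}
	return false
}

func main() {
	unique := []KeyValue{{"a", "1"}, {"b", "2"}}
	dup := []KeyValue{{"a", "1"}, {"a", "2"}}
	fmt.Println(hasDuplicateKeys(unique), hasDuplicateKeys(dup)) // prints: false true
}
```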

@MrAlias
Contributor

MrAlias commented Mar 12, 2024

This proposal is also motivated by performance improvements

I think the data consistency and deterministic processing across the OpenTelemetry project are more important than the performance degradation of one instrumentation library.

There are three known languages that need this currently (Go, Rust, and C++). It is unfair to say this is only about a single instrumentation library.

@MrAlias
Contributor

MrAlias commented Mar 12, 2024

It seems that if the expectation is for the sender to de-duplicate the attributes prior to sending the data, it should be achievable on the receiver side as well, right?

@MrAlias
Contributor

MrAlias commented Mar 12, 2024

cc @jmacd

@pellared
Member Author

pellared commented Mar 12, 2024

@mx-psi @TylerHelmuth @dmitryax

I want to point out that this PR does NOT require making any changes in the Collector, provided that the last (or any) item with a given key is preserved. The main point is just to allow passing duplicate keys and to make sure that doing so does not break the receiver, as this would allow performance improvements in SDKs.

EDIT: It would even be acceptable if the Collector dropped attributes with duplicated keys, but I do not imagine that happening 😉

@cijothomas
Member

This proposal is also motivated by performance improvements

I think the data consistency and deterministic processing across the OpenTelemetry project are more important than the performance degradation of one instrumentation library.

There are three known languages that need this currently (Go, Rust, and C++). It is unfair to say this is only about a single instrumentation library.

OTel .NET too. (OTel .NET does not do de-duplication for logs today.)

@pellared
Member Author

pellared commented Mar 13, 2024

Added to PR description:

The OTLP exporters can have a configuration to deduplicate key-val pairs before sending the data to satisfy receivers that require them to have unique keys.

@mx-psi
Member

mx-psi commented Mar 13, 2024

Thanks for making this more in line with JSON. The new wording allows for a backwards-compatible implementation in the Collector. However, I still don't think we have a satisfactory solution (I don't think there even is one).

Collector components will at most be able to forward duplicate keys, but no matter what implementation we pick, components won't be able to interact with duplicate keys in any meaningful way. If an application wants to support duplicate keys meaningfully, it won't be able to use the Collector as a library for implementing its OTLP server.

There are already a number of applications, from vendors and non-commercial FOSS alike, that use the Collector as a library (Jaeger, Grafana Tempo, Datadog Agent, AWS CloudWatch Agent, Elastic Agent, all vendor-specific Collector distros...). None of these applications will be able to interact with these duplicate keys. If a sizeable part of the OpenTelemetry ecosystem is not able to handle this feature correctly at all, what's the point of making this change?

@pellared
Member Author

pellared commented Mar 13, 2024

If a sizeable part of the OpenTelemetry ecosystem is not able to use this feature at all, what's the point of making this change?

Because it gives significant performance improvements for the SDKs. It would not require them to handle de-duplication.

@mx-psi
Member

mx-psi commented Mar 13, 2024

Personally, I don't think a performance improvement justifies what we may as well call a correctness issue on the Collector side.

@pellared pellared changed the title Allow attributes with duplicate keys Allow attributes with duplicate keys as undefined behaviour Mar 13, 2024
@pellared pellared changed the title Allow attributes with duplicate keys as undefined behaviour Allow attributes with duplicate keys as undefined behavior Mar 13, 2024
@pellared pellared changed the title Allow attributes with duplicate keys as undefined behavior KeyValues with duplicate keys as undefined behavior Mar 13, 2024
@austinlparker
Member

For the sake of discussion, is there any data about the actual performance impact of deduplication in the log pipeline at an SDK level?

Furthermore, the OpenTelemetry model is fundamentally built on blending multiple signals together. While I can certainly believe that existing logging patterns may rely on duplicate keys in log messages, is this a pattern we expect to hold going forward?

@jsuereth
Contributor

I'm very much against this change. This breaks a known invariant and forces all backends to deal with the issue. I don't think the benefit outweighs the overall cost.

@pellared pellared requested a review from a team March 13, 2024 12:35
@austinlparker
Member

I'm very much against this change. This breaks a known invariant and forces all backends to deal with the issue. I don't think the benefit outweighs the overall cost.

To echo this, my interest in this topic is that it would also have implications for every single vendor or tool that ingests OTLP. The scope of this change is massive, which means the burden of proof needs to be commensurate.

@pellared pellared marked this pull request as draft March 13, 2024 12:37
@pellared pellared changed the title KeyValues with duplicate keys as undefined behavior KeyValueList and log attributes with duplicate keys as undefined behavior Mar 13, 2024
@pellared
Member Author

For the sake of discussion, is there any data about the actual performance impact of deduplication in the log pipeline at an SDK level?

Please read the description. There is a benchmark for Rust which indicates a 30% performance improvement.

Side note:

Furthermore, the OpenTelemetry model is fundamentally built on blending multiple signals together. While I can certainly believe that existing logging patterns may rely on duplicate keys in log messages, is this a pattern we expect to hold going forward?

That is a good question. I updated the proto so that it should currently affect only logs. But I see that this change may set a precedent for further requests, e.g. to do the same for metrics metadata.

I am changing the PR to a draft to communicate that at this point in time I do not plan to push this further.

Thanks everyone for your feedback.

Still, the change in the Logs Data Model would be beneficial even if OTLP required unique keys, as other exporters could benefit from it. I will follow up in open-telemetry/opentelemetry-specification#3931.

@pellared
Member Author

pellared commented Mar 15, 2024

I'm very much against this change. This breaks a known invariant and forces all backends to deal with the issue. I don't think the benefit outweighs the overall cost.

I do not understand this statement, as right now it is still possible to send duplicated key-values using OTLP. The backends need to deal with this anyway. The PR just proposes to mark this as undefined behavior, because it is technically possible to send such data. Skipping the duplicate check, and the performance gained by doing so, may be more important for instrumentations than even losing the log records that contain duplicates, especially given that such duplications are rare in the real world and checking for duplicates always causes overhead.

I decided to reopen the PR as I think it is closely coupled to open-telemetry/opentelemetry-specification#3938.

@pellared pellared marked this pull request as ready for review March 15, 2024 13:35
Member

@tigrannajaryan tigrannajaryan left a comment


I am blocking this until there is full clarity on whether this is a breaking change or not.

@tigrannajaryan
Member

The PR just proposes to mark this as undefined behavior, because it is technically possible to send it.

@pellared This is unnecessary. There are many other payloads which are technically possible to send but are wrong because they violate a specified invariant. Unlike other specifications (e.g. the C++ standard), the OTLP spec does not explicitly call out undefined behavior; in this spec a behavior is "undefined" simply by being unspecified.

As an example: we have the AGGREGATION_TEMPORALITY_UNSPECIFIED value of AggregationTemporality. The spec says that it must not be used. Technically nothing prevents senders from using it. However, the spec is silent about what happens when receivers see this value. It is undefined behavior: what receivers do when they see this value is simply not known.

@pellared
Member Author

Closing as unnecessary.

@pellared pellared closed this Mar 15, 2024
@tigrannajaryan
Member

tigrannajaryan commented Mar 15, 2024

Although this is now closed I would like to comment on it to clarify an important principle that we can use to evaluate change proposals like this.

This change says that a payload that was previously non-compliant is now possible. It then gives receivers leeway to behave as they wish with such payloads, including rejecting the payload, or accepting and processing it in some reasonable way.

I think such an approach is fundamentally incompatible with OpenTelemetry's value of vendor neutrality. Allowing such leeway can result in critical differences in behavior between vendor backends, making it impossible for an OpenTelemetry user to switch vendors as they desire. If one vendor happily accepts your data and another outright rejects it, then I think we are failing the interoperability goal. This is bad and goes against OpenTelemetry principles.

That alone I think is sufficient for me to reject the change.

@pellared pellared deleted the patch-1 branch March 15, 2024 17:53
10 participants