Prevent duplicate system instructions in Python telemetry by Copilot · Pull Request #5981 · microsoft/agent-framework

Copilot · 2026-05-20T21:13:54Z

Motivation and Context

Python observability currently records system instructions twice on the same span: once in gen_ai.system_instructions and again in gen_ai.input.messages. This change keeps system instructions in their dedicated attribute and limits input messages to the actual conversation history.

Description

Telemetry serialization
- Update Python observability message capture to stop writing prepended system instructions into gen_ai.input.messages.
- Preserve existing gen_ai.system_instructions emission.
Behavioral scope
- Keep the logging path intact for prepended instruction messages.
- Only narrow the span attribute payload so gen_ai.input.messages reflects user/assistant/tool history.
Coverage
- Adjust focused observability tests for chat client and agent paths.
- Add assertions that non-ASCII system instructions remain preserved without leaking into input messages.

# Before
gen_ai.system_instructions = [{"type": "text", "content": "Be helpful."}]
gen_ai.input.messages = [
    {"role": "system", "parts": [{"type": "text", "content": "Be helpful."}]},
    {"role": "user", "parts": [{"type": "text", "content": "hello"}]},
]

# After
gen_ai.system_instructions = [{"type": "text", "content": "Be helpful."}]
gen_ai.input.messages = [
    {"role": "user", "parts": [{"type": "text", "content": "hello"}]},
]

Contribution Checklist

The code builds clean without any errors or warnings
The PR follows the Contribution Guidelines
All unit tests pass, and I have added new tests where possible
Is this a breaking change? If yes, add "[BREAKING]" prefix to the title of the PR.

alliscode · 2026-05-20T21:28:04Z

@copilot The relevant rule in the OTEL semantic conventions is:

gen_ai.system_instructions should be used when the provider/API allows system instructions to be provided separately from chat history. Instructions that are part of chat history should instead be recorded in gen_ai.input.messages.

Please make sure there are test case to ensure this is the case. Also test the case where the users input messages already contain a system message and system_insrtructions is also supplied, the span should contain:

gen_ai.system_instructions = framework/options instructions
gen_ai.input.messages      = original system message + user/assistant/tool history

Copilot · 2026-05-20T21:33:12Z

@copilot The relevant rule in the OTEL semantic conventions is:

...

Addressed in 5b12184. I added explicit coverage for separate framework instructions vs chat-history system messages, including the mixed case where system_instructions is set while the original input already contains a system message, for both chat-client and agent spans.

alliscode · 2026-05-20T21:59:50Z

-    prepped = prepend_instructions_to_messages(normalize_messages(messages), system_instructions)
-    otel_messages: list[dict[str, Any]] = []
+    normalized_messages = normalize_messages(messages)
+    prepped = prepend_instructions_to_messages(normalized_messages, system_instructions)


@copilot Is prepped used for anything else? Why do we still need to call prepend_instructions_to_messages if we intentionally skip over what it's adding?

Updated in 016cc1a. prepend_instructions_to_messages is still needed for the logging path: we still emit prepended framework instructions as message log events, but _capture_messages now builds gen_ai.input.messages directly from the original normalized chat history. I also added a focused regression test for that split.

alliscode · 2026-05-20T22:17:10Z

+    for index, (message, otel_message) in enumerate(
+        zip(logging_messages[:prepended_count], prepended_messages, strict=True)
+    ):
        # Reuse the otel message representation for logging instead of calling to_dict()
        # to avoid expensive Pydantic serialization overhead
-        otel_message = _to_otel_message(message)
-        otel_messages.append(otel_message)
+        logger.info(
+            otel_message,
+            extra={
+                OtelAttr.EVENT_NAME: OtelAttr.CHOICE if output else ROLE_EVENT_MAP.get(message.role),
+                OtelAttr.PROVIDER_NAME: provider_name,
+                MessageListTimestampFilter.INDEX_KEY: index,
+            },
+        )


@copilot Why are we doing this twice now?

Fixed in 84e5a62. _capture_messages now walks the logging message list once, logs each OTEL message once, and only appends the non-prepended entries to the span payload. I also added a regression test for the mixed framework-instructions + original system-message case so we keep the expected split without double-processing.

alliscode · 2026-05-20T22:28:54Z

-    otel_messages: list[dict[str, Any]] = []
-    for index, message in enumerate(prepped):
+    normalized_messages = normalize_messages(messages)
+    logging_messages = prepend_instructions_to_messages(normalized_messages, system_instructions)


We should not be always logging the messages with prepended instructions. I would expect this to be split similar to the span messages

@copilot please look

Fixed in 5e039d1. _capture_messages now logs only the normalized chat history, so framework/options instructions stay in gen_ai.system_instructions instead of being prepended into the message logs. I also updated the focused regression tests for both the instructions-only and mixed system-message cases.

Copilot · 2026-05-20T22:34:28Z

@copilot The relevant rule in the OTEL semantic conventions is:

...

Addressed in 5e039d1. The observability tests cover both cases now: separate framework/options instructions stay in gen_ai.system_instructions, and when the original input already contains a system message it remains in gen_ai.input.messages alongside the rest of the chat history for both chat-client and agent spans.

alliscode · 2026-05-20T22:51:07Z

-    otel_messages: list[dict[str, Any]] = []
-    for index, message in enumerate(prepped):
+    normalized_messages = normalize_messages(messages)
+    span_messages = []


Suggested change

span_messages = []

otel_messages: list[dict[str, Any]] = []

@copilot

Addressed in 589fad0.

Initial plan

7566e3e

Copilot AI assigned Copilot and alliscode May 20, 2026

Copilot AI requested review from Copilot and removed request for Copilot May 20, 2026 21:13

Copilot started work on behalf of alliscode May 20, 2026 21:14 View session

Copilot AI linked an issue May 20, 2026 that may be closed by this pull request

Python: System instructions duplicated in telemetry: appears in both gen_ai.system_instructions and gen_ai.input_messages #5978

Open

Fix duplicated system instructions in Python telemetry

30174f9

Copilot AI requested review from Copilot and removed request for Copilot May 20, 2026 21:17

Clarify telemetry message filtering

12c0164

Copilot AI requested review from Copilot and removed request for Copilot May 20, 2026 21:18

Copilot AI changed the title ~~[WIP] Fix duplicated system instructions in telemetry~~ Prevent duplicate system instructions in Python telemetry May 20, 2026

Copilot finished work on behalf of alliscode May 20, 2026 21:19

Copilot AI requested a review from alliscode May 20, 2026 21:19

Copilot started work on behalf of alliscode May 20, 2026 21:28 View session

test: cover separate and in-history system messages

1b691ae

Copilot AI requested review from Copilot and removed request for Copilot May 20, 2026 21:33

Copilot finished work on behalf of alliscode May 20, 2026 21:34

alliscode reviewed May 20, 2026

View reviewed changes

Copilot started work on behalf of alliscode May 20, 2026 22:00 View session

Clarify observability message logging split

016cc1a

Copilot AI requested review from Copilot and removed request for Copilot May 20, 2026 22:03

Simplify observability logging serialization

ece3db8

Copilot AI review requested due to automatic review settings May 20, 2026 22:04

Polish observability message serialization

40e4a0b

Copilot AI requested review from Copilot and removed request for Copilot May 20, 2026 22:07

Tighten observability zip checks

25ceb87

Copilot AI requested review from Copilot and removed request for Copilot May 20, 2026 22:08

Copilot finished work on behalf of alliscode May 20, 2026 22:09

Copilot AI requested a review from alliscode May 20, 2026 22:09

alliscode reviewed May 20, 2026

View reviewed changes

Copilot started work on behalf of alliscode May 20, 2026 22:17 View session

Refactor observability message capture loop

84e5a62

Copilot AI requested review from alliscode and Copilot and removed request for Copilot May 20, 2026 22:21

Copilot finished work on behalf of alliscode May 20, 2026 22:22

alliscode reviewed May 20, 2026

View reviewed changes

Copilot started work on behalf of alliscode May 20, 2026 22:29 View session

Fix telemetry logging for separate system instructions

5e039d1

Copilot AI requested review from Copilot and removed request for Copilot May 20, 2026 22:34

Copilot finished work on behalf of alliscode May 20, 2026 22:35

Copilot AI requested a review from alliscode May 20, 2026 22:35

alliscode reviewed May 20, 2026

View reviewed changes

Copilot started work on behalf of alliscode May 20, 2026 22:51 View session

Refine observability OTEL message typing

589fad0

Copilot AI requested review from Copilot and removed request for Copilot May 20, 2026 22:54

Copilot finished work on behalf of alliscode May 20, 2026 22:55

Copilot AI requested a review from alliscode May 20, 2026 22:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prevent duplicate system instructions in Python telemetry#5981

Prevent duplicate system instructions in Python telemetry#5981
Copilot wants to merge 14 commits into
mainfrom
copilot/fix-duplicated-system-instructions

Copilot AI commented May 20, 2026 •

edited

Loading

Uh oh!

alliscode commented May 20, 2026

Uh oh!

Copilot AI commented May 20, 2026

Uh oh!

alliscode May 20, 2026

Uh oh!

Copilot AI May 20, 2026

Uh oh!

alliscode May 20, 2026

Uh oh!

Copilot AI May 20, 2026

Uh oh!

alliscode May 20, 2026

Uh oh!

alliscode May 20, 2026

Uh oh!

Copilot AI May 20, 2026

Uh oh!

Copilot AI commented May 20, 2026

Uh oh!

alliscode May 20, 2026

Uh oh!

Copilot AI May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation and Context

Description

Contribution Checklist

Uh oh!

alliscode commented May 20, 2026

Uh oh!

Copilot AI commented May 20, 2026

Uh oh!

alliscode May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 20, 2026

Choose a reason for hiding this comment

Uh oh!

alliscode May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 20, 2026

Choose a reason for hiding this comment

Uh oh!

alliscode May 20, 2026

Choose a reason for hiding this comment

Uh oh!

alliscode May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI commented May 20, 2026

Uh oh!

alliscode May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented May 20, 2026 •

edited

Loading