Prevent repeated foreground summarization after failure by bhavyaus · Pull Request #4953 · microsoft/vscode-copilot-chat

bhavyaus · 2026-04-03T02:39:24Z

When foreground summarization fails (e.g. summary too large), the outcome metadata is recorded on the turn. On subsequent BudgetExceededErrors within the same turn, renderWithSummarization checks for this metadata and skips the expensive LLM summarization call, falling back directly to a no-cache-breakpoints render. This prevents the user from seeing repeated 'Compacting...' progress indicators and avoids wasting LLM calls on summarization that already failed.

- Add retry guard in renderWithSummarization checking turn metadata
- Record failure metadata (source: foreground, outcome: errorKind)
- Add 4 tests covering failure metadata, success metadata, round.summary not set on failure, and retry guard contract
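The guard condition described above can be sketched as a small predicate. This is a minimal illustration, not the code in agentIntent.ts: `SummaryMetadataLike` and `shouldSkipForegroundSummarization` are hypothetical names, and the real `SummarizedConversationHistoryMetadata` class carries more fields than shown here.

```typescript
// Hypothetical, simplified stand-in for the metadata recorded on a turn.
interface SummaryMetadataLike {
	source?: string;  // e.g. 'foreground'
	outcome?: string; // 'success', or an error kind such as 'budgetExceeded'
}

/**
 * Returns true when a foreground summarization has already been attempted
 * in this turn and did not succeed, so another attempt should be skipped.
 */
function shouldSkipForegroundSummarization(previous: SummaryMetadataLike | undefined): boolean {
	return previous !== undefined
		&& previous.source === 'foreground'
		&& !!previous.outcome
		&& previous.outcome !== 'success';
}
```

With this shape, a turn with no metadata, or with a successful foreground summary, still goes through summarization. Only a recorded foreground failure short-circuits to the fallback render.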

Copilot

Pull request overview

Prevents repeated (and likely futile) foreground conversation summarization attempts within the same turn after an initial summarization failure, reducing redundant LLM calls and repeated “Compacting conversation...” UX.

Changes:

Add an early retry guard in renderWithSummarization to skip foreground summarization if the latest turn already recorded a prior foreground failure.
Fall back directly to a non-cache-breakpoints render when the guard triggers.
Add new summarization-related unit tests (though several do not currently validate the behaviors their names/descriptions imply).

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
src/extension/intents/node/agentIntent.ts	Adds retry guard based on turn `SummarizedConversationHistoryMetadata` to avoid repeated foreground summarization after failure.
src/extension/prompts/node/agent/test/summarization.spec.tsx	Adds tests around summarization success/failure metadata, but some assertions are currently ineffective/misleading.

Comments suppressed due to low confidence (2)

src/extension/prompts/node/agent/test/summarization.spec.tsx:496

This test asserts round.summary stays undefined after a failed summarization, but in this test you never apply SummarizedConversationHistoryMetadata back onto toolCallRounds (unlike the existing agentPromptToString helper which does). As written, round.summary will remain undefined regardless of whether summarization succeeded, so the test doesn’t validate the intended behavior.

	test('failed summarization does not set round.summary', async () => {
		chatResponse[0] = 'x'.repeat(500_000);

		const instaService = accessor.get(IInstantiationService);
		const endpoint = instaService.createInstance(MockEndpoint, undefined);

		const toolCallRounds = [
			new ToolCallRound('ok', [createEditFileToolCall(1)]),
			new ToolCallRound('ok 2', [createEditFileToolCall(2)]),
			new ToolCallRound('ok 3', [createEditFileToolCall(3)]),
		];

		const turn = new Turn('turnId', { type: 'user', message: 'hello' });
		const testConversation = new Conversation('sessionId', [turn]);

		const promptContext: IBuildPromptContext = {
			chatVariables: new ChatVariablesCollection([{ id: 'vscode.file', name: 'file', value: fileTsUri }]),
			history: [],
			query: 'edit this file',
			toolCallRounds,
			toolCallResults: createEditFileToolResult(1, 2, 3),
			tools,
			conversation: testConversation,
		};

		const customizations = await PromptRegistry.resolveAllCustomizations(instaService, endpoint);
		const props: AgentPromptProps = {
			priority: 1,
			endpoint,
			location: ChatLocation.Panel,
			promptContext,
			enableCacheBreakpoints: true,
			triggerSummarize: true,
			customizations,
		};
		const renderer = PromptRenderer.create(instaService, endpoint, AgentPrompt, props);
		await renderer.render();

		// None of the rounds should have summary set since summarization failed
		for (const round of toolCallRounds) {
			expect(round.summary).toBeUndefined();
		}
	});
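The reviewer's point is that the `round.summary` assertion only means something if summary metadata is first copied back onto the rounds. The sketch below shows that step in isolation. It is an assumption-laden illustration: `RoundLike`, `SummaryMetadataEntry`, and `applySummariesToRounds` are hypothetical names, and this is not the existing agentPromptToString helper.

```typescript
// Hypothetical, simplified shapes for illustration only.
interface RoundLike {
	id: string;
	summary?: string;
}

interface SummaryMetadataEntry {
	toolCallRoundId: string; // empty for failed summarizations
	text: string;
}

/** Copies each summary onto the round it refers to; failures (empty id) are ignored. */
function applySummariesToRounds(rounds: RoundLike[], metadata: SummaryMetadataEntry[]): void {
	for (const entry of metadata) {
		if (!entry.toolCallRoundId) {
			continue; // failure metadata carries no round id, so nothing to apply
		}
		const round = rounds.find(r => r.id === entry.toolCallRoundId);
		if (round) {
			round.summary = entry.text;
		}
	}
}
```

Once a step like this runs in the test, a successful summarization would set `summary` on the matching round, so an assertion that it stays undefined after a failure can actually fail.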

src/extension/prompts/node/agent/test/summarization.spec.tsx:545

The “failure metadata on turn prevents repeated foreground summarization attempts” test re-implements the retry-guard boolean locally instead of exercising the actual behavior in agentIntent.ts (e.g., verifying that a second budget-exceeded path does not invoke the summarization fetcher again). This makes the test prone to false confidence: it will keep passing even if the production guard regresses or is removed.

	test('failure metadata on turn prevents repeated foreground summarization attempts', async () => {
		// This test verifies the contract that agentIntent.ts relies on:
		// after a foreground summarization failure, setting SummarizedConversationHistoryMetadata
		// with outcome !== 'success' on the turn causes the retry guard to skip summarization.

		const turn = new Turn('turnId', { type: 'user', message: 'hello' });

		// Simulate what agentIntent.ts does after a failed foreground summarization
		turn.setMetadata(new SummarizedConversationHistoryMetadata(
			'', // no toolCallRoundId for failures
			'', // no summary text for failures
			{
				model: 'test-model',
				source: 'foreground',
				outcome: 'budgetExceeded',
				contextLengthBefore: 100_000,
			},
		));

		// Verify the retry guard condition from renderWithSummarization matches
		const previousForegroundSummary = turn.getMetadata(SummarizedConversationHistoryMetadata);
		expect(previousForegroundSummary).toBeDefined();
		expect(previousForegroundSummary!.source).toBe('foreground');
		expect(previousForegroundSummary!.outcome).toBe('budgetExceeded');
		expect(previousForegroundSummary!.outcome).not.toBe('success');

		// The guard condition: source === 'foreground' && outcome && outcome !== 'success'
		const shouldSkip = previousForegroundSummary!.source === 'foreground'
			&& !!previousForegroundSummary!.outcome
			&& previousForegroundSummary!.outcome !== 'success';
		expect(shouldSkip).toBe(true);

		// Also verify that successful summarization does NOT trigger the skip guard
		turn.setMetadata(new SummarizedConversationHistoryMetadata(
			'roundId',
			'summary text',
			{
				model: 'test-model',
				source: 'foreground',
				outcome: 'success',
			},
		));
		const successMeta = turn.getMetadata(SummarizedConversationHistoryMetadata);
		const shouldSkipAfterSuccess = successMeta!.source === 'foreground'
			&& !!successMeta!.outcome
			&& successMeta!.outcome !== 'success';
		expect(shouldSkipAfterSuccess).toBe(false);
	});
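One way to address this comment is to drive the entry point twice and count how often the summarizer runs, rather than recomputing the boolean in the test. The sketch below assumes a simplified, synchronous stand-in named `renderWithGuard` and a hypothetical `TurnLike` shape. The production renderWithSummarization is not shown in this page and will differ.

```typescript
// Hypothetical stand-in for a turn that can remember a failed foreground attempt.
interface TurnLike {
	failedForegroundOutcome?: string;
}

/** Simplified stand-in for the guarded entry point (synchronous for brevity). */
function renderWithGuard(turn: TurnLike, summarize: () => string): string {
	if (turn.failedForegroundOutcome !== undefined) {
		return 'fallback'; // guard: a foreground attempt already failed in this turn
	}
	try {
		return summarize();
	} catch (err) {
		// Record the failure so later attempts in the same turn are skipped.
		turn.failedForegroundOutcome = err instanceof Error ? err.message : 'unknown';
		return 'fallback';
	}
}

// Test-style usage: a summarizer spy that always fails and counts its calls.
let summarizerCalls = 0;
const failingSummarizer = (): string => {
	summarizerCalls++;
	throw new Error('budgetExceeded');
};

const demoTurn: TurnLike = {};
const firstResult = renderWithGuard(demoTurn, failingSummarizer);
const secondResult = renderWithGuard(demoTurn, failingSummarizer);
```

A test written this way asserts that `summarizerCalls` is 1 after both calls, so it would fail if the guard were removed or regressed.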

- Extract renderWithoutSummarization() helper to deduplicate fallback render logic between skip-path and failure-path
- Add triggerSummarizeSkipped telemetry event + GenAiMetrics counter when the retry guard activates
- Add comment documenting getLatestTurn() assumption
- Extract createSummarizationTestContext() helper to DRY test setup
- Fix test assertions: failed summarization throws from renderer (fallback lives in agentIntent.ts, not PromptRenderer)
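The first two items can be pictured as one shared fallback helper called from both the skip path and the failure path, with an event emitted only when the guard activates. This is a rough sketch under stated assumptions: `RenderOptions`, `TurnState`, `TelemetryLike`, and the synchronous signatures are hypothetical simplifications. Only the names renderWithoutSummarization, triggerSummarizeSkipped, enableCacheBreakpoints, and triggerSummarize come from this PR.

```typescript
// Hypothetical, simplified types; the real prompt props and telemetry service differ.
interface RenderOptions {
	enableCacheBreakpoints: boolean;
	triggerSummarize: boolean;
}
type Render = (options: RenderOptions) => string;

interface TurnState {
	foregroundFailure?: string; // error kind recorded after a failed foreground summarization
}

interface TelemetryLike {
	sendEvent(name: string): void;
}

/** Shared fallback: render with neither summarization nor cache breakpoints. */
function renderWithoutSummarization(render: Render): string {
	return render({ enableCacheBreakpoints: false, triggerSummarize: false });
}

function renderWithSummarization(turn: TurnState, render: Render, telemetry: TelemetryLike): string {
	if (turn.foregroundFailure !== undefined) {
		// Skip path: summarization already failed earlier in this turn.
		telemetry.sendEvent('triggerSummarizeSkipped');
		return renderWithoutSummarization(render);
	}
	try {
		// Assumption: the summarizing render keeps cache breakpoints enabled.
		return render({ enableCacheBreakpoints: true, triggerSummarize: true });
	} catch (err) {
		// Failure path: remember the outcome, then reuse the same fallback.
		turn.foregroundFailure = err instanceof Error ? err.message : 'unknown';
		return renderWithoutSummarization(render);
	}
}
```

Routing both paths through one helper keeps the fallback options in a single place, so the skip path and the failure path cannot drift apart.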

Copilot AI review requested due to automatic review settings April 3, 2026 02:39

Copilot started reviewing on behalf of bhavyaus April 3, 2026 02:40

vs-code-engineering bot assigned bhavyaus Apr 3, 2026

Copilot AI reviewed Apr 3, 2026

Comment thread src/extension/intents/node/agentIntent.ts Outdated

Comment thread src/extension/prompts/node/agent/test/summarization.spec.tsx Outdated

bhavyaus added 2 commits April 2, 2026 19:52

Simplify summarization failure tests

c752581

bhavyaus enabled auto-merge April 3, 2026 03:28

deepak1556 approved these changes Apr 3, 2026

bhavyaus added this pull request to the merge queue Apr 3, 2026

Merged via the queue into main with commit b5208ae Apr 3, 2026
19 checks passed

bhavyaus deleted the dev/bhavyau/foreground-summarization-retry-fix branch April 3, 2026 07:21

bhavyaus merged 3 commits into main from dev/bhavyau/foreground-summarization-retry-fix

3 participants
