[代理镜像] Add: Eliminating mathematical hallucinations with deterministic tool use by michaelwinczuk · Pull Request #2599 · openai/openai-cookbook

michaelwinczuk · 2026-04-09T01:55:43Z

Summary

A practical notebook showing how to eliminate LLM mathematical hallucinations by routing computation to SymPy via tool use.

What it covers

Why LLMs hallucinate math (token prediction vs computation)
A simple compute_math tool using SymPy
Side-by-side comparison: GPT-4o-mini without tools vs with SymPy tool
Benchmark data: 3B (55%), 7B (77%), 32B (93%), SymPy tool (100%)
The "separate perception from execution" pattern

Why it belongs in the cookbook

Directly demonstrates OpenAI's function calling / tool use feature
Solves a common production problem (math accuracy)
Complements the existing Developing_hallucination_guardrails.ipynb
Pattern generalizes beyond math (facts, code, dates)

Short, practical, copy-pasteable. Links to Math Swarm for the full 1,079-test, 12-category implementation.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 760ea983e3

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-04-09T02:00:39Z

+    "        tools=[math_tool],\n",
+    "        tool_choice=\"auto\",\n",
+    "        temperature=0,\n",


Force math tool invocation in deterministic path

The “WITH SYMPY TOOL (deterministic computation)” call uses tool_choice="auto", which still allows the model to skip the tool and answer from token prediction (your own else branch already handles [NO TOOL USED]). That means this path cannot guarantee deterministic/no-hallucination behavior and can regress to incorrect answers for some prompts; use a required/specific function tool choice for this section.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-09T02:00:39Z

+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Eliminating Mathematical Hallucinations with Deterministic Tool Use\n",


Add registry entry for the new notebook

This commit introduces a new cookbook notebook but does not update registry.yaml, so the publication pipeline will not index/render this page on cookbook.openai.com. Per this repo’s metadata workflow, new content must be added to the registry in the same change to avoid shipping an effectively hidden example.

Useful? React with 👍 / 👎.

Add: Eliminating mathematical hallucinations with deterministic tool use

760ea98

chatgpt-codex-connector bot reviewed Apr 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add: Eliminating mathematical hallucinations with deterministic tool use#2599

Add: Eliminating mathematical hallucinations with deterministic tool use#2599
michaelwinczuk wants to merge 1 commit intoopenai:mainfrom
michaelwinczuk:add-math-hallucination-elimination

michaelwinczuk commented Apr 9, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Apr 9, 2026

Uh oh!

chatgpt-codex-connector bot Apr 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

michaelwinczuk commented Apr 9, 2026

Summary

What it covers

Why it belongs in the cookbook

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant