gh-131798: Narrow the return type of `_FORMAT_SIMPLE` and `_FORMAT_WITH_SPEC` to `str` by NekoAsakura · Pull Request #146639 · python/cpython

NekoAsakura · 2026-03-30T17:43:58Z

Issue: Better uop coverage in the JIT optimizer #131798

…MAT_WITH_SPEC` to `str`

eendebakpt · 2026-03-30T18:34:37Z

+        res = sym_new_type(ctx, &PyUnicode_Type);
+    }
+
+    op(_FORMAT_WITH_SPEC, (value, fmt_spec -- res)) {


This can return a subclass, e.g.

class my(str): def __format__(self, spec): return self m=my('aap') type(f'{m}')

Does the sym_new_type(ctx, &PyUnicode_Type); not guarantee we have an exact type?

Good catch! I oversimplified this. Narrowing type for format isn't as straightforward as I thought.

That said, I'm curious why FORMAT_SIMPLE needs to preserve str subclasses. Had a look through the git history it's been this way since 2015, and seems not a deliberate type-preservation contract.

Current behaviour is already inconsistent:

class my(str): def __format__(self, spec): return self m = my('aap') type('{}'.format(m)) # <class 'my'> type('x{}'.format(m)) # <class 'str'> type(f'{m}') # <class 'my'> type(f'x{m}') # <class 'str'>

Multi-piece concatenation already strips it (both _PyUnicodeWriter in str.format and BUILD_STRING in f-strings create exact str).

If FORMAT_SIMPLE normalised to exact str, this inconsistency goes away and type narrowing becomes unconditionally correct. Would that be a reasonable change, or is there a reason to preserve str subclass identity through formatting that I'm missing?

Huh that's kinda scary. The semantics look a little strange, so maybe we should leave this alone for now?

Fair enough. I just noticed the inconsistency and thought it was worth raising.

For the narrowing itself: we could do conditional narrowing for input types where we can prove __format__ returns exact str (e.g. int, float). But honestly it feels not pythonic to hardcode a list of "safe" types for a uop that's less then 0.1%. I'd prefer just closing this PR unless you think it's worth keeping with that approach?

Actually we already have something like that called sym_is_safe_type, you can use that

I couldn't find sym_is_safe_type anywhere. The closest thing I can see is sym_is_safe_const, but that checks for known constant values rather than types. Have I overlooked something?

Oh yeah sorry that's the one! Just factor out the type checks in _Py_uop_sym_is_safe_const into another function and use that in this case, I think that's fine!

Done. How does it look now?

…MAT_WITH_SPEC` to str for built-in types

Fidget-Spinner · 2026-04-01T16:11:14Z

+    return (typ == &PyLong_Type) ||
+           (typ == &PyUnicode_Type) ||
+           (typ == &PyFloat_Type) ||
+           (typ == &_PyNone_Type) ||
+           (typ == &PyBool_Type) ||
+           (typ == &PyFrozenDict_Type);


Oops, I meant to factor this out into a common function, sorry!

You mean factoring safe types into a static helper? No problem.

…MAT_WITH_SPEC` to str for built-in types

Fidget-Spinner · 2026-04-03T00:24:15Z

Going to close and reopen this PR to re-trigger CI. Sorry!

# Conflicts: # Python/optimizer_symbols.c

markshannon

Looks good overall. The is_safe_builtin_type function needs fixing up though.

markshannon · 2026-05-01T11:15:21Z

 }

+static bool
+is_safe_builtin_type(PyTypeObject *typ)


I don't like this name. "is safe" for what?

For how we are using this, I don't think frozendict and frozenset should be included as they can contain "unsafe" objects. You should be able to add bytes to the list.

Maybe is_atomic_builtin_type?

Also add a comment explaining that "atomic" means that its values can be evaluated without side effects as they aren't containers. Also that common functions on them are pure and return the expected type, like str(ob) returns a str, int(ob) returns an obj, etc.

Looks like frozendict and frozenset aren't safe for const folding either. I've pushed a test that captures the behaviour. I am not sure if there is a better fix than just dropping them. Do you want that addressed here, or track in a separate issue?

This reverts commit 3752820.

read-the-docs-community · 2026-05-03T02:57:07Z

Documentation build overview

📚 cpython-previews | 🛠️ Build #32512649 | 📁 Comparing 02c37ad against main (c1940bc)

🔍 Preview build

40 files changed · ± 40 modified

± Modified

pythonGH-131798: Narrow the return type of _FORMAT_SIMPLE and `_FOR…

c1b9a83

…MAT_WITH_SPEC` to `str`

NekoAsakura requested review from Fidget-Spinner, markshannon, savannahostrowski and tomasr8 as code owners March 30, 2026 17:43

bedevere-app Bot mentioned this pull request Mar 30, 2026

Better uop coverage in the JIT optimizer #131798

Open

bedevere-app Bot added the awaiting review label Mar 30, 2026

eendebakpt reviewed Mar 30, 2026

View reviewed changes

pythongh-131798: Narrow the return type of _FORMAT_SIMPLE and `_FOR…

ed1204e

…MAT_WITH_SPEC` to str for built-in types

Fidget-Spinner reviewed Apr 1, 2026

View reviewed changes

pythongh-131798: Narrow the return type of _FORMAT_SIMPLE and `_FOR…

d09a7cc

…MAT_WITH_SPEC` to str for built-in types

Fidget-Spinner added the skip news label Apr 1, 2026

Fidget-Spinner closed this Apr 3, 2026

Fidget-Spinner reopened this Apr 3, 2026

Fidget-Spinner and others added 3 commits April 3, 2026 23:23

Merge branch 'main' into format-type-narrowing

3dea0df

Merge remote-tracking branch 'upstream/main' into format-type-narrowing

253f55b

# Conflicts: # Python/optimizer_symbols.c

Merge branch 'main' into format-type-narrowing

0efc0b5

markshannon reviewed May 1, 2026

View reviewed changes

NekoAsakura added 4 commits May 1, 2026 10:06

address feedback

e1c0417

Merge branch 'main' into format-type-narrowing

374b4af

add constant folding test

3752820

Revert "add constant folding test"

02c37ad

This reverts commit 3752820.

Uh oh!

Conversation

NekoAsakura commented Mar 30, 2026 • edited by bedevere-app Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Fidget-Spinner commented Apr 3, 2026

Uh oh!

markshannon left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

read-the-docs-community Bot commented May 3, 2026

Documentation build overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

NekoAsakura commented Mar 30, 2026 •

edited by bedevere-app Bot

Loading