CINXE.COM
PEP 533 – Deterministic cleanup for iterators | peps.python.org
<!DOCTYPE html> <html lang="en"> <head> <meta charset="utf-8"> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <meta name="color-scheme" content="light dark"> <title>PEP 533 – Deterministic cleanup for iterators | peps.python.org</title> <link rel="shortcut icon" href="../_static/py.png"> <link rel="canonical" href="https://peps.python.org/pep-0533/"> <link rel="stylesheet" href="../_static/style.css" type="text/css"> <link rel="stylesheet" href="../_static/mq.css" type="text/css"> <link rel="stylesheet" href="../_static/pygments.css" type="text/css" media="(prefers-color-scheme: light)" id="pyg-light"> <link rel="stylesheet" href="../_static/pygments_dark.css" type="text/css" media="(prefers-color-scheme: dark)" id="pyg-dark"> <link rel="alternate" type="application/rss+xml" title="Latest PEPs" href="https://peps.python.org/peps.rss"> <meta property="og:title" content='PEP 533 – Deterministic cleanup for iterators | peps.python.org'> <meta property="og:description" content="We propose to extend the iterator protocol with a new __(a)iterclose__ slot, which is called automatically on exit from (async) for loops, regardless of how they exit. This allows for convenient, deterministic cleanup of resources held by iterators with..."> <meta property="og:type" content="website"> <meta property="og:url" content="https://peps.python.org/pep-0533/"> <meta property="og:site_name" content="Python Enhancement Proposals (PEPs)"> <meta property="og:image" content="https://peps.python.org/_static/og-image.png"> <meta property="og:image:alt" content="Python PEPs"> <meta property="og:image:width" content="200"> <meta property="og:image:height" content="200"> <meta name="description" content="We propose to extend the iterator protocol with a new __(a)iterclose__ slot, which is called automatically on exit from (async) for loops, regardless of how they exit. This allows for convenient, deterministic cleanup of resources held by iterators with..."> <meta name="theme-color" content="#3776ab"> </head> <body> <svg xmlns="http://www.w3.org/2000/svg" style="display: none;"> <symbol id="svg-sun-half" viewBox="0 0 24 24" pointer-events="all"> <title>Following system colour scheme</title> <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"> <circle cx="12" cy="12" r="9"></circle> <path d="M12 3v18m0-12l4.65-4.65M12 14.3l7.37-7.37M12 19.6l8.85-8.85"></path> </svg> </symbol> <symbol id="svg-moon" viewBox="0 0 24 24" pointer-events="all"> <title>Selected dark colour scheme</title> <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"> <path stroke="none" d="M0 0h24v24H0z" fill="none"></path> <path d="M12 3c.132 0 .263 0 .393 0a7.5 7.5 0 0 0 7.92 12.446a9 9 0 1 1 -8.313 -12.454z"></path> </svg> </symbol> <symbol id="svg-sun" viewBox="0 0 24 24" pointer-events="all"> <title>Selected light colour scheme</title> <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"> <circle cx="12" cy="12" r="5"></circle> <line x1="12" y1="1" x2="12" y2="3"></line> <line x1="12" y1="21" x2="12" y2="23"></line> <line x1="4.22" y1="4.22" x2="5.64" y2="5.64"></line> <line x1="18.36" y1="18.36" x2="19.78" y2="19.78"></line> <line x1="1" y1="12" x2="3" y2="12"></line> <line x1="21" y1="12" x2="23" y2="12"></line> <line x1="4.22" y1="19.78" x2="5.64" y2="18.36"></line> <line x1="18.36" y1="5.64" x2="19.78" y2="4.22"></line> </svg> </symbol> </svg> <script> document.documentElement.dataset.colour_scheme = localStorage.getItem("colour_scheme") || "auto" </script> <section id="pep-page-section"> <header> <h1>Python Enhancement Proposals</h1> <ul class="breadcrumbs"> <li><a href="https://www.python.org/" title="The Python Programming Language">Python</a> » </li> <li><a href="../pep-0000/">PEP Index</a> » </li> <li>PEP 533</li> </ul> <button id="colour-scheme-cycler" onClick="setColourScheme(nextColourScheme())"> <svg aria-hidden="true" class="colour-scheme-icon-when-auto"><use href="#svg-sun-half"></use></svg> <svg aria-hidden="true" class="colour-scheme-icon-when-dark"><use href="#svg-moon"></use></svg> <svg aria-hidden="true" class="colour-scheme-icon-when-light"><use href="#svg-sun"></use></svg> <span class="visually-hidden">Toggle light / dark / auto colour theme</span> </button> </header> <article> <section id="pep-content"> <h1 class="page-title">PEP 533 – Deterministic cleanup for iterators</h1> <dl class="rfc2822 field-list simple"> <dt class="field-odd">Author<span class="colon">:</span></dt> <dd class="field-odd">Nathaniel J. Smith</dd> <dt class="field-even">BDFL-Delegate<span class="colon">:</span></dt> <dd class="field-even">Yury Selivanov <yury at edgedb.com></dd> <dt class="field-odd">Status<span class="colon">:</span></dt> <dd class="field-odd"><abbr title="Inactive draft that may be taken up again at a later time">Deferred</abbr></dd> <dt class="field-even">Type<span class="colon">:</span></dt> <dd class="field-even"><abbr title="Normative PEP with a new feature for Python, implementation change for CPython or interoperability standard for the ecosystem">Standards Track</abbr></dd> <dt class="field-odd">Created<span class="colon">:</span></dt> <dd class="field-odd">18-Oct-2016</dd> <dt class="field-even">Post-History<span class="colon">:</span></dt> <dd class="field-even">18-Oct-2016</dd> </dl> <hr class="docutils" /> <section id="contents"> <details><summary>Table of Contents</summary><ul class="simple"> <li><a class="reference internal" href="#abstract">Abstract</a></li> <li><a class="reference internal" href="#note-on-timing">Note on timing</a></li> <li><a class="reference internal" href="#background-and-motivation">Background and motivation</a></li> <li><a class="reference internal" href="#alternatives">Alternatives</a><ul> <li><a class="reference internal" href="#pep-525-asyncgen-hooks">PEP 525 asyncgen hooks</a></li> <li><a class="reference internal" href="#always-inject-resources-and-do-all-cleanup-at-the-top-level">Always inject resources, and do all cleanup at the top level</a></li> <li><a class="reference internal" href="#more-complex-variants-of-a-iterclose">More complex variants of __(a)iterclose__</a></li> </ul> </li> <li><a class="reference internal" href="#specification">Specification</a><ul> <li><a class="reference internal" href="#guiding-principles">Guiding principles</a></li> <li><a class="reference internal" href="#changes-to-iteration">Changes to iteration</a></li> <li><a class="reference internal" href="#changes-to-async-iteration">Changes to async iteration</a></li> <li><a class="reference internal" href="#modifications-to-basic-iterator-types">Modifications to basic iterator types</a></li> <li><a class="reference internal" href="#new-convenience-functions">New convenience functions</a></li> <li><a class="reference internal" href="#iterclose-implementations-for-iterator-wrappers">__iterclose__ implementations for iterator wrappers</a></li> <li><a class="reference internal" href="#example-rationale">Example / Rationale</a></li> </ul> </li> <li><a class="reference internal" href="#transition-plan">Transition plan</a></li> <li><a class="reference internal" href="#acknowledgements">Acknowledgements</a></li> <li><a class="reference internal" href="#copyright">Copyright</a></li> </ul> </details></section> <section id="abstract"> <h2><a class="toc-backref" href="#abstract" role="doc-backlink">Abstract</a></h2> <p>We propose to extend the iterator protocol with a new <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code> slot, which is called automatically on exit from <code class="docutils literal notranslate"><span class="pre">(async)</span> <span class="pre">for</span></code> loops, regardless of how they exit. This allows for convenient, deterministic cleanup of resources held by iterators without reliance on the garbage collector. This is especially valuable for asynchronous generators.</p> </section> <section id="note-on-timing"> <h2><a class="toc-backref" href="#note-on-timing" role="doc-backlink">Note on timing</a></h2> <p>In practical terms, the proposal here is divided into two separate parts: the handling of async iterators, which should ideally be implemented ASAP, and the handling of regular iterators, which is a larger but more relaxed project that can’t start until 3.7 at the earliest. But since the changes are closely related, and we probably don’t want to end up with async iterators and regular iterators diverging in the long run, it seems useful to look at them together.</p> </section> <section id="background-and-motivation"> <h2><a class="toc-backref" href="#background-and-motivation" role="doc-backlink">Background and motivation</a></h2> <p>Python iterables often hold resources which require cleanup. For example: <code class="docutils literal notranslate"><span class="pre">file</span></code> objects need to be closed; the <a class="pep reference internal" href="../pep-0333/" title="PEP 333 – Python Web Server Gateway Interface v1.0">WSGI spec</a> adds a <code class="docutils literal notranslate"><span class="pre">close</span></code> method on top of the regular iterator protocol and demands that consumers call it at the appropriate time (though forgetting to do so is a <a class="reference external" href="http://blog.dscpl.com.au/2012/10/obligations-for-calling-close-on.html">frequent source of bugs</a>); and <a class="pep reference internal" href="../pep-0342/" title="PEP 342 – Coroutines via Enhanced Generators">PEP 342</a> (based on <a class="pep reference internal" href="../pep-0325/" title="PEP 325 – Resource-Release Support for Generators">PEP 325</a>) extended generator objects to add a <code class="docutils literal notranslate"><span class="pre">close</span></code> method to allow generators to clean up after themselves.</p> <p>Generally, objects that need to clean up after themselves also define a <code class="docutils literal notranslate"><span class="pre">__del__</span></code> method to ensure that this cleanup will happen eventually, when the object is garbage collected. However, relying on the garbage collector for cleanup like this causes serious problems in several cases:</p> <ul class="simple"> <li>In Python implementations that do not use reference counting (e.g. PyPy, Jython), calls to <code class="docutils literal notranslate"><span class="pre">__del__</span></code> may be arbitrarily delayed – yet many situations require <em>prompt</em> cleanup of resources. Delayed cleanup produces problems like crashes due to file descriptor exhaustion, or WSGI timing middleware that collects bogus times.</li> <li>Async generators (<a class="pep reference internal" href="../pep-0525/" title="PEP 525 – Asynchronous Generators">PEP 525</a>) can only perform cleanup under the supervision of the appropriate coroutine runner. <code class="docutils literal notranslate"><span class="pre">__del__</span></code> doesn’t have access to the coroutine runner; indeed, the coroutine runner might be garbage collected before the generator object. So relying on the garbage collector is effectively impossible without some kind of language extension. (<a class="pep reference internal" href="../pep-0525/" title="PEP 525 – Asynchronous Generators">PEP 525</a> does provide such an extension, but it has a number of limitations that this proposal fixes; see the “alternatives” section below for discussion.)</li> </ul> <p>Fortunately, Python provides a standard tool for doing resource cleanup in a more structured way: <code class="docutils literal notranslate"><span class="pre">with</span></code> blocks. For example, this code opens a file but relies on the garbage collector to close it:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">def</span><span class="w"> </span><span class="nf">read_newline_separated_json</span><span class="p">(</span><span class="n">path</span><span class="p">):</span> <span class="k">for</span> <span class="n">line</span> <span class="ow">in</span> <span class="nb">open</span><span class="p">(</span><span class="n">path</span><span class="p">):</span> <span class="k">yield</span> <span class="n">json</span><span class="o">.</span><span class="n">loads</span><span class="p">(</span><span class="n">line</span><span class="p">)</span> <span class="k">for</span> <span class="n">document</span> <span class="ow">in</span> <span class="n">read_newline_separated_json</span><span class="p">(</span><span class="n">path</span><span class="p">):</span> <span class="o">...</span> </pre></div> </div> <p>and recent versions of CPython will point this out by issuing a <code class="docutils literal notranslate"><span class="pre">ResourceWarning</span></code>, nudging us to fix it by adding a <code class="docutils literal notranslate"><span class="pre">with</span></code> block:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">def</span><span class="w"> </span><span class="nf">read_newline_separated_json</span><span class="p">(</span><span class="n">path</span><span class="p">):</span> <span class="k">with</span> <span class="nb">open</span><span class="p">(</span><span class="n">path</span><span class="p">)</span> <span class="k">as</span> <span class="n">file_handle</span><span class="p">:</span> <span class="c1"># <-- with block</span> <span class="k">for</span> <span class="n">line</span> <span class="ow">in</span> <span class="n">file_handle</span><span class="p">:</span> <span class="k">yield</span> <span class="n">json</span><span class="o">.</span><span class="n">loads</span><span class="p">(</span><span class="n">line</span><span class="p">)</span> <span class="k">for</span> <span class="n">document</span> <span class="ow">in</span> <span class="n">read_newline_separated_json</span><span class="p">(</span><span class="n">path</span><span class="p">):</span> <span class="c1"># <-- outer for loop</span> <span class="o">...</span> </pre></div> </div> <p>But there’s a subtlety here, caused by the interaction of <code class="docutils literal notranslate"><span class="pre">with</span></code> blocks and generators. <code class="docutils literal notranslate"><span class="pre">with</span></code> blocks are Python’s main tool for managing cleanup, and they’re a powerful one, because they pin the lifetime of a resource to the lifetime of a stack frame. But this assumes that someone will take care of cleaning up the stack frame… and for generators, this requires that someone <code class="docutils literal notranslate"><span class="pre">close</span></code> them.</p> <p>In this case, adding the <code class="docutils literal notranslate"><span class="pre">with</span></code> block <em>is</em> enough to shut up the <code class="docutils literal notranslate"><span class="pre">ResourceWarning</span></code>, but this is misleading – the file object cleanup here is still dependent on the garbage collector. The <code class="docutils literal notranslate"><span class="pre">with</span></code> block will only be unwound when the <code class="docutils literal notranslate"><span class="pre">read_newline_separated_json</span></code> generator is closed. If the outer <code class="docutils literal notranslate"><span class="pre">for</span></code> loop runs to completion then the cleanup will happen immediately; but if this loop is terminated early by a <code class="docutils literal notranslate"><span class="pre">break</span></code> or an exception, then the <code class="docutils literal notranslate"><span class="pre">with</span></code> block won’t fire until the generator object is garbage collected.</p> <p>The correct solution requires that all <em>users</em> of this API wrap every <code class="docutils literal notranslate"><span class="pre">for</span></code> loop in its own <code class="docutils literal notranslate"><span class="pre">with</span></code> block:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">with</span> <span class="n">closing</span><span class="p">(</span><span class="n">read_newline_separated_json</span><span class="p">(</span><span class="n">path</span><span class="p">))</span> <span class="k">as</span> <span class="n">genobj</span><span class="p">:</span> <span class="k">for</span> <span class="n">document</span> <span class="ow">in</span> <span class="n">genobj</span><span class="p">:</span> <span class="o">...</span> </pre></div> </div> <p>This gets even worse if we consider the idiom of decomposing a complex pipeline into multiple nested generators:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">def</span><span class="w"> </span><span class="nf">read_users</span><span class="p">(</span><span class="n">path</span><span class="p">):</span> <span class="k">with</span> <span class="n">closing</span><span class="p">(</span><span class="n">read_newline_separated_json</span><span class="p">(</span><span class="n">path</span><span class="p">))</span> <span class="k">as</span> <span class="n">gen</span><span class="p">:</span> <span class="k">for</span> <span class="n">document</span> <span class="ow">in</span> <span class="n">gen</span><span class="p">:</span> <span class="k">yield</span> <span class="n">User</span><span class="o">.</span><span class="n">from_json</span><span class="p">(</span><span class="n">document</span><span class="p">)</span> <span class="k">def</span><span class="w"> </span><span class="nf">users_in_group</span><span class="p">(</span><span class="n">path</span><span class="p">,</span> <span class="n">group</span><span class="p">):</span> <span class="k">with</span> <span class="n">closing</span><span class="p">(</span><span class="n">read_users</span><span class="p">(</span><span class="n">path</span><span class="p">))</span> <span class="k">as</span> <span class="n">gen</span><span class="p">:</span> <span class="k">for</span> <span class="n">user</span> <span class="ow">in</span> <span class="n">gen</span><span class="p">:</span> <span class="k">if</span> <span class="n">user</span><span class="o">.</span><span class="n">group</span> <span class="o">==</span> <span class="n">group</span><span class="p">:</span> <span class="k">yield</span> <span class="n">user</span> </pre></div> </div> <p>In general if you have N nested generators then you need N+1 <code class="docutils literal notranslate"><span class="pre">with</span></code> blocks to clean up 1 file. And good defensive programming would suggest that any time we use a generator, we should assume the possibility that there could be at least one <code class="docutils literal notranslate"><span class="pre">with</span></code> block somewhere in its (potentially transitive) call stack, either now or in the future, and thus always wrap it in a <code class="docutils literal notranslate"><span class="pre">with</span></code>. But in practice, basically nobody does this, because programmers would rather write buggy code than tiresome repetitive code. In simple cases like this there are some workarounds that good Python developers know (e.g. in this simple case it would be idiomatic to pass in a file handle instead of a path and move the resource management to the top level), but in general we cannot avoid the use of <code class="docutils literal notranslate"><span class="pre">with</span></code>/<code class="docutils literal notranslate"><span class="pre">finally</span></code> inside of generators, and thus dealing with this problem one way or another. When beauty and correctness fight then beauty tends to win, so it’s important to make correct code beautiful.</p> <p>Still, is this worth fixing? Until async generators came along I would have argued yes, but that it was a low priority, since everyone seems to be muddling along okay – but async generators make it much more urgent. Async generators cannot do cleanup <em>at all</em> without some mechanism for deterministic cleanup that people will actually use, and async generators are particularly likely to hold resources like file descriptors. (After all, if they weren’t doing I/O, they’d be generators, not async generators.) So we have to do something, and it might as well be a comprehensive fix to the underlying problem. And it’s much easier to fix this now when async generators are first rolling out, than it will be to fix it later.</p> <p>The proposal itself is simple in concept: add a <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code> method to the iterator protocol, and have (async) <code class="docutils literal notranslate"><span class="pre">for</span></code> loops call it when the loop is exited, even if this occurs via <code class="docutils literal notranslate"><span class="pre">break</span></code> or exception unwinding. Effectively, we’re taking the current cumbersome idiom (<code class="docutils literal notranslate"><span class="pre">with</span></code> block + <code class="docutils literal notranslate"><span class="pre">for</span></code> loop) and merging them together into a fancier <code class="docutils literal notranslate"><span class="pre">for</span></code>. This may seem non-orthogonal, but makes sense when you consider that the existence of generators means that <code class="docutils literal notranslate"><span class="pre">with</span></code> blocks actually depend on iterator cleanup to work reliably, plus experience showing that iterator cleanup is often a desirable feature in its own right.</p> </section> <section id="alternatives"> <h2><a class="toc-backref" href="#alternatives" role="doc-backlink">Alternatives</a></h2> <section id="pep-525-asyncgen-hooks"> <h3><a class="toc-backref" href="#pep-525-asyncgen-hooks" role="doc-backlink">PEP 525 asyncgen hooks</a></h3> <p><a class="pep reference internal" href="../pep-0525/#finalization" title="PEP 525 – Asynchronous Generators § Finalization">PEP 525 proposes a set of global thread-local hooks</a> managed by new <code class="docutils literal notranslate"><span class="pre">sys.{get/set}_asyncgen_hooks()</span></code> functions, which allow event loops to integrate with the garbage collector to run cleanup for async generators. In principle, this proposal and <a class="pep reference internal" href="../pep-0525/" title="PEP 525 – Asynchronous Generators">PEP 525</a> are complementary, in the same way that <code class="docutils literal notranslate"><span class="pre">with</span></code> blocks and <code class="docutils literal notranslate"><span class="pre">__del__</span></code> are complementary: this proposal takes care of ensuring deterministic cleanup in most cases, while <a class="pep reference internal" href="../pep-0525/" title="PEP 525 – Asynchronous Generators">PEP 525</a>’s GC hooks clean up anything that gets missed. But <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> provides a number of advantages over GC hooks alone:</p> <ul> <li>The GC hook semantics aren’t part of the abstract async iterator protocol, but are instead restricted <a class="reference external" href="https://mail.python.org/pipermail/python-dev/2016-September/146129.html">specifically to the async generator concrete type</a>. If you have an async iterator implemented using a class, like:<div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">class</span><span class="w"> </span><span class="nc">MyAsyncIterator</span><span class="p">:</span> <span class="k">async</span> <span class="k">def</span><span class="w"> </span><span class="fm">__anext__</span><span class="p">():</span> <span class="o">...</span> </pre></div> </div> <p>then you can’t refactor this into an async generator without changing its semantics, and vice-versa. This seems very unpythonic. (It also leaves open the question of what exactly class-based async iterators are supposed to do, given that they face exactly the same cleanup problems as async generators.) <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>, on the other hand, is defined at the protocol level, so it’s duck-type friendly and works for all iterators, not just generators.</p> </li> <li>Code that wants to work on non-CPython implementations like PyPy cannot in general rely on GC for cleanup. Without <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>, it’s more or less guaranteed that developers who develop and test on CPython will produce libraries that leak resources when used on PyPy. Developers who do want to target alternative implementations will either have to take the defensive approach of wrapping every <code class="docutils literal notranslate"><span class="pre">for</span></code> loop in a <code class="docutils literal notranslate"><span class="pre">with</span></code> block, or else carefully audit their code to figure out which generators might possibly contain cleanup code and add <code class="docutils literal notranslate"><span class="pre">with</span></code> blocks around those only. With <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>, writing portable code becomes easy and natural.</li> <li>An important part of building robust software is making sure that exceptions always propagate correctly without being lost. One of the most exciting things about async/await compared to traditional callback-based systems is that instead of requiring manual chaining, the runtime can now do the heavy lifting of propagating errors, making it <em>much</em> easier to write robust code. But, this beautiful new picture has one major gap: if we rely on the GC for generator cleanup, then exceptions raised during cleanup are lost. So, again, with <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>, developers who care about this kind of robustness will either have to take the defensive approach of wrapping every <code class="docutils literal notranslate"><span class="pre">for</span></code> loop in a <code class="docutils literal notranslate"><span class="pre">with</span></code> block, or else carefully audit their code to figure out which generators might possibly contain cleanup code. <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> plugs this hole by performing cleanup in the caller’s context, so writing more robust code becomes the path of least resistance.</li> <li>The WSGI experience suggests that there exist important iterator-based APIs that need prompt cleanup and cannot rely on the GC, even in CPython. For example, consider a hypothetical WSGI-like API based around async/await and async iterators, where a response handler is an async generator that takes request headers + an async iterator over the request body, and yields response headers + the response body. (This is actually the use case that got me interested in async generators in the first place, i.e. this isn’t hypothetical.) If we follow WSGI in requiring that child iterators must be closed properly, then without <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> the absolute most minimalistic middleware in our system looks something like:<div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">async</span> <span class="k">def</span><span class="w"> </span><span class="nf">noop_middleware</span><span class="p">(</span><span class="n">handler</span><span class="p">,</span> <span class="n">request_header</span><span class="p">,</span> <span class="n">request_body</span><span class="p">):</span> <span class="k">async</span> <span class="k">with</span> <span class="n">aclosing</span><span class="p">(</span><span class="n">handler</span><span class="p">(</span><span class="n">request_body</span><span class="p">,</span> <span class="n">request_body</span><span class="p">))</span> <span class="k">as</span> <span class="nb">aiter</span><span class="p">:</span> <span class="k">async</span> <span class="k">for</span> <span class="n">response_item</span> <span class="ow">in</span> <span class="nb">aiter</span><span class="p">:</span> <span class="k">yield</span> <span class="n">response_item</span> </pre></div> </div> <p>Arguably in regular code one can get away with skipping the <code class="docutils literal notranslate"><span class="pre">with</span></code> block around <code class="docutils literal notranslate"><span class="pre">for</span></code> loops, depending on how confident one is that one understands the internal implementation of the generator. But here we have to cope with arbitrary response handlers, so without <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>, this <code class="docutils literal notranslate"><span class="pre">with</span></code> construction is a mandatory part of every middleware.</p> <p><code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> allows us to eliminate the mandatory boilerplate and an extra level of indentation from every middleware:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">async</span> <span class="k">def</span><span class="w"> </span><span class="nf">noop_middleware</span><span class="p">(</span><span class="n">handler</span><span class="p">,</span> <span class="n">request_header</span><span class="p">,</span> <span class="n">request_body</span><span class="p">):</span> <span class="k">async</span> <span class="k">for</span> <span class="n">response_item</span> <span class="ow">in</span> <span class="n">handler</span><span class="p">(</span><span class="n">request_header</span><span class="p">,</span> <span class="n">request_body</span><span class="p">):</span> <span class="k">yield</span> <span class="n">response_item</span> </pre></div> </div> </li> </ul> <p>So the <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> approach provides substantial advantages over GC hooks.</p> <p>This leaves open the question of whether we want a combination of GC hooks + <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>, or just <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> alone. Since the vast majority of generators are iterated over using a <code class="docutils literal notranslate"><span class="pre">for</span></code> loop or equivalent, <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> handles most situations before the GC has a chance to get involved. The case where GC hooks provide additional value is in code that does manual iteration, e.g.:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="n">agen</span> <span class="o">=</span> <span class="n">fetch_newline_separated_json_from_url</span><span class="p">(</span><span class="o">...</span><span class="p">)</span> <span class="k">while</span> <span class="kc">True</span><span class="p">:</span> <span class="n">document</span> <span class="o">=</span> <span class="k">await</span> <span class="nb">type</span><span class="p">(</span><span class="n">agen</span><span class="p">)</span><span class="o">.</span><span class="fm">__anext__</span><span class="p">(</span><span class="n">agen</span><span class="p">)</span> <span class="k">if</span> <span class="n">document</span><span class="p">[</span><span class="s2">"id"</span><span class="p">]</span> <span class="o">==</span> <span class="n">needle</span><span class="p">:</span> <span class="k">break</span> <span class="c1"># doesn't do 'await agen.aclose()'</span> </pre></div> </div> <p>If we go with the GC-hooks + <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> approach, this generator will eventually be cleaned up by GC calling the generator <code class="docutils literal notranslate"><span class="pre">__del__</span></code> method, which then will use the hooks to call back into the event loop to run the cleanup code.</p> <p>If we go with the no-GC-hooks approach, this generator will eventually be garbage collected, with the following effects:</p> <ul class="simple"> <li>its <code class="docutils literal notranslate"><span class="pre">__del__</span></code> method will issue a warning that the generator was not closed (similar to the existing “coroutine never awaited” warning).</li> <li>The underlying resources involved will still be cleaned up, because the generator frame will still be garbage collected, causing it to drop references to any file handles or sockets it holds, and then those objects’s <code class="docutils literal notranslate"><span class="pre">__del__</span></code> methods will release the actual operating system resources.</li> <li>But, any cleanup code inside the generator itself (e.g. logging, buffer flushing) will not get a chance to run.</li> </ul> <p>The solution here – as the warning would indicate – is to fix the code so that it calls <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>, e.g. by using a <code class="docutils literal notranslate"><span class="pre">with</span></code> block:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">async</span> <span class="k">with</span> <span class="n">aclosing</span><span class="p">(</span><span class="n">fetch_newline_separated_json_from_url</span><span class="p">(</span><span class="o">...</span><span class="p">))</span> <span class="k">as</span> <span class="n">agen</span><span class="p">:</span> <span class="k">while</span> <span class="kc">True</span><span class="p">:</span> <span class="n">document</span> <span class="o">=</span> <span class="k">await</span> <span class="nb">type</span><span class="p">(</span><span class="n">agen</span><span class="p">)</span><span class="o">.</span><span class="fm">__anext__</span><span class="p">(</span><span class="n">agen</span><span class="p">)</span> <span class="k">if</span> <span class="n">document</span><span class="p">[</span><span class="s2">"id"</span><span class="p">]</span> <span class="o">==</span> <span class="n">needle</span><span class="p">:</span> <span class="k">break</span> </pre></div> </div> <p>Basically in this approach, the rule would be that if you want to manually implement the iterator protocol, then it’s your responsibility to implement all of it, and that now includes <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code>.</p> <p>GC hooks add non-trivial complexity in the form of (a) new global interpreter state, (b) a somewhat complicated control flow (e.g., async generator GC always involves resurrection, so the details of PEP 442 are important), and (c) a new public API in asyncio (<code class="docutils literal notranslate"><span class="pre">await</span> <span class="pre">loop.shutdown_asyncgens()</span></code>) that users have to remember to call at the appropriate time. (This last point in particular somewhat undermines the argument that GC hooks provide a safe backup to guarantee cleanup, since if <code class="docutils literal notranslate"><span class="pre">shutdown_asyncgens()</span></code> isn’t called correctly then I <em>think</em> it’s possible for generators to be silently discarded without their cleanup code being called; compare this to the <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>-only approach where in the worst case we still at least get a warning printed. This might be fixable.) All this considered, GC hooks arguably aren’t worth it, given that the only people they help are those who want to manually call <code class="docutils literal notranslate"><span class="pre">__anext__</span></code> yet don’t want to manually call <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>. But Yury disagrees with me on this :-). And both options are viable.</p> </section> <section id="always-inject-resources-and-do-all-cleanup-at-the-top-level"> <h3><a class="toc-backref" href="#always-inject-resources-and-do-all-cleanup-at-the-top-level" role="doc-backlink">Always inject resources, and do all cleanup at the top level</a></h3> <p>Several commentators on python-dev and python-ideas have suggested that a pattern to avoid these problems is to always pass resources in from above, e.g. <code class="docutils literal notranslate"><span class="pre">read_newline_separated_json</span></code> should take a file object rather than a path, with cleanup handled at the top level:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">def</span><span class="w"> </span><span class="nf">read_newline_separated_json</span><span class="p">(</span><span class="n">file_handle</span><span class="p">):</span> <span class="k">for</span> <span class="n">line</span> <span class="ow">in</span> <span class="n">file_handle</span><span class="p">:</span> <span class="k">yield</span> <span class="n">json</span><span class="o">.</span><span class="n">loads</span><span class="p">(</span><span class="n">line</span><span class="p">)</span> <span class="k">def</span><span class="w"> </span><span class="nf">read_users</span><span class="p">(</span><span class="n">file_handle</span><span class="p">):</span> <span class="k">for</span> <span class="n">document</span> <span class="ow">in</span> <span class="n">read_newline_separated_json</span><span class="p">(</span><span class="n">file_handle</span><span class="p">):</span> <span class="k">yield</span> <span class="n">User</span><span class="o">.</span><span class="n">from_json</span><span class="p">(</span><span class="n">document</span><span class="p">)</span> <span class="k">with</span> <span class="nb">open</span><span class="p">(</span><span class="n">path</span><span class="p">)</span> <span class="k">as</span> <span class="n">file_handle</span><span class="p">:</span> <span class="k">for</span> <span class="n">user</span> <span class="ow">in</span> <span class="n">read_users</span><span class="p">(</span><span class="n">file_handle</span><span class="p">):</span> <span class="o">...</span> </pre></div> </div> <p>This works well in simple cases; here it lets us avoid the “N+1 <code class="docutils literal notranslate"><span class="pre">with</span></code> blocks problem”. But unfortunately, it breaks down quickly when things get more complex. Consider if instead of reading from a file, our generator was reading from a streaming HTTP GET request – while handling redirects and authentication via OAUTH. Then we’d really want the sockets to be managed down inside our HTTP client library, not at the top level. Plus there are other cases where <code class="docutils literal notranslate"><span class="pre">finally</span></code> blocks embedded inside generators are important in their own right: db transaction management, emitting logging information during cleanup (one of the major motivating use cases for WSGI <code class="docutils literal notranslate"><span class="pre">close</span></code>), and so forth. So this is really a workaround for simple cases, not a general solution.</p> </section> <section id="more-complex-variants-of-a-iterclose"> <h3><a class="toc-backref" href="#more-complex-variants-of-a-iterclose" role="doc-backlink">More complex variants of __(a)iterclose__</a></h3> <p>The semantics of <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code> are somewhat inspired by <code class="docutils literal notranslate"><span class="pre">with</span></code> blocks, but context managers are more powerful: <code class="docutils literal notranslate"><span class="pre">__(a)exit__</span></code> can distinguish between a normal exit versus exception unwinding, and in the case of an exception it can examine the exception details and optionally suppress propagation. <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code> as proposed here does not have these powers, but one can imagine an alternative design where it did.</p> <p>However, this seems like unwarranted complexity: experience suggests that it’s common for iterables to have <code class="docutils literal notranslate"><span class="pre">close</span></code> methods, and even to have <code class="docutils literal notranslate"><span class="pre">__exit__</span></code> methods that call <code class="docutils literal notranslate"><span class="pre">self.close()</span></code>, but I’m not aware of any common cases that make use of <code class="docutils literal notranslate"><span class="pre">__exit__</span></code>’s full power. I also can’t think of any examples where this would be useful. And it seems unnecessarily confusing to allow iterators to affect flow control by swallowing exceptions – if you’re in a situation where you really want that, then you should probably use a real <code class="docutils literal notranslate"><span class="pre">with</span></code> block anyway.</p> </section> </section> <section id="specification"> <h2><a class="toc-backref" href="#specification" role="doc-backlink">Specification</a></h2> <p>This section describes where we want to eventually end up, though there are some backwards compatibility issues that mean we can’t jump directly here. A later section describes the transition plan.</p> <section id="guiding-principles"> <h3><a class="toc-backref" href="#guiding-principles" role="doc-backlink">Guiding principles</a></h3> <p>Generally, <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code> implementations should:</p> <ul class="simple"> <li>be idempotent,</li> <li>perform any cleanup that is appropriate on the assumption that the iterator will not be used again after <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code> is called. In particular, once <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code> has been called then calling <code class="docutils literal notranslate"><span class="pre">__(a)next__</span></code> produces undefined behavior.</li> </ul> <p>And generally, any code which starts iterating through an iterable with the intention of exhausting it, should arrange to make sure that <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code> is eventually called, whether or not the iterator is actually exhausted.</p> </section> <section id="changes-to-iteration"> <h3><a class="toc-backref" href="#changes-to-iteration" role="doc-backlink">Changes to iteration</a></h3> <p>The core proposal is the change in behavior of <code class="docutils literal notranslate"><span class="pre">for</span></code> loops. Given this Python code:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">for</span> <span class="n">VAR</span> <span class="ow">in</span> <span class="n">ITERABLE</span><span class="p">:</span> <span class="n">LOOP</span><span class="o">-</span><span class="n">BODY</span> <span class="k">else</span><span class="p">:</span> <span class="n">ELSE</span><span class="o">-</span><span class="n">BODY</span> </pre></div> </div> <p>we desugar to the equivalent of:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="n">_iter</span> <span class="o">=</span> <span class="nb">iter</span><span class="p">(</span><span class="n">ITERABLE</span><span class="p">)</span> <span class="n">_iterclose</span> <span class="o">=</span> <span class="nb">getattr</span><span class="p">(</span><span class="nb">type</span><span class="p">(</span><span class="n">_iter</span><span class="p">),</span> <span class="s2">"__iterclose__"</span><span class="p">,</span> <span class="k">lambda</span><span class="p">:</span> <span class="kc">None</span><span class="p">)</span> <span class="k">try</span><span class="p">:</span> <span class="n">traditional</span><span class="o">-</span><span class="k">for</span> <span class="n">VAR</span> <span class="ow">in</span> <span class="n">_iter</span><span class="p">:</span> <span class="n">LOOP</span><span class="o">-</span><span class="n">BODY</span> <span class="k">else</span><span class="p">:</span> <span class="n">ELSE</span><span class="o">-</span><span class="n">BODY</span> <span class="k">finally</span><span class="p">:</span> <span class="n">_iterclose</span><span class="p">(</span><span class="n">_iter</span><span class="p">)</span> </pre></div> </div> <p>where the “traditional-for statement” here is meant as a shorthand for the classic 3.5-and-earlier <code class="docutils literal notranslate"><span class="pre">for</span></code> loop semantics.</p> <p>Besides the top-level <code class="docutils literal notranslate"><span class="pre">for</span></code> statement, Python also contains several other places where iterators are consumed. For consistency, these should call <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> as well using semantics equivalent to the above. This includes:</p> <ul class="simple"> <li><code class="docutils literal notranslate"><span class="pre">for</span></code> loops inside comprehensions</li> <li><code class="docutils literal notranslate"><span class="pre">*</span></code> unpacking</li> <li>functions which accept and fully consume iterables, like <code class="docutils literal notranslate"><span class="pre">list(it)</span></code>, <code class="docutils literal notranslate"><span class="pre">tuple(it)</span></code>, <code class="docutils literal notranslate"><span class="pre">itertools.product(it1,</span> <span class="pre">it2,</span> <span class="pre">...)</span></code>, and others.</li> </ul> <p>In addition, a <code class="docutils literal notranslate"><span class="pre">yield</span> <span class="pre">from</span></code> that successfully exhausts the called generator should as a last step call its <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> method. (Rationale: <code class="docutils literal notranslate"><span class="pre">yield</span> <span class="pre">from</span></code> already links the lifetime of the calling generator to the called generator; if the calling generator is closed when half-way through a <code class="docutils literal notranslate"><span class="pre">yield</span> <span class="pre">from</span></code>, then this will already automatically close the called generator.)</p> </section> <section id="changes-to-async-iteration"> <h3><a class="toc-backref" href="#changes-to-async-iteration" role="doc-backlink">Changes to async iteration</a></h3> <p>We also make the analogous changes to async iteration constructs, except that the new slot is called <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>, and it’s an async method that gets <code class="docutils literal notranslate"><span class="pre">await</span></code>ed.</p> </section> <section id="modifications-to-basic-iterator-types"> <h3><a class="toc-backref" href="#modifications-to-basic-iterator-types" role="doc-backlink">Modifications to basic iterator types</a></h3> <p>Generator objects (including those created by generator comprehensions):</p> <ul class="simple"> <li><code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> calls <code class="docutils literal notranslate"><span class="pre">self.close()</span></code></li> <li><code class="docutils literal notranslate"><span class="pre">__del__</span></code> calls <code class="docutils literal notranslate"><span class="pre">self.close()</span></code> (same as now), and additionally issues a <code class="docutils literal notranslate"><span class="pre">ResourceWarning</span></code> if the generator wasn’t exhausted. This warning is hidden by default, but can be enabled for those who want to make sure they aren’t inadvertently relying on CPython-specific GC semantics.</li> </ul> <p>Async generator objects (including those created by async generator comprehensions):</p> <ul class="simple"> <li><code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> calls <code class="docutils literal notranslate"><span class="pre">self.aclose()</span></code></li> <li><code class="docutils literal notranslate"><span class="pre">__del__</span></code> issues a <code class="docutils literal notranslate"><span class="pre">RuntimeWarning</span></code> if <code class="docutils literal notranslate"><span class="pre">aclose</span></code> has not been called, since this probably indicates a latent bug, similar to the “coroutine never awaited” warning.</li> </ul> <p>QUESTION: should file objects implement <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> to close the file? On the one hand this would make this change more disruptive; on the other hand people really like writing <code class="docutils literal notranslate"><span class="pre">for</span> <span class="pre">line</span> <span class="pre">in</span> <span class="pre">open(...):</span> <span class="pre">...</span></code>, and if we get used to iterators taking care of their own cleanup then it might become very weird if files don’t.</p> </section> <section id="new-convenience-functions"> <h3><a class="toc-backref" href="#new-convenience-functions" role="doc-backlink">New convenience functions</a></h3> <p>The <code class="docutils literal notranslate"><span class="pre">operator</span></code> module gains two new functions, with semantics equivalent to the following:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">def</span><span class="w"> </span><span class="nf">iterclose</span><span class="p">(</span><span class="n">it</span><span class="p">):</span> <span class="k">if</span> <span class="ow">not</span> <span class="nb">isinstance</span><span class="p">(</span><span class="n">it</span><span class="p">,</span> <span class="n">collections</span><span class="o">.</span><span class="n">abc</span><span class="o">.</span><span class="n">Iterator</span><span class="p">):</span> <span class="k">raise</span> <span class="ne">TypeError</span><span class="p">(</span><span class="s2">"not an iterator"</span><span class="p">)</span> <span class="k">if</span> <span class="nb">hasattr</span><span class="p">(</span><span class="nb">type</span><span class="p">(</span><span class="n">it</span><span class="p">),</span> <span class="s2">"__iterclose__"</span><span class="p">):</span> <span class="nb">type</span><span class="p">(</span><span class="n">it</span><span class="p">)</span><span class="o">.</span><span class="n">__iterclose__</span><span class="p">(</span><span class="n">it</span><span class="p">)</span> <span class="k">async</span> <span class="k">def</span><span class="w"> </span><span class="nf">aiterclose</span><span class="p">(</span><span class="n">ait</span><span class="p">):</span> <span class="k">if</span> <span class="ow">not</span> <span class="nb">isinstance</span><span class="p">(</span><span class="n">it</span><span class="p">,</span> <span class="n">collections</span><span class="o">.</span><span class="n">abc</span><span class="o">.</span><span class="n">AsyncIterator</span><span class="p">):</span> <span class="k">raise</span> <span class="ne">TypeError</span><span class="p">(</span><span class="s2">"not an iterator"</span><span class="p">)</span> <span class="k">if</span> <span class="nb">hasattr</span><span class="p">(</span><span class="nb">type</span><span class="p">(</span><span class="n">ait</span><span class="p">),</span> <span class="s2">"__aiterclose__"</span><span class="p">):</span> <span class="k">await</span> <span class="nb">type</span><span class="p">(</span><span class="n">ait</span><span class="p">)</span><span class="o">.</span><span class="n">__aiterclose__</span><span class="p">(</span><span class="n">ait</span><span class="p">)</span> </pre></div> </div> <p>The <code class="docutils literal notranslate"><span class="pre">itertools</span></code> module gains a new iterator wrapper that can be used to selectively disable the new <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> behavior:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="c1"># QUESTION: I feel like there might be a better name for this one?</span> <span class="k">class</span><span class="w"> </span><span class="nc">preserve</span><span class="p">(</span><span class="n">iterable</span><span class="p">):</span> <span class="k">def</span><span class="w"> </span><span class="fm">__init__</span><span class="p">(</span><span class="bp">self</span><span class="p">,</span> <span class="n">iterable</span><span class="p">):</span> <span class="bp">self</span><span class="o">.</span><span class="n">_it</span> <span class="o">=</span> <span class="nb">iter</span><span class="p">(</span><span class="n">iterable</span><span class="p">)</span> <span class="k">def</span><span class="w"> </span><span class="fm">__iter__</span><span class="p">(</span><span class="bp">self</span><span class="p">):</span> <span class="k">return</span> <span class="bp">self</span> <span class="k">def</span><span class="w"> </span><span class="fm">__next__</span><span class="p">(</span><span class="bp">self</span><span class="p">):</span> <span class="k">return</span> <span class="nb">next</span><span class="p">(</span><span class="bp">self</span><span class="o">.</span><span class="n">_it</span><span class="p">)</span> <span class="k">def</span><span class="w"> </span><span class="nf">__iterclose__</span><span class="p">(</span><span class="bp">self</span><span class="p">):</span> <span class="c1"># Swallow __iterclose__ without passing it on</span> <span class="k">pass</span> </pre></div> </div> <p>Example usage (assuming that file objects implements <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code>):</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">with</span> <span class="nb">open</span><span class="p">(</span><span class="o">...</span><span class="p">)</span> <span class="k">as</span> <span class="n">handle</span><span class="p">:</span> <span class="c1"># Iterate through the same file twice:</span> <span class="k">for</span> <span class="n">line</span> <span class="ow">in</span> <span class="n">itertools</span><span class="o">.</span><span class="n">preserve</span><span class="p">(</span><span class="n">handle</span><span class="p">):</span> <span class="o">...</span> <span class="n">handle</span><span class="o">.</span><span class="n">seek</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span> <span class="k">for</span> <span class="n">line</span> <span class="ow">in</span> <span class="n">itertools</span><span class="o">.</span><span class="n">preserve</span><span class="p">(</span><span class="n">handle</span><span class="p">):</span> <span class="o">...</span> </pre></div> </div> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="nd">@contextlib</span><span class="o">.</span><span class="n">contextmanager</span> <span class="k">def</span><span class="w"> </span><span class="nf">iterclosing</span><span class="p">(</span><span class="n">iterable</span><span class="p">):</span> <span class="n">it</span> <span class="o">=</span> <span class="nb">iter</span><span class="p">(</span><span class="n">iterable</span><span class="p">)</span> <span class="k">try</span><span class="p">:</span> <span class="k">yield</span> <span class="n">preserve</span><span class="p">(</span><span class="n">it</span><span class="p">)</span> <span class="k">finally</span><span class="p">:</span> <span class="n">iterclose</span><span class="p">(</span><span class="n">it</span><span class="p">)</span> </pre></div> </div> </section> <section id="iterclose-implementations-for-iterator-wrappers"> <h3><a class="toc-backref" href="#iterclose-implementations-for-iterator-wrappers" role="doc-backlink">__iterclose__ implementations for iterator wrappers</a></h3> <p>Python ships a number of iterator types that act as wrappers around other iterators: <code class="docutils literal notranslate"><span class="pre">map</span></code>, <code class="docutils literal notranslate"><span class="pre">zip</span></code>, <code class="docutils literal notranslate"><span class="pre">itertools.accumulate</span></code>, <code class="docutils literal notranslate"><span class="pre">csv.reader</span></code>, and others. These iterators should define a <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> method which calls <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> in turn on their underlying iterators. For example, <code class="docutils literal notranslate"><span class="pre">map</span></code> could be implemented as:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="c1"># Helper function</span> <span class="n">map_chaining_exceptions</span><span class="p">(</span><span class="n">fn</span><span class="p">,</span> <span class="n">items</span><span class="p">,</span> <span class="n">last_exc</span><span class="o">=</span><span class="kc">None</span><span class="p">):</span> <span class="k">for</span> <span class="n">item</span> <span class="ow">in</span> <span class="n">items</span><span class="p">:</span> <span class="k">try</span><span class="p">:</span> <span class="n">fn</span><span class="p">(</span><span class="n">item</span><span class="p">)</span> <span class="k">except</span> <span class="ne">BaseException</span> <span class="k">as</span> <span class="n">new_exc</span><span class="p">:</span> <span class="k">if</span> <span class="n">new_exc</span><span class="o">.</span><span class="n">__context__</span> <span class="ow">is</span> <span class="kc">None</span><span class="p">:</span> <span class="n">new_exc</span><span class="o">.</span><span class="n">__context__</span> <span class="o">=</span> <span class="n">last_exc</span> <span class="n">last_exc</span> <span class="o">=</span> <span class="n">new_exc</span> <span class="k">if</span> <span class="n">last_exc</span> <span class="ow">is</span> <span class="ow">not</span> <span class="kc">None</span><span class="p">:</span> <span class="k">raise</span> <span class="n">last_exc</span> <span class="k">class</span><span class="w"> </span><span class="nc">map</span><span class="p">:</span> <span class="k">def</span><span class="w"> </span><span class="fm">__init__</span><span class="p">(</span><span class="bp">self</span><span class="p">,</span> <span class="n">fn</span><span class="p">,</span> <span class="o">*</span><span class="n">iterables</span><span class="p">):</span> <span class="bp">self</span><span class="o">.</span><span class="n">_fn</span> <span class="o">=</span> <span class="n">fn</span> <span class="bp">self</span><span class="o">.</span><span class="n">_iters</span> <span class="o">=</span> <span class="p">[</span><span class="nb">iter</span><span class="p">(</span><span class="n">iterable</span><span class="p">)</span> <span class="k">for</span> <span class="n">iterable</span> <span class="ow">in</span> <span class="n">iterables</span><span class="p">]</span> <span class="k">def</span><span class="w"> </span><span class="fm">__iter__</span><span class="p">(</span><span class="bp">self</span><span class="p">):</span> <span class="k">return</span> <span class="bp">self</span> <span class="k">def</span><span class="w"> </span><span class="fm">__next__</span><span class="p">(</span><span class="bp">self</span><span class="p">):</span> <span class="k">return</span> <span class="bp">self</span><span class="o">.</span><span class="n">_fn</span><span class="p">(</span><span class="o">*</span><span class="p">[</span><span class="nb">next</span><span class="p">(</span><span class="n">it</span><span class="p">)</span> <span class="k">for</span> <span class="n">it</span> <span class="ow">in</span> <span class="bp">self</span><span class="o">.</span><span class="n">_iters</span><span class="p">])</span> <span class="k">def</span><span class="w"> </span><span class="nf">__iterclose__</span><span class="p">(</span><span class="bp">self</span><span class="p">):</span> <span class="n">map_chaining_exceptions</span><span class="p">(</span><span class="n">operator</span><span class="o">.</span><span class="n">iterclose</span><span class="p">,</span> <span class="bp">self</span><span class="o">.</span><span class="n">_iters</span><span class="p">)</span> <span class="k">def</span><span class="w"> </span><span class="nf">chain</span><span class="p">(</span><span class="o">*</span><span class="n">iterables</span><span class="p">):</span> <span class="k">try</span><span class="p">:</span> <span class="k">while</span> <span class="n">iterables</span><span class="p">:</span> <span class="k">for</span> <span class="n">element</span> <span class="ow">in</span> <span class="n">iterables</span><span class="o">.</span><span class="n">pop</span><span class="p">(</span><span class="mi">0</span><span class="p">):</span> <span class="k">yield</span> <span class="n">element</span> <span class="k">except</span> <span class="ne">BaseException</span> <span class="k">as</span> <span class="n">e</span><span class="p">:</span> <span class="k">def</span><span class="w"> </span><span class="nf">iterclose_iterable</span><span class="p">(</span><span class="n">iterable</span><span class="p">):</span> <span class="n">operations</span><span class="o">.</span><span class="n">iterclose</span><span class="p">(</span><span class="nb">iter</span><span class="p">(</span><span class="n">iterable</span><span class="p">))</span> <span class="n">map_chaining_exceptions</span><span class="p">(</span><span class="n">iterclose_iterable</span><span class="p">,</span> <span class="n">iterables</span><span class="p">,</span> <span class="n">last_exc</span><span class="o">=</span><span class="n">e</span><span class="p">)</span> </pre></div> </div> <p>In some cases this requires some subtlety; for example, <a class="reference external" href="https://docs.python.org/3/library/itertools.html#itertools.tee">itertools.tee</a> should not call <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> on the underlying iterator until it has been called on <em>all</em> of the clone iterators.</p> </section> <section id="example-rationale"> <h3><a class="toc-backref" href="#example-rationale" role="doc-backlink">Example / Rationale</a></h3> <p>The payoff for all this is that we can now write straightforward code like:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">def</span><span class="w"> </span><span class="nf">read_newline_separated_json</span><span class="p">(</span><span class="n">path</span><span class="p">):</span> <span class="k">for</span> <span class="n">line</span> <span class="ow">in</span> <span class="nb">open</span><span class="p">(</span><span class="n">path</span><span class="p">):</span> <span class="k">yield</span> <span class="n">json</span><span class="o">.</span><span class="n">loads</span><span class="p">(</span><span class="n">line</span><span class="p">)</span> </pre></div> </div> <p>and be confident that the file will receive deterministic cleanup <em>without the end-user having to take any special effort</em>, even in complex cases. For example, consider this silly pipeline:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="nb">list</span><span class="p">(</span><span class="nb">map</span><span class="p">(</span><span class="k">lambda</span> <span class="n">key</span><span class="p">:</span> <span class="n">key</span><span class="o">.</span><span class="n">upper</span><span class="p">(),</span> <span class="n">doc</span><span class="p">[</span><span class="s2">"key"</span><span class="p">]</span> <span class="k">for</span> <span class="n">doc</span> <span class="ow">in</span> <span class="n">read_newline_separated_json</span><span class="p">(</span><span class="n">path</span><span class="p">)))</span> </pre></div> </div> <p>If our file contains a document where <code class="docutils literal notranslate"><span class="pre">doc["key"]</span></code> turns out to be an integer, then the following sequence of events will happen:</p> <ol class="arabic simple"> <li><code class="docutils literal notranslate"><span class="pre">key.upper()</span></code> raises an <code class="docutils literal notranslate"><span class="pre">AttributeError</span></code>, which propagates out of the <code class="docutils literal notranslate"><span class="pre">map</span></code> and triggers the implicit <code class="docutils literal notranslate"><span class="pre">finally</span></code> block inside <code class="docutils literal notranslate"><span class="pre">list</span></code>.</li> <li>The <code class="docutils literal notranslate"><span class="pre">finally</span></code> block in <code class="docutils literal notranslate"><span class="pre">list</span></code> calls <code class="docutils literal notranslate"><span class="pre">__iterclose__()</span></code> on the map object.</li> <li><code class="docutils literal notranslate"><span class="pre">map.__iterclose__()</span></code> calls <code class="docutils literal notranslate"><span class="pre">__iterclose__()</span></code> on the generator comprehension object.</li> <li>This injects a <code class="docutils literal notranslate"><span class="pre">GeneratorExit</span></code> exception into the generator comprehension body, which is currently suspended inside the comprehension’s <code class="docutils literal notranslate"><span class="pre">for</span></code> loop body.</li> <li>The exception propagates out of the <code class="docutils literal notranslate"><span class="pre">for</span></code> loop, triggering the <code class="docutils literal notranslate"><span class="pre">for</span></code> loop’s implicit <code class="docutils literal notranslate"><span class="pre">finally</span></code> block, which calls <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> on the generator object representing the call to <code class="docutils literal notranslate"><span class="pre">read_newline_separated_json</span></code>.</li> <li>This injects an inner <code class="docutils literal notranslate"><span class="pre">GeneratorExit</span></code> exception into the body of <code class="docutils literal notranslate"><span class="pre">read_newline_separated_json</span></code>, currently suspended at the <code class="docutils literal notranslate"><span class="pre">yield</span></code>.</li> <li>The inner <code class="docutils literal notranslate"><span class="pre">GeneratorExit</span></code> propagates out of the <code class="docutils literal notranslate"><span class="pre">for</span></code> loop, triggering the <code class="docutils literal notranslate"><span class="pre">for</span></code> loop’s implicit <code class="docutils literal notranslate"><span class="pre">finally</span></code> block, which calls <code class="docutils literal notranslate"><span class="pre">__iterclose__()</span></code> on the file object.</li> <li>The file object is closed.</li> <li>The inner <code class="docutils literal notranslate"><span class="pre">GeneratorExit</span></code> resumes propagating, hits the boundary of the generator function, and causes <code class="docutils literal notranslate"><span class="pre">read_newline_separated_json</span></code>’s <code class="docutils literal notranslate"><span class="pre">__iterclose__()</span></code> method to return successfully.</li> <li>Control returns to the generator comprehension body, and the outer <code class="docutils literal notranslate"><span class="pre">GeneratorExit</span></code> continues propagating, allowing the comprehension’s <code class="docutils literal notranslate"><span class="pre">__iterclose__()</span></code> to return successfully.</li> <li>The rest of the <code class="docutils literal notranslate"><span class="pre">__iterclose__()</span></code> calls unwind without incident, back into the body of <code class="docutils literal notranslate"><span class="pre">list</span></code>.</li> <li>The original <code class="docutils literal notranslate"><span class="pre">AttributeError</span></code> resumes propagating.</li> </ol> <p>(The details above assume that we implement <code class="docutils literal notranslate"><span class="pre">file.__iterclose__</span></code>; if not then add a <code class="docutils literal notranslate"><span class="pre">with</span></code> block to <code class="docutils literal notranslate"><span class="pre">read_newline_separated_json</span></code> and essentially the same logic goes through.)</p> <p>Of course, from the user’s point of view, this can be simplified down to just:</p> <p>1. <code class="docutils literal notranslate"><span class="pre">int.upper()</span></code> raises an <code class="docutils literal notranslate"><span class="pre">AttributeError</span></code> 1. The file object is closed. 2. The <code class="docutils literal notranslate"><span class="pre">AttributeError</span></code> propagates out of <code class="docutils literal notranslate"><span class="pre">list</span></code></p> <p>So we’ve accomplished our goal of making this “just work” without the user having to think about it.</p> </section> </section> <section id="transition-plan"> <h2><a class="toc-backref" href="#transition-plan" role="doc-backlink">Transition plan</a></h2> <p>While the majority of existing <code class="docutils literal notranslate"><span class="pre">for</span></code> loops will continue to produce identical results, the proposed changes will produce backwards-incompatible behavior in some cases. Example:</p> <div class="highlight-default notranslate"><div class="highlight"><pre><span></span><span class="k">def</span><span class="w"> </span><span class="nf">read_csv_with_header</span><span class="p">(</span><span class="n">lines_iterable</span><span class="p">):</span> <span class="n">lines_iterator</span> <span class="o">=</span> <span class="nb">iter</span><span class="p">(</span><span class="n">lines_iterable</span><span class="p">)</span> <span class="k">for</span> <span class="n">line</span> <span class="ow">in</span> <span class="n">lines_iterator</span><span class="p">:</span> <span class="n">column_names</span> <span class="o">=</span> <span class="n">line</span><span class="o">.</span><span class="n">strip</span><span class="p">()</span><span class="o">.</span><span class="n">split</span><span class="p">(</span><span class="s2">"</span><span class="se">\t</span><span class="s2">"</span><span class="p">)</span> <span class="k">break</span> <span class="k">for</span> <span class="n">line</span> <span class="ow">in</span> <span class="n">lines_iterator</span><span class="p">:</span> <span class="n">values</span> <span class="o">=</span> <span class="n">line</span><span class="o">.</span><span class="n">strip</span><span class="p">()</span><span class="o">.</span><span class="n">split</span><span class="p">(</span><span class="s2">"</span><span class="se">\t</span><span class="s2">"</span><span class="p">)</span> <span class="n">record</span> <span class="o">=</span> <span class="nb">dict</span><span class="p">(</span><span class="nb">zip</span><span class="p">(</span><span class="n">column_names</span><span class="p">,</span> <span class="n">values</span><span class="p">))</span> <span class="k">yield</span> <span class="n">record</span> </pre></div> </div> <p>This code used to be correct, but after this proposal is implemented will require an <code class="docutils literal notranslate"><span class="pre">itertools.preserve</span></code> call added to the first <code class="docutils literal notranslate"><span class="pre">for</span></code> loop.</p> <p>[QUESTION: currently, if you close a generator and then try to iterate over it then it just raises <code class="docutils literal notranslate"><span class="pre">Stop(Async)Iteration</span></code>, so code the passes the same generator object to multiple <code class="docutils literal notranslate"><span class="pre">for</span></code> loops but forgets to use <code class="docutils literal notranslate"><span class="pre">itertools.preserve</span></code> won’t see an obvious error – the second <code class="docutils literal notranslate"><span class="pre">for</span></code> loop will just exit immediately. Perhaps it would be better if iterating a closed generator raised a <code class="docutils literal notranslate"><span class="pre">RuntimeError</span></code>? Note that files don’t have this problem – attempting to iterate a closed file object already raises <code class="docutils literal notranslate"><span class="pre">ValueError</span></code>.]</p> <p>Specifically, the incompatibility happens when all of these factors come together:</p> <ul class="simple"> <li>The automatic calling of <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code> is enabled</li> <li>The iterable did not previously define <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code></li> <li>The iterable does now define <code class="docutils literal notranslate"><span class="pre">__(a)iterclose__</span></code></li> <li>The iterable is re-used after the <code class="docutils literal notranslate"><span class="pre">for</span></code> loop exits</li> </ul> <p>So the problem is how to manage this transition, and those are the levers we have to work with.</p> <p>First, observe that the only async iterables where we propose to add <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> are async generators, and there is currently no existing code using async generators (though this will start changing very soon), so the async changes do not produce any backwards incompatibilities. (There is existing code using async iterators, but using the new async for loop on an old async iterator is harmless, because old async iterators don’t have <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code>.) In addition, <a class="pep reference internal" href="../pep-0525/" title="PEP 525 – Asynchronous Generators">PEP 525</a> was accepted on a provisional basis, and async generators are by far the biggest beneficiary of this PEP’s proposed changes. Therefore, I think we should strongly consider enabling <code class="docutils literal notranslate"><span class="pre">__aiterclose__</span></code> for <code class="docutils literal notranslate"><span class="pre">async</span> <span class="pre">for</span></code> loops and async generators ASAP, ideally for 3.6.0 or 3.6.1.</p> <p>For the non-async world, things are harder, but here’s a potential transition path:</p> <p>In 3.7:</p> <p>Our goal is that existing unsafe code will start emitting warnings, while those who want to opt-in to the future can do that immediately:</p> <ul class="simple"> <li>We immediately add all the <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> methods described above.</li> <li>If <code class="docutils literal notranslate"><span class="pre">from</span> <span class="pre">__future__</span> <span class="pre">import</span> <span class="pre">iterclose</span></code> is in effect, then <code class="docutils literal notranslate"><span class="pre">for</span></code> loops and <code class="docutils literal notranslate"><span class="pre">*</span></code> unpacking call <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> as specified above.</li> <li>If the future is <em>not</em> enabled, then <code class="docutils literal notranslate"><span class="pre">for</span></code> loops and <code class="docutils literal notranslate"><span class="pre">*</span></code> unpacking do <em>not</em> call <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code>. But they do call some other method instead, e.g. <code class="docutils literal notranslate"><span class="pre">__iterclose_warning__</span></code>.</li> <li>Similarly, functions like <code class="docutils literal notranslate"><span class="pre">list</span></code> use stack introspection (!!) to check whether their direct caller has <code class="docutils literal notranslate"><span class="pre">__future__.iterclose</span></code> enabled, and use this to decide whether to call <code class="docutils literal notranslate"><span class="pre">__iterclose__</span></code> or <code class="docutils literal notranslate"><span class="pre">__iterclose_warning__</span></code>.</li> <li>For all the wrapper iterators, we also add <code class="docutils literal notranslate"><span class="pre">__iterclose_warning__</span></code> methods that forward to the <code class="docutils literal notranslate"><span class="pre">__iterclose_warning__</span></code> method of the underlying iterator or iterators.</li> <li>For generators (and files, if we decide to do that), <code class="docutils literal notranslate"><span class="pre">__iterclose_warning__</span></code> is defined to set an internal flag, and other methods on the object are modified to check for this flag. If they find the flag set, they issue a <code class="docutils literal notranslate"><span class="pre">PendingDeprecationWarning</span></code> to inform the user that in the future this sequence would have led to a use-after-close situation and the user should use <code class="docutils literal notranslate"><span class="pre">preserve()</span></code>.</li> </ul> <p>In 3.8:</p> <ul class="simple"> <li>Switch from <code class="docutils literal notranslate"><span class="pre">PendingDeprecationWarning</span></code> to <code class="docutils literal notranslate"><span class="pre">DeprecationWarning</span></code></li> </ul> <p>In 3.9:</p> <ul class="simple"> <li>Enable the <code class="docutils literal notranslate"><span class="pre">__future__</span></code> unconditionally and remove all the <code class="docutils literal notranslate"><span class="pre">__iterclose_warning__</span></code> stuff.</li> </ul> <p>I believe that this satisfies the normal requirements for this kind of transition – opt-in initially, with warnings targeted precisely to the cases that will be effected, and a long deprecation cycle.</p> <p>Probably the most controversial / risky part of this is the use of stack introspection to make the iterable-consuming functions sensitive to a <code class="docutils literal notranslate"><span class="pre">__future__</span></code> setting, though I haven’t thought of any situation where it would actually go wrong yet…</p> </section> <section id="acknowledgements"> <h2><a class="toc-backref" href="#acknowledgements" role="doc-backlink">Acknowledgements</a></h2> <p>Thanks to Yury Selivanov, Armin Rigo, and Carl Friedrich Bolz for helpful discussion on earlier versions of this idea.</p> </section> <section id="copyright"> <h2><a class="toc-backref" href="#copyright" role="doc-backlink">Copyright</a></h2> <p>This document has been placed in the public domain.</p> </section> </section> <hr class="docutils" /> <p>Source: <a class="reference external" href="https://github.com/python/peps/blob/main/peps/pep-0533.rst">https://github.com/python/peps/blob/main/peps/pep-0533.rst</a></p> <p>Last modified: <a class="reference external" href="https://github.com/python/peps/commits/main/peps/pep-0533.rst">2025-02-01 08:59:27 GMT</a></p> </article> <nav id="pep-sidebar"> <h2>Contents</h2> <ul> <li><a class="reference internal" href="#abstract">Abstract</a></li> <li><a class="reference internal" href="#note-on-timing">Note on timing</a></li> <li><a class="reference internal" href="#background-and-motivation">Background and motivation</a></li> <li><a class="reference internal" href="#alternatives">Alternatives</a><ul> <li><a class="reference internal" href="#pep-525-asyncgen-hooks">PEP 525 asyncgen hooks</a></li> <li><a class="reference internal" href="#always-inject-resources-and-do-all-cleanup-at-the-top-level">Always inject resources, and do all cleanup at the top level</a></li> <li><a class="reference internal" href="#more-complex-variants-of-a-iterclose">More complex variants of __(a)iterclose__</a></li> </ul> </li> <li><a class="reference internal" href="#specification">Specification</a><ul> <li><a class="reference internal" href="#guiding-principles">Guiding principles</a></li> <li><a class="reference internal" href="#changes-to-iteration">Changes to iteration</a></li> <li><a class="reference internal" href="#changes-to-async-iteration">Changes to async iteration</a></li> <li><a class="reference internal" href="#modifications-to-basic-iterator-types">Modifications to basic iterator types</a></li> <li><a class="reference internal" href="#new-convenience-functions">New convenience functions</a></li> <li><a class="reference internal" href="#iterclose-implementations-for-iterator-wrappers">__iterclose__ implementations for iterator wrappers</a></li> <li><a class="reference internal" href="#example-rationale">Example / Rationale</a></li> </ul> </li> <li><a class="reference internal" href="#transition-plan">Transition plan</a></li> <li><a class="reference internal" href="#acknowledgements">Acknowledgements</a></li> <li><a class="reference internal" href="#copyright">Copyright</a></li> </ul> <br> <a id="source" href="https://github.com/python/peps/blob/main/peps/pep-0533.rst">Page Source (GitHub)</a> </nav> </section> <script src="../_static/colour_scheme.js"></script> <script src="../_static/wrap_tables.js"></script> <script src="../_static/sticky_banner.js"></script> </body> </html>