Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problems with recursive automatic exception chaining #63061

Open
nikratio mannequin opened this issue Aug 28, 2013 · 19 comments
Open

Problems with recursive automatic exception chaining #63061

nikratio mannequin opened this issue Aug 28, 2013 · 19 comments
Labels
3.11 only security fixes interpreter-core (Objects, Python, Grammar, and Parser dirs) type-bug An unexpected behavior, bug, or error

Comments

@nikratio
Copy link
Mannequin

nikratio mannequin commented Aug 28, 2013

BPO 18861
Nosy @ncoghlan, @hniksic, @benjaminp, @njsmith, @cjerdonek, @vadmium, @iritkatriel

Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.

Show more details

GitHub fields:

assignee = None
closed_at = None
created_at = <Date 2013-08-28.02:42:07.875>
labels = ['interpreter-core', 'type-bug', '3.11']
title = 'Problems with recursive automatic exception chaining'
updated_at = <Date 2021-12-11.15:36:50.787>
user = 'https://bugs.python.org/nikratio'

bugs.python.org fields:

activity = <Date 2021-12-11.15:36:50.787>
actor = 'iritkatriel'
assignee = 'none'
closed = False
closed_date = None
closer = None
components = ['Interpreter Core']
creation = <Date 2013-08-28.02:42:07.875>
creator = 'nikratio'
dependencies = []
files = []
hgrepos = []
issue_num = 18861
keywords = []
message_count = 19.0
messages = ['196344', '196345', '201410', '201417', '201460', '201482', '202421', '202523', '202525', '202562', '202564', '202579', '202754', '202759', '203015', '203017', '203370', '390455', '408310']
nosy_count = 9.0
nosy_names = ['ncoghlan', 'hniksic', 'benjamin.peterson', 'njs', 'nikratio', 'chris.jerdonek', 'alonho', 'martin.panter', 'iritkatriel']
pr_nums = []
priority = 'normal'
resolution = None
stage = None
status = 'open'
superseder = None
type = 'behavior'
url = 'https://bugs.python.org/issue18861'
versions = ['Python 3.11']

@nikratio
Copy link
Mannequin Author

nikratio mannequin commented Aug 28, 2013

Consider this:

$ python3 test_exc.py 
Traceback (most recent call last):
  File "test_exc.py", line 14, in <module>
    fail1()
  File "test_exc.py", line 11, in fail1
    fail2()
  File "test_exc.py", line 5, in fail2
    raise RuntimeError('Third') from None
RuntimeError: Third
$ cat test_exc.py 
def fail2():
    try:
        raise RuntimeError('Second')
    except RuntimeError:
        raise RuntimeError('Third') from None
def fail1():
    try:
        raise RuntimeError('First')
    except:
        fail2()
        raise

fail1()

Any exception raised in fail2() is the immediate consequence of the 'First' exception should thus be chained to the 'First' exception.

However, if somewhere in the call stack under fail2() an exception is caught and re-raised from None (to convert between exception types), this also results in a loss of the chain to the initial exception.

The correct stacktrace (in my opinion) would be:
Traceback (most recent call last):
  File "test_exc.py", line 9, in fail1
    raise RuntimeError('First')
RuntimeError: First

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "test_exc.py", line 14, in <module>
    fail1()
  File "test_exc.py", line 11, in fail1
    fail2()
  File "test_exc.py", line 5, in fail2
    raise RuntimeError('Third')
RuntimeError: Third

@nikratio nikratio mannequin added interpreter-core (Objects, Python, Grammar, and Parser dirs) type-bug An unexpected behavior, bug, or error labels Aug 28, 2013
@nikratio
Copy link
Mannequin Author

nikratio mannequin commented Aug 28, 2013

The second paragraph should of course read "...consequence of the 'First' exception *and* should thus be chained..."

@nikratio
Copy link
Mannequin Author

nikratio mannequin commented Oct 27, 2013

*ping*

No one any comments on this at all?

@benjaminp
Copy link
Contributor

raise ... from None explictly silences the printing of exception context, so I don't see the problem.

@ncoghlan
Copy link
Contributor

Nothing is lost either way, it just isn't displayed by default.

While I can see the appeal of treating an "outer" context differently from an "inner" one, I don't see any way for the exception context suppression machinery to tell the difference.

The traceback *display* machinery might be able to figure it out by looking at the traceback details, but such a search could be quite expensive.

@nikratio
Copy link
Mannequin Author

nikratio mannequin commented Oct 27, 2013

Benjamin: I think that in most cases the intention of a ".. from None" is to disable printing of a stack trace for a specific exception when it occurs in the try .. except block. (In the example, that would be suppressing the stacktrace for the "Second" exception, because it is explicity converted to "Third"). I do not think that this ought to effect the printing of exceptions that occured previously and higher-up in the call chain.

In other words, I think it is natural to expect that

def foo():
   try:
      do_stuff
   except Something:
      raise SomethingElse from None

disables printing the stack trace for Something exceptions when they occur in the try..except block -- but not to hide the exception context for exceptions that occured before foo() was even called.

@ncoghlan
Copy link
Contributor

ncoghlan commented Nov 8, 2013

I actually think Nikratio is right about the way this *should* work (intuitively).

I'm just not sure it's feasible to *implement* those semantics in CPython without significant changes to the way exception handling works - I don't believe the exception chaining code can currently tell the difference between the cases:

# No context on Exception3 is exactly what we want
try:
try:
raise Exception1
except Exception1:
raise Exception2
except Exception2 as exc
raise Exception3 from Exception2

# Not seeing Exception1 as the context for Exception3 is surprising!
try:
raise Exception1
except Exception1:
try:
raise Exception2
except Exception2 as exc
raise Exception3 from Exception2

In a certain sense, exceptions need to be able to have *multiple* contexts to handle this case properly without losing data. Frames would need to tag exceptions appropriately with the context details as an unhandled exception passed through a frame that was currently running an exception handler.

So even though it doesn't require new syntax, I think it *does* require a PEP if we're going to change this (and we still haven't fully dealt with the consequence of the last one - the display options for tracebacks are still a bit limited)

@ncoghlan
Copy link
Contributor

It may not immediately look like it, but I think bpo-17828 offers an example of a related problem. In that issue, I didn't want to *change* the exception raised, I wanted to annotate it to say "Hey, something I called raised an exception, here's some relevant local state to help you figure out what is going on" (in that case, the information to be added is the specific codec being invoked and whether it is an encoding or decoding operation).

Setting the context is somewhat similar - you don't just want to know which specific exception happened to be active when the eventually caught exception was first raised - you also want to know which already active exception handlers you passed through while unwinding the stack.

So really, what may be desirable here is almost an "annotated traceback", where the interpreter can decide to hang additional information off the frame currently being unwound in the exceptions traceback, while leaving the exception itself alone.

That's definitely PEP territory, but there are two distinct problems with the current exception chaining mechanism to help drive a prospective design for 3.5 (the status quo doesn't bother me enough for me to work on it myself, but if you're interested Nikolaus...)

@ncoghlan
Copy link
Contributor

Unrelated to my previous comment, I'm also wondering if this may actually represent a behavioural difference between contextlib.ExitStack and the interpreter's own exception handling machinery.

ExitStack uses a recursive->iterative transformation for its stack unwinding (see bpo-14963), and it needs to do a bit of fiddling to get the context right (see bpo-19092).

While the contextlib test suite goes to great lengths to try to ensure the semantics of normal stack unwinding are preserved, that definitely doesn't currently cover this case, and I'm thinking the way it works may actually be more like the behaviour Nikolaus expected in the original post (i.e. setting the context as the stack is unwound rather than when the replacement exception is raised).

@nikratio
Copy link
Mannequin Author

nikratio mannequin commented Nov 10, 2013

Hi Nick,

I am interested in working on this, but I have never worked on the C parts of cpython before. Do you think this is a feasible project to start with? To me it looks a bit daunting, I'd certainly need some mentoring to even know where to start with this.

@benjaminp
Copy link
Contributor

The first thing to do is to carefully specificy what the behavior should be.

2013/11/10 Nikolaus Rath <report@bugs.python.org>:

Nikolaus Rath added the comment:

Hi Nick,

I am interested in working on this, but I have never worked on the C parts of cpython before. Do you think this is a feasible project to start with? To me it looks a bit daunting, I'd certainly need some mentoring to even know where to start with this.

----------


Python tracker <report@bugs.python.org>
<http://bugs.python.org/issue18861\>


@ncoghlan
Copy link
Contributor

Yes, I suggest using ExitStack to figure out the behaviour we *want* first,
before diving into the messy practical details of how to make that a
reality in CPython. We somehow have to get the state of the exception
object and its traceback to represent an appropriate stack *tree*, rather
than the traditionally assumed linear stack.

It also occurred to me there's another potentially related issue: frame
hiding, where we want to avoid showing infrastructure code in end user
tracebacks. importlib currently has a very hacky version of that. The
Jinja2 template library uses a different approach.

The reason I bring these other problems up is because I think they
illustrate a theme around altering how a traceback is displayed that may be
amenable to a common solution (preferably one that is contextlib and
asyncio friendly).

@ncoghlan
Copy link
Contributor

So, I've been pondering the idea of traceback/frame annotations and exception trees a bit.

And what I'm wondering is if it may make sense to have the ability to annotate *frames* at runtime, and these annotations can be qualified by module names. So, for example, you might be able to write things like:

    sys.annotate_frame("codecs", "encoding", the_encoding)
    sys.annotate_frame("codecs", "decoding", the_encoding)
    sys.annotate_frame("traceback", "hide", True)
    sys.annotate_frame("traceback", "context", exc)

And then the traceback display machinery would be updated to do something useful with the annotations.

I'm not sure how ExitStack would cope with that (or other code that fakes tracebacks) but it's something to ponder.

@ncoghlan
Copy link
Contributor

Walter suggested annotating the exceptions directly might work: https://mail.python.org/pipermail/python-dev/2013-November/130155.html

However, that potentially runs into nesting problems (e.g. the idna codec invokes the ascii codec), although Walter suggested a simpler mechanism that just involved appending to a list.

@ncoghlan
Copy link
Contributor

Adjusting the target version, since it isn't feasible to move away from the current behaviour any earlier than Python 3.5.

@ncoghlan
Copy link
Contributor

Something else that this might make simpler: the unittest.TestCase.subTest API is currently kind of ugly. If the subtest details could be stored as a frame annotation instead...

@ncoghlan
Copy link
Contributor

Actually, this won't the subTest example because that actually *suppresses* the errors, and reports them later. Annotations only help when the exception is allowed to escape.

@iritkatriel
Copy link
Member

For the issue of which contexts need to be suppressed when __suppress_context__ is true, I think we don't need to keep the whole tree of exceptions. It might be enough to save the dynamic "nesting depth of the try-excepts". Here is a solution with two integers per exception, plus one on the thread state. Would it work?

We add an int field called except_depth to the thread state (incremented in ceval just before the try of a try-except and decremented after the last except: block).

Whenever an exception is caught, this value is copied to it (to a new int we put on BaseException, call it depth).

The __suppress_context__ field we change from bool to int (alternatively: add a new field and leave this one alone). Instead of setting it to True when there is __cause__, we set it to the except-depth of the raise...from that raised it.

Then the code displaying the traceback can use the depths information to determine which contexts to suppress (the ones up to and not including the first one with depth < the suppress_context value).

Returning to Nick's examples from 2013-11-08 14:34:

# No context on Exception3 is exactly what we want
try:
try:
raise Exception1
except Exception1:
raise Exception2
except Exception2 as exc
raise Exception3 from Exception2

Exception1 - depth = 2
Exception2 - depth = 1
Exception3 - depth = ? suppress_context = 1
(suppress both Exception1 and Exception2, they have depths >= 1)

# Not seeing Exception1 as the context for Exception3 is surprising!
try:
raise Exception1
except Exception1:
try:
raise Exception2
except Exception2 as exc
raise Exception3 from Exception2

Exception1 - depth = 1
Exception2 - depth = 2
Exception3 - depth = ? suppress_context = 2
(suppress only Exception2, Exception1 is too shallow)

To consider more elaborate nesting:

try:
try:
raise Exception0
except T: # Does not catch Exception0
pass
except Exception1:
try:
raise Exception2
except Exception2 as exc
raise Exception3 from Exception2

Whatever happens in the topmost try-block, if Exception0 is the context of anything raised inside the "except Exception1" block, then its depth is 1 because it was caught by this except. So it won't be suppressed by anything raised at a deeper level.

@iritkatriel
Copy link
Member

I think this problem is actually simpler than what we've been discussing.

First, note that by-and-large our current system works:

>>> try:
...   raise VE(1)
... except VE as e1:
...   try:
...     raise VE(2)
...   except VE as e2:
...     raise VE(3) from e2
... 
Traceback (most recent call last):
  File "<stdin>", line 2, in <module>
ValueError: 1

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "<stdin>", line 5, in <module>
ValueError: 2

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "<stdin>", line 7, in <module>
ValueError: 3

Here VE(2) is the cause of VE(3) and VE(1) is the context of VE(2), so VE(1) is not hidden by the fact that VE(3) has a cause.

The reason that Nick's example didn't work is because he is doing

    raise VE(3) from VE(2)

i.e., creating a new exception VE(2) that doesn't have VE(1) as a context. I'm not sure there is a use case for this, so I don't think we need to worry about it.

The case of None does need fixing. We use None to indicate that there was no cause (while suppressing context). But None can't have a context, so VE(1) gets lost. We could, instead of None, use a NoException(Exception) class. This exception would be chained with the current exc_info() as context so that it works like VE(1) as above, and the traceback printing logic will know that it needs to be omitted from the output.

This proposal involves no changes to the exception propagation mechanism of the interpreter. It would require these changes:

  1. Define the new exception type
  2. in do_raise, in the case of raise-from the None case changes
  3. traceback display code needs to be updated to omit NoExceptions

@iritkatriel iritkatriel added the 3.11 only security fixes label Dec 11, 2021
@ezio-melotti ezio-melotti transferred this issue from another repository Apr 10, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3.11 only security fixes interpreter-core (Objects, Python, Grammar, and Parser dirs) type-bug An unexpected behavior, bug, or error
Projects
None yet
Development

No branches or pull requests

3 participants