New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
REPL shows continuation prompt (...) when comment or space entered #82854
Comments
This has always bothered me, and it shouldn't be necessary. This session: >>> #foo
...
>>> should really have been
It's confusing that the REPL prompt switches to "..." here, for no good reason. It should just treat the line as empty. Ditto if you enter a space (there's an invisible space after the first prompt): >>>
...
>>> |
As a person without much experience, it sounded like a simple enough task, but having dug a bit, I found it quite complicated. It seems to me that the interpreter loop (in the standard REPL, that you get when you start ./python, blocks for input somewhere inside a massive function called 'parsetok' (in Parser/parsetok.c). Now, I could maybe investigate further, to have it return to the interpreter loop if it reads a comment (or empty line), but I'm afraid to mess up something. From my understanding, there aren't that many other choices, because parsetok() doesn't return before you finish the statement (in other words, it does not return if you type a comment line or a blank line - it instead waits for more input, as indicated by the '... '). Am I way off in concluding that this would be a change to the parser? |
Yes, that's likely where the change should be made. I think if the *first* token encountered is either NL or COMMENT the parse should be abandoned by the tokenizer. |
Entering 'pass' or a completely blank line results in a new primary prompt, at least on Windows. The Windows REPL otherwise prints ... even for effectively blank lines. IDLE usually prints a new prompt for effectively blank lines.
I agree that these look better. This behavior comes from code.InteractiveInterpreter and ultimately codeop. def _maybe_compile(compiler, source, filename, symbol):
# Check for source consisting of only blank lines and comments
for line in source.split("\n"):
line = line.strip()
if line and line[0] != '#':
break # Leave it alone
else:
if symbol != "eval":
source = "pass" # Replace it with a 'pass' statement As noted above, 'pass\n' is treated the same as '\n' The first line above originally had a space, but IDLE appears to strip trailing whitespace also, even outside of comments. (For an ending '\ ', this prevents SyntaxError, but maybe this is a bad lesson for beginners.) However, I did find a case with an unnecessary continuation line. >>> # a
>>> This puzzles me, as it should be treated exactly the same as without the space after '#'. ast.dump(ast.parse(' # a\n', '', 'single')) gives the same result, 'Module(body=[], type_ignores=[])', as without. |
Regarding the IDLE mystery, *if* there's a difference between how it treats " # a" and "# a", this must be due to some part of the code that's invoked before _maybe_compile() is called, right? But that's immaterial to this bug -- I'm only complaining about the "builtin" REPL. |
Fix corner case bugs in IDLE would definitely be a separate issue. But is the 'fix' in _maybe_compile at all applicable to the REPL? Or might a parser change REPL fix make the code in _maybe_compile unneeded? |
I don't know. Most of the contortions in code.py codeop.py are meant to emulate what the parser does without help in the REPL: keep asking for input until one of the following happens:
The bug here is that apparently the above conditions aren't quite enough, and it seems we need to add:
The reason that IDLE has to reimplement similar logic is that the parser (actually, the tokenizer) isn't written as a coroutine to which you send lines you read -- it's written as a blocking function that you pass a file and the function will attempt to read lines from the file. The function returns a parse tree or raise an error. That model doesn't work in IDLE, which needs to stay reactive while the shell window is in the middle of a statement. I think we'll find that the bug is *very* old. IIRC the initial releases of Python didn't have the rule that indentation is ignored between matching parentheses/brackets/braces. In those days the tokenizer also didn't gracefully skip blank lines, or lines with comments that weren't aligned with the current indentation level. So then checking for empty lines was good enough. |
I'd like to backport this to 3.8.1 at least. Are people interested in getting it backported to earlier versions? |
So 3.8.1 got backported by Miss Islington. Do we want this in earlier releases? |
I would prefer that. I think treating '\n' and ' \n' differently is a bit of a bug. And the fix pretty well matches code/codeop behavior. I have so far not imagined how it could break code. But you could let Ned Deily decide, before the next rc, if you want. I am neutral on 2.7. |
Ned agreed, it's merged into 3.7, so let's close. |
(And I'm giving up on 3.6 and 2.7 as these are close to their end of life.) |
Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.
Show more details
GitHub fields:
bugs.python.org fields:
The text was updated successfully, but these errors were encountered: