Title: textwrap should treat Unicode em-dash like ASCII em-dash
Created on 2017-06-15 19:09 by jonathaneunice, last changed 2017-06-15 20:10 by jonathaneunice.

Author: Jonathan Eunice (jonathaneunice) Date: 2017-06-15 19:09
The textwrap module goes to great lengths to "do the right thing" when it finds the ASCII simulation of an em-dash (two or more consecutive hyphens), but it does nothing to recognize and similarly treat true (Unicode) em-dashes (aka '\N{EM DASH}', '\u2014', or U+2014). Real em-dashes should get at least as good a treatment as simulated em-dashes.
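A minimal sketch of the difference, using only the public `textwrap.wrap` function. The sample strings and the width of 5 are made up for illustration. The ASCII form is treated as a break opportunity around the double hyphen. The Unicode form's output depends on whether the running Python has this support, so it is printed without an expected value.

```python
import textwrap

# Illustrative inputs (not from the report): the same text with a
# simulated em-dash and with a real U+2014 em-dash.
ascii_text = "one--two"
unicode_text = "one\N{EM DASH}two"

# A narrow width forces textwrap to choose a break point.
ascii_lines = textwrap.wrap(ascii_text, width=5)
unicode_lines = textwrap.wrap(unicode_text, width=5)

print(ascii_lines)    # ['one--', 'two']
print(unicode_lines)  # result varies with whether U+2014 is recognized
```

Where U+2014 is not recognized, "one\N{EM DASH}two" is a single unbreakable word, so the wrapper falls back to its long-word handling instead of breaking at the dash.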
Author: R. David Murray (r.david.murray) Date: 2017-06-15 19:35
This seems sensible to me (I haven't looked at the PR; I'm talking about adding the support in general). When textwrap was written, Python was pretty ASCII-oriented, so it is not too much of a surprise that Unicode em-dashes were not supported.
Author: Jonathan Eunice (jonathaneunice) Date: 2017-06-15 20:10
Agreed. It makes great sense that textwrap started out highly ASCII-centric. But in the Unicode-friendly era of Python 3, an ASCII bias isn't where we should leave things.
