Message388929
Python's builtin `email.parser.BytesParser` could not properly parse the message when the bytes starts with BOM.
Not 100% ensured- but this issue seems cause by that `FeedParser._parsegen` could not match any of the header line after the data is decoded.
Steps to reproduce:
1. get email sample. any from https://github.com/python/cpython/tree/master/Lib/test/test_email/data. I use msg_01.txt in following code
2. re-encoded the mail sample to some encoding with BOM
3. use `email.parser.BytesParser` to parse it
```py
import email
with open('msg_01.txt', 'rb') as fp:
msg = email.parser.BytesParser().parse(fp)
print(msg.get('Message-ID'))
```
Expect output `<15090.61304.110929.45684@aaa.zzz.org>`, got `None` |
|
Date |
User |
Action |
Args |
2021-03-17 15:45:56 | tzing | set | recipients:
+ tzing |
2021-03-17 15:45:56 | tzing | set | messageid: <1615995956.48.0.229285559191.issue43530@roundup.psfhosted.org> |
2021-03-17 15:45:56 | tzing | link | issue43530 messages |
2021-03-17 15:45:56 | tzing | create | |
|