classification
Title: ElementTree won't parse comments before root element
Type: enhancement Stage:
Components: XML Versions: Python 3.6
process
Status: open Resolution:
Dependencies: Superseder:
Assigned To: Nosy List: amaury.forgeotdarc, celinecyc, effbot, flox, poke, scoder
Priority: normal Keywords:

Created on 2019-11-08 08:10 by celinecyc, last changed 2019-11-08 08:10 by celinecyc.

Messages (1)
msg356229 - (view) Author: celinecyc (celinecyc) Date: 2019-11-08 08:10
issue8277
It couldn't work for those comments before the root element. 
It will raise an error that "xml.etree.ElementTree.ParseError: multiple elements on top level". 

Example:

test.xml
--------
<?xml version="1.0" encoding="utf-8"?>
<!-- comments here doesn't work -->
<root node>
  <nodeA />
  <!-- comments here is ok -->
  <nodeB />
</root node>

test.py
-------
from xml.etree import ElementTree
class MyTreeBuilder(ElementTree.TreeBuilder):
   def comment(self, data):
       self.start(ElementTree.Comment, {})
       self.data(data)
       self.end(ElementTree.Comment)
with open('c:/temp/t.xml', 'r') as f:
   xml = ElementTree.parse(
       f, parser=ElementTree.XMLParser(target=MyTreeBuilder()))
ElementTree.dump(xml)
History
Date User Action Args
2019-11-08 08:10:53celinecyccreate