晚安。我编写了这个类来解析 XML:
from xml.dom.minidom import parse
from xml.dom import minidom
class Book(object):
def __init__(self, filename):
self.filename = filename
def parse(self):
with open(self.filename, "r", encoding="UTF-8") as document:
document = parse(document)
print(document.version, document.encoding, document.standalone)
paragraphs = document.getElementsByTagName("section")
for paragraph in paragraphs:
text_nodes = [
node.nodeValue for node in paragraph.childNodes
if node.nodeType == minidom.Node.TEXT_NODE
]
text = "".join(text_nodes)
return text
当我输出结果时,有一个空行。 fb2文件本身是正确的,可以用程序打开。但它对我不起作用。这是为什么?