天天看點

python elementtree亂碼_python – 用ElementTree寫入帶有utf-8資料的xml utf-8檔案

我試圖使用ElementTree使用utf-8編碼的資料編寫一個xml檔案,如下所示:

#!/usr/bin/python

# -*- coding: utf-8 -*-

import xml.etree.ElementTree as ET

import codecs

testtag = ET.Element('unicodetag')

testtag.text = u'Töreboda' #The o is really ö (o with two dots over). No idea why SO dont display this

expfile = codecs.open('testunicode.xml',"w","utf-8-sig")

ET.ElementTree(testtag).write(expfile,encoding="UTF-8",xml_declaration=True)

expfile.close()

這樣會産生錯誤

Traceback (most recent call last):

File "unicodetest.py",line 10,in

ET.ElementTree(testtag).write(expfile,xml_declaration=True)

File "/usr/lib/python2.7/xml/etree/ElementTree.py",line 815,in write

serialize(write,self._root,encoding,qnames,namespaces)

File "/usr/lib/python2.7/xml/etree/ElementTree.py",line 932,in _serialize_xml

write(_escape_cdata(text,encoding))

File "/usr/lib/python2.7/codecs.py",line 691,in write

return self.writer.write(data)

File "/usr/lib/python2.7/codecs.py",line 351,in write

data,consumed = self.encode(object,self.errors)

UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 1: ordinal not in range(128)

使用“us-ascii”編碼代替工作正常,但不保留資料中的unicode字元.發生什麼事?