python lxml.etree.Element.text_content 中文报错
<html>
<body>
<table id = "ta_1" border = "2">
<tr align = "center">
<td><img src="zhengchuang">"张三#279"</td>
<td>lisi</td>
</tr>
<tr align = "center">
<td>13:19 2011-11-17#279</td>
<td>baidu</td>
</tr>
</table>
<body>
</html>
我要解析上面HTML table每一行(tr align = "center")的第一列Text节点信息,但是用方法text_content()对于中文的文字节点会报错,英文的支持完好。请问该怎么处理
import lxml
import lxml.html as HTML
from lxml import etree
fd = open("E:\\test.html","r")
dom = HTML.document_fromstring(fd.read())
trNodesList=dom.xpath("//table[@id='ta_1']//tr")
count =0
for item in trNodesList:
if(item.get("align") == "center"):
for it in item.iter():
if(it.tag =="td"):
print it.text_content()#error place
break