python xml解析

文章由Byrx.net分享于2019-03-22 10:03:50评论（568）

python xml解析

[html]
first.xml
<info>
<person >
<id>1</id>
<name>fsy</name>
<age >24</age>
</person>
<person>
<id>2</id>
<name>jianjian</name>
<age>24</age>
</person>
<count id ='1'>1000</count>
</info>

first.xml
<info>
<person >
<id>1</id>
<name>fsy</name>
<age >24</age>
</person>
<person>
<id>2</id>
<name>jianjian</name>
<age>24</age>
</person>
<count id ='1'>1000</count>
</info>

[python] view plaincopyprint?
from xml.etree import ElementTree as etree

from xml.etree import ElementTree as etree

读入

[python]
def read_xml(file):
# parse()函数会返回一个能代表整篇文档的对象。这不是根元素。要获得根元素的引用可以调用getroot()方法。
tree = etree.parse(file)
root = tree.getroot()
return root

def read_xml(file):
# parse()函数会返回一个能代表整篇文档的对象。这不是根元素。要获得根元素的引用可以调用getroot()方法。
tree = etree.parse(file)
root = tree.getroot()
return root

得到信息
[python]
def print_node(node):
'''''打印结点基本信息'''
print("node.tag:%s" % node.tag)
print("node.attrib:%s"%node.attrib)
print( "node.text:%s" % node.text)

def print_node(node):
'''打印结点基本信息'''
print("node.tag:%s" % node.tag)
print("node.attrib:%s"%node.attrib)
print( "node.text:%s" % node.text)

搜索：
[python]
find_all
>>> root = read_xml ('first.xml')
>>> res = root.findall("person")
[<Element 'person' at 0x00000000033388B8>, <Element 'person' at 0x0000000003413D68>]

注意：findall只查询直接的子节点
>>> r1 = root.findall("id")
>>> r1
[]
>>> r =tree.findall(".//id")
>>> for e in r:
print( e,e.text)

<Element 'id' at 0x00000000034279F8> 1
<Element 'id' at 0x0000000003427B38> 2

find_all
>>> root = read_xml ('first.xml')
>>> res = root.findall("person")
[<Element 'person' at 0x00000000033388B8>, <Element 'person' at 0x0000000003413D68>]

注意：findall只查询直接的子节点
>>> r1 = root.findall("id")
>>> r1
[]
>>> r =tree.findall(".//id")
>>> for e in r:
print( e,e.text)

<Element 'id' at 0x00000000034279F8> 1
<Element 'id' at 0x0000000003427B38> 2

find:

[python]
#find()方法用来返回第一个匹配到的元素。当我们认为只会有一个匹配，或者有多个匹配但我们只关心第一个的时候，这个方法是很有用的。
>>> res[0].find("id")
<Element 'id' at 0x0000000003413CC8>
>>> print_node(res[0].find("id"))
node.tag:id
node.attrib:{}
node.text:1

#find()方法用来返回第一个匹配到的元素。当我们认为只会有一个匹配，或者有多个匹配但我们只关心第一个的时候，这个方法是很有用的。
>>> res[0].find("id")
<Element 'id' at 0x0000000003413CC8>
>>> print_node(res[0].find("id"))
node.tag:id
node.attrib:{}
node.text:1

find查找失败：
使用find要注意在布尔上下文中，如果ElementTree元素对象不包含子元素，其值则会被认为是False（即如果len(element)等于0）。这就意味着if element.find('...')并非在测试是否find()方法找到了匹配项；这条语句是在测试匹配到的元素是否包含子元素。想要测试find()方法是否返回了一个元素，则需使用if element.find('...') is not None。
[python]
>>> bk = res[0].find("no")
>>> bk
>>> type(bk)
<class 'NoneType'>
>>> res[0].find("id")
<Element 'id' at 0x0000000003413CC8>
>>> if res[0].find("id"):
        print("find")
    else:
        print("not find")
not find
>>> if res[0].find("id") is not None:
        print("find")
    else:
        print("not find")
find

>>> bk = res[0].find("no")
>>> bk
>>> type(bk)
<class 'NoneType'>
>>> res[0].find("id")
<Element 'id' at 0x0000000003413CC8>
>>> if res[0].find("id"):
        print("find")
    else:
        print("not find")
not find
>>> if res[0].find("id") is not None:
        print("find")
    else:
        print("not find")
find

热门文章：

python xml解析

python xml解析

相关内容

最新python教程

python~HOT