如何获得所有元素的路径LXML与属性

问题描述:

我有以下代码:如何获得所有元素的路径LXML与属性

tree = etree.ElementTree(new_xml) 
for e in new_xml.iter(): 
    print tree.getpath(e), e.text 

这会给我类似如下:

/Item/Purchases 

/Item/Purchases/Purchase[1] 
/Item/Purchases/Purchase[1]/URL http://tvgo.xfinity.com/watch/x/6091165185315991112/movies 
/Item/Purchases/Purchase[1]/Rating R 

/Item/Purchases/Purchase[2] 
/Item/Purchases/Purchase[2]/URL http://tvgo.xfinity.com/watch/x/6091165185315991112/movies 
/Item/Purchases/Purchase[2]/Rating R 

但是,我需要得到不是列表元素的路径,而是属性的路径。这里是xml的样子:

<Item> 
    <Purchases> 
    <Purchase Country="US"> 
     <URL>http://tvgo.xfinity.com/watch/x/6091165US</URL> 
     <Rating>R</Rating> 
    </Purchase> 
    <Purchase Country="CA"> 
     <URL>http://tvgo.xfinity.com/watch/x/6091165CA</URL> 
     <Rating>R</Rating> 
    </Purchase> 
</Item> 

我该如何得到下面的路径呢?

/Item/Purchases 

/Item/Purchases/Purchase[@Country="US"] 
/Item/Purchases/Purchase[@Country="US"]/URL http://tvgo.xfinity.com/watch/x/6091165185315991112/movies 
/Item/Purchases/Purchase[@Country="US"]/Rating R 

/Item/Purchases/Purchase[@Country="CA"] 
/Item/Purchases/Purchase[@Country="CA"]/URL http://tvgo.xfinity.com/watch/x/6091165185315991112/movies 
/Item/Purchases/Purchase[@Country="CA"]/Rating R 
+0

这是极不可能的,你可以让'的getPath()'给你。该索引是生成的XPath中唯一明确的唯一内容。您可以尝试在循环中手动修改路径。找到节点'/ Item/Purchases/Purchase [1]'并用你想要的谓词替换'[1]',在随后的所有迭代中替换为相同的值。 – Tomalak

不是很漂亮,但它的确是工作。

replacements = {} 

for e in tree.iter(): 
    path = tree.getpath(e) 

    if re.search('/Purchase\[\d+\]$', path): 
     new_predicate = '[@Country="' + e.attrib['Country'] + '"]' 
     new_path = re.sub('\[\d+\]$', new_predicate, path) 
     replacements[path] = new_path 

    for key, replacement in replacements.iteritems(): 
     path = path.replace(key, replacement) 

    print path, e.text.strip() 

打印这对我来说:

/Item 
/Item/Purchases 
/Item/Purchases/Purchase[@Country="US"] 
/Item/Purchases/Purchase[@Country="US"]/URL http://tvgo.xfinity.com/watch/x/6091165US 
/Item/Purchases/Purchase[@Country="US"]/Rating R 
/Item/Purchases/Purchase[@Country="CA"] 
/Item/Purchases/Purchase[@Country="CA"]/URL http://tvgo.xfinity.com/watch/x/6091165CA 
/Item/Purchases/Purchase[@Country="CA"]/Rating R