如何读取XML和使用LXML
问题描述:
我是一个有点困惑与XML如何读取XML和使用LXML
我的XML
<?xml version="1.0" encoding="UTF-8"?>
<AirShoppingRS Version="16.2" xsi:schemaLocation="http://www.iata.org/IATA/EDIST AirShoppingRS.xsd"
xmlns="http://www.iata.org/IATA/EDIST"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<Document/>
<Success/>
<ShoppingResponseID>
<ResponseID>2017-10-04T14:35:25.243504</ResponseID>
</ShoppingResponseID>
<OffersGroup>
<AirlineOffers>
<TotalOfferQuantity>297</TotalOfferQuantity>
<Owner>SU</Owner>
<AirlineOffer>
<OfferID Owner="SU">OFFER5</OfferID>
<TotalPrice>
<SimpleCurrencyPrice Code="RUB">36229</SimpleCurrencyPrice>
</TotalPrice>
<PricedOffer>
<OfferPrice OfferItemID="5">
<RequestedDate>
<PriceDetail>
<TotalAmount>
<SimpleCurrencyPrice>36229</SimpleCurrencyPrice>
</TotalAmount>
<BaseAmount>33000</BaseAmount>
<Taxes>
<Total Code="RUB">3229</Total>
</Taxes>
</PriceDetail>
</RequestedDate>
<FareDetail>
<FareComponent>
<SegmentReference>SEG_SVOLED_1</SegmentReference>
<FareBasis>
<FareBasisCode>
<Code>DFOR</Code>
</FareBasisCode>
</FareBasis>
</FareComponent>
</FareDetail>
</OfferPrice>
</PricedOffer>
</AirlineOffer>
</AirlineOffers>
</OffersGroup>
</AirShoppingRS>
获取信息,我怎么能读它,使用lxml
库从一个元素获取文本。我尝试这一个root = etree.fromstring(xml.content)
,然后我尝试airline_offers = root.findall("AirlineOffer")
,但什么也没得到。猜猜,我做错了什么。我在哪里犯错?我怎样才能得到一个元素,然后从它的文本或属性?
UPDATE:airline_offers = root.findall(".//AirlineOffer")
回报没有什么太
答
默认命名空间(http://www.iata.org/IATA/EDIST
)是根元素上声明。这是使它工作的一种方式:
airline_offers = root.findall(".//{http://www.iata.org/IATA/EDIST}AirlineOffer")
也可以使用通配符:
airline_offers = root.findall(".//{*}AirlineOffer")
另一种方法是定义一个前缀:
NS = {"edist": "http://www.iata.org/IATA/EDIST"}
airline_offers = root.findall(".//edist:AirlineOffer", namespaces=NS)
的可能的复制[ ElementTree findall()返回空列表](https://stackoverflow.com/questions/9112121/elementtree-findall-returning-empty-list) – Bernhard
@Headmaster,你想阅读所有的文字,你想要哪些标签文字读书 ? –
@ChetanVasudevan'SimpleCurrencyPrice'标记例如 – Headmaster