刮掉HTML表格。 UWP和C#
问题描述:
我有以下代码刮下面的HTML表格:刮掉HTML表格。 UWP和C#
代码,只是在表中的'服务'。
var MobileSiteLinks = MobileDocument.DocumentNode.SelectNodes("//div[starts-with(@class,'content')]//a[starts-with(@class,'service')]");
foreach (var a in MobileSiteLinks)
{
DestinationPlaces newPlace = new DestinationPlaces();
newPlace.Services = a.InnerText.Trim();
places4.Add(newPlace);
}
<table class="busexpress-clientwidgets-departures-departureboard">
<thead><tr class="rowStopName"><th colspan="3" title="briamaw" data- lat="50.8016420087726" data-lng="-0.0475264219580729" data-bearing="">Cranleigh Avenue</th><tr>
<tr class="textHeader"><th colspan="3">text briamaw to 84268 for live times</th><tr>
<tr class="rowHeaders"><th>service</th><th>destination</th><th>time</th><tr></thead><tbody>
<tr class="rowServiceDeparture">
<td class="colServiceName">27</td>
<td class="colDestination" title="Saltdean">Saltdean</td>
<td class="colDepartureTime" data-departureTime="29/05/2017 15:04:00" title="1 mins">1 mins</td>
</tr>
<tr class="rowServiceDeparture">
<td class="colServiceName">14B</td>
<td class="colDestination" title="Peacehaven">Peacehaven</td>
<td class="colDepartureTime" data-departureTime="29/05/2017 15:19:00" title="16 mins">16 mins</td>
</tr>
<tr class="rowServiceDeparture">
<td class="colServiceName">12A</td>
<td class="colDestination" title="Eastbourne">Eastbourne</td>
<td class="colDepartureTime" data-departureTime="29/05/2017 15:22:00" title="19 mins">19 mins</td>
</tr>
<tr class="rowServiceDeparture">
<td class="colServiceName">27</td>
<td class="colDestination" title="Saltdean">Saltdean</td>
<td class="colDepartureTime" data-departureTime="29/05/2017 15:23:00" title="20 mins">20 mins</td>
</tr>
我知道现在UWP应用程序中不支持SelectNodes。我如何使用新格式获取相同的信息?
感谢
答
我知道的SelectNodes不支持UWP现在的应用程序。
UWP应用程序确实有SelectNodes
相关方法。如果将上表保存为XML文件,则可以使用XmlDocument
类,它包含SelectNodes(String)
方法。例如:
Windows.Data.Xml.Dom.XmlDocument doc;
StorageFile storageFile = await StorageFile.GetFileFromApplicationUriAsync(new Uri("ms-appx:///Assets/Onetable.xml"));
Windows.Data.Xml.Dom.XmlLoadSettings loadSettings = new Windows.Data.Xml.Dom.XmlLoadSettings();
doc = await Windows.Data.Xml.Dom.XmlDocument.LoadFromFileAsync(storageFile, loadSettings);
var xpath = "//tr[@class='rowServiceDeparture']";
var MobileSiteLinks = doc.SelectNodes(xpath);
更多的细节可以参考XmlDocument official sample。
你的代码片段看起来像你使用的是HtmlAgilityPack
包。如果上述内容仅保存为HTML格式,则还可以在UWP应用程序中使用HtmlAgilityPack
,它也包含上面显示的SelectNodes
方法。请注意使用UWP支持的HtmlAgilityPack for .NET Core。
WebRequest request = HttpWebRequest.Create("url");
WebResponse response = await request.GetResponseAsync();
Stream stream = response.GetResponseStream();
var result = "";
using (StreamReader sr = new StreamReader(stream))
{
result = sr.ReadToEnd();
}
HtmlDocument MobileDocument = new HtmlDocument();
MobileDocument.LoadHtml(result);
var nodes = MobileDocument.DocumentNode.SelectNodes("//tr[@class='rowServiceDeparture']");
谢谢。使用HtmlAgilityPack的网络,这与我现有的代码工作! –