刮掉HTML表格。 UWP和C#

问题描述:

我有以下代码刮下面的HTML表格:刮掉HTML表格。 UWP和C#

代码,只是在表中的'服务'。

var MobileSiteLinks = MobileDocument.DocumentNode.SelectNodes("//div[starts-with(@class,'content')]//a[starts-with(@class,'service')]"); 

        foreach (var a in MobileSiteLinks) 
        { 
         DestinationPlaces newPlace = new DestinationPlaces(); 
         newPlace.Services = a.InnerText.Trim(); 
         places4.Add(newPlace); 
        } 

<table class="busexpress-clientwidgets-departures-departureboard"> 
<thead><tr class="rowStopName"><th colspan="3" title="briamaw" data- lat="50.8016420087726" data-lng="-0.0475264219580729" data-bearing="">Cranleigh Avenue</th><tr> 
<tr class="textHeader"><th colspan="3">text briamaw to 84268 for live times</th><tr> 
<tr class="rowHeaders"><th>service</th><th>destination</th><th>time</th><tr></thead><tbody> 
<tr class="rowServiceDeparture"> 
<td class="colServiceName">27</td> 
<td class="colDestination" title="Saltdean">Saltdean</td> 
<td class="colDepartureTime" data-departureTime="29/05/2017 15:04:00" title="1 mins">1 mins</td> 
</tr> 
<tr class="rowServiceDeparture"> 
<td class="colServiceName">14B</td> 
<td class="colDestination" title="Peacehaven">Peacehaven</td> 
<td class="colDepartureTime" data-departureTime="29/05/2017 15:19:00" title="16 mins">16 mins</td> 
</tr> 
<tr class="rowServiceDeparture"> 
<td class="colServiceName">12A</td> 
<td class="colDestination" title="Eastbourne">Eastbourne</td> 
<td class="colDepartureTime" data-departureTime="29/05/2017 15:22:00" title="19 mins">19 mins</td> 
</tr> 
<tr class="rowServiceDeparture"> 
<td class="colServiceName">27</td> 
<td class="colDestination" title="Saltdean">Saltdean</td> 
<td class="colDepartureTime" data-departureTime="29/05/2017 15:23:00" title="20 mins">20 mins</td> 
</tr> 

我知道现在UWP应用程序中不支持SelectNodes。我如何使用新格式获取相同的信息?

感谢

我知道的SelectNodes不支持UWP现在的应用程序。

UWP应用程序确实有SelectNodes相关方法。如果将上表保存为XML文件,则可以使用Xml​Document类,它包含SelectNodes(String)方法。例如:

Windows.Data.Xml.Dom.XmlDocument doc; 
StorageFile storageFile = await StorageFile.GetFileFromApplicationUriAsync(new Uri("ms-appx:///Assets/Onetable.xml")); 
Windows.Data.Xml.Dom.XmlLoadSettings loadSettings = new Windows.Data.Xml.Dom.XmlLoadSettings(); 
doc = await Windows.Data.Xml.Dom.XmlDocument.LoadFromFileAsync(storageFile, loadSettings); 
var xpath = "//tr[@class='rowServiceDeparture']"; 
var MobileSiteLinks = doc.SelectNodes(xpath); 

更多的细节可以参考XmlDocument official sample

你的代码片段看起来像你使用的是HtmlAgilityPack包。如果上述内容仅保存为HTML格式,则还可以在UWP应用程序中使用HtmlAgilityPack,它也包含上面显示的SelectNodes方法。请注意使用UWP支持的HtmlAgilityPack for .NET Core

WebRequest request = HttpWebRequest.Create("url"); 
WebResponse response = await request.GetResponseAsync(); 
Stream stream = response.GetResponseStream(); 
var result = ""; 
using (StreamReader sr = new StreamReader(stream)) 
{ 
    result = sr.ReadToEnd(); 
} 
HtmlDocument MobileDocument = new HtmlDocument(); 
MobileDocument.LoadHtml(result); 
var nodes = MobileDocument.DocumentNode.SelectNodes("//tr[@class='rowServiceDeparture']"); 
+0

谢谢。使用HtmlAgilityPack的网络,这与我现有的代码工作! –