网页报废的（iframe）搜索结果在

问题描述：

我想从下面的网站刮所有的NPI和细节。 “https://www.pverify.com/npi-lookup-find-npi-number-of-doctors-physicians/”网页报废的（iframe）搜索结果在

代码：

library("rvest") 
library("xml2") 
url="https://www.pverify.com/npi-lookup-find-npi-number-of-doctors-physicians/" 
webpage<-read_html(url) 
data_html <- html_nodes(webpage,'iframe') 
data_html <-html_table(data_html)

当我尝试上面的代码，错误消息是 “错误：html_name（X）== ”表“ 是不是真正的” 请帮我的得到NPI号码和他们的细节。

答

您可以尝试Rselenium。

代码看起来或多或少像这样。

library(Rselenium) 
library(XML)  

remDr <- remoteDriver(port = 4445L) 
remDr$open() 
remDr$navigate("https://www.pverify.com/npi-lookup-find-npi-number-of-doctors-physicians/") 
h <- htmlParse(remDr$getPageSource()[[1]], encoding = "UTF-8") 
h_table <- html_table(h)

要创建一个泊坞窗服务器，你可以看到here

remDr

你使用的是Linux吗？如果是，打开终端和数字：服务码头状态。看看你的服务器是否在运行。如果不是你需要看到[这]（https://cran.r-project.org/web/packages/RSelenium/vignettes/RSelenium-docker.html） –

我使用的是Windows 10 –

网页报废的（iframe）搜索结果在

相关推荐