Parsing an HTML document in PHP
Question:
I have PHP code that fetches an HTML document from a web service and tries to parse it:
<?php
$speechText=$_POST['spokentext'];
echo $speechText.'!';
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, 'http://nlp.stanford.edu:8080/parser/index.jsp');
curl_setopt($ch, CURLOPT_POST, 3);
curl_setopt($ch,CURLOPT_POSTFIELDS, "query=".$speechText."&parserSelect=English&parse=Parse");
$result = curl_exec($ch);
//echo $result;
curl_close($ch);
$dom = new DOMDocument();
$dom->loadHTML($result);
$xpath = new DOMXPath($dom);
$result = $xpath->query('/html/body/div[2]/div[5]/pre');
echo $result->item(0)->nodeValue;
?>
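The variable $result holds the HTML document returned by the service. I want to parse it and extract some information from it. I am using the built-in DOM parser, but I get this error: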
PHP Notice: Trying to get property of non-object
What is wrong?
P.S. I'm a complete n00b at programming :P
Answer
It's fairly simple:
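// $html is assumed to hold the HTML string returned by the web service
$dom = new DOMDocument();
libxml_use_internal_errors(true); // collect markup warnings instead of printing them
$dom->loadHTML($html);
libxml_clear_errors();
$xpath = new DOMXPath($dom);
$elements = $xpath->query("/html/body/div[2]/div[6]/pre");
$typed_dependencies_collapsed = array();
// query() returns false (not null) when the expression is invalid
if ($elements !== false) {
    foreach ($elements as $element) {
        // Walk the child nodes of each matching <pre> element
        foreach ($element->childNodes as $node) {
            echo $node->nodeValue."\n";
            $typed_dependencies_collapsed[] = $node->nodeValue;
        }
    }
}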
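A likely cause of the notice, as far as I can tell from the question's code: CURLOPT_RETURNTRANSFER is never set, so curl_exec() prints the response and returns true instead of the HTML string. The XPath query then matches nothing, item(0) returns null, and reading ->nodeValue on null triggers "Trying to get property of non-object". A minimal sketch of a guarded version follows; the helper names fetch_parser_html and first_node_text are hypothetical and only for illustration, and the form fields are copied from the question.

```php
<?php
// Hypothetical helper: POST the sentence and return the response body as a string.
function fetch_parser_html(string $sentence): ?string
{
    $ch = curl_init('http://nlp.stanford.edu:8080/parser/index.jsp');
    curl_setopt($ch, CURLOPT_POST, true);
    // http_build_query() URL-encodes the values, so spaces and '&' in the
    // sentence cannot break the request body.
    curl_setopt($ch, CURLOPT_POSTFIELDS, http_build_query([
        'query'        => $sentence,
        'parserSelect' => 'English',
        'parse'        => 'Parse',
    ]));
    // Without this option curl_exec() echoes the body and returns true.
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
    $body = curl_exec($ch);
    // The handle is released when $ch goes out of scope.
    return is_string($body) ? $body : null;
}

// Hypothetical helper: text of the first node matching $query, or null if none.
function first_node_text(string $html, string $query): ?string
{
    if ($html === '') {
        return null; // loadHTML() rejects an empty string
    }
    $dom = new DOMDocument();
    $previous = libxml_use_internal_errors(true); // silence markup warnings
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($dom);
    $nodes = $xpath->query($query);
    // Check the match count before dereferencing item(0).
    if ($nodes === false || $nodes->length === 0) {
        return null;
    }
    return $nodes->item(0)->nodeValue;
}
```

Usage would look something like first_node_text($html, '/html/body/div[2]/div[6]/pre'), with a null return meaning the path did not match. Absolute paths such as div[2]/div[6] break as soon as the page layout changes, so it is worth checking the path against the HTML you actually receive.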
http://php.net/domdocument – cmbuckley 2015-04-03 16:22:35
Please see: http://bit.ly/1NGQpUF – 2015-04-03 16:24:53
@cmbuckley Any simple pointers on how to parse it? How do I access a specific class/ID? – GokuShanth 2015-04-03 16:26:10
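On accessing a specific class or ID: one way is to select by attribute in XPath rather than by position. A rough sketch, assuming made-up markup; the id "parse" and the class "dependencies" are invented for illustration and are not taken from the real parser page.

```php
<?php
// Hypothetical markup; the id and class names are invented for this example.
$html = <<<HTML
<html><body>
  <div id="parse"><pre>(ROOT (NP (NN test)))</pre></div>
  <div class="result dependencies"><pre>root(ROOT-0, test-1)</pre></div>
</body></html>
HTML;

$dom = new DOMDocument();
libxml_use_internal_errors(true); // collect markup warnings instead of printing them
$dom->loadHTML($html);
libxml_clear_errors();
$xpath = new DOMXPath($dom);

// Select by id: any element whose id attribute equals "parse".
$byId = $xpath->query('//*[@id="parse"]/pre');

// Select by class: match "dependencies" as a whole token in the class list,
// so class="result dependencies" matches but class="dependencies-old" does not.
$byClass = $xpath->query(
    '//div[contains(concat(" ", normalize-space(@class), " "), " dependencies ")]/pre'
);

// Guard against an empty result before reading nodeValue.
$parseText = $byId->length > 0 ? $byId->item(0)->nodeValue : null;
$depsText  = $byClass->length > 0 ? $byClass->item(0)->nodeValue : null;

echo $parseText, "\n", $depsText, "\n";
```

Attribute-based selectors like these tend to survive layout changes better than absolute paths such as /html/body/div[2]/div[6]/pre.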