如何在Windows上安装Boilerpipe?
答
试着看看他们的Wiki和他们的QuickStart。下面的示例代码...
public static void main(final String[] args) throws Exception {
URL url;
url = new URL("http://www.example.com/some-location/index.html");
// NOTE We ignore HTTP-based character encoding in this demo...
final InputStream urlStream = url.openStream();
final InputSource is = new InputSource(urlStream);
final BoilerpipeSAXInput in = new BoilerpipeSAXInput(is);
final TextDocument doc = in.getTextDocument();
urlStream.close();
// You have the choice between different Extractors
// System.out.println(DefaultExtractor.INSTANCE.getText(doc));
System.out.println(ArticleExtractor.INSTANCE.getText(doc));
}