2012-04-09 5 views

答えて

0

WikiとそのQuickStartを見てみてください。以下のサンプルコード...

public static void main(final String[] args) throws Exception { 
    URL url; 
    url = new URL("http://www.example.com/some-location/index.html"); 

    // NOTE We ignore HTTP-based character encoding in this demo... 
    final InputStream urlStream = url.openStream(); 
    final InputSource is = new InputSource(urlStream); 

    final BoilerpipeSAXInput in = new BoilerpipeSAXInput(is); 
    final TextDocument doc = in.getTextDocument(); 
    urlStream.close(); 

    // You have the choice between different Extractors 

    // System.out.println(DefaultExtractor.INSTANCE.getText(doc)); 
    System.out.println(ArticleExtractor.INSTANCE.getText(doc)); 
} 
関連する問題