Filtering results from findAll in BeautifulSoup

import urllib2 
from BeautifulSoup import BeautifulSoup 

result = urllib2.urlopen("http://www.bbc.co.uk/news/uk-scotland-south-scotland-12380537") 
html=result.read() 
soup= BeautifulSoup(html) 
print soup.html.head.title 

print soup.findAll('div', attrs={ "class" : "story-body"})

The information I want is in the story body, but it is at the bottom, so I get a load of junk before I reach it.

print soup.findAll('p', attrs={ 'class' : "introduction"})

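only gets my first <p>. In this example I want to collect everything from the start of the introduction to the end of the story-body. Any ideas? There are 8 or more paragraphs to collect.

Source

2012-05-09 aromamode

In terms of CSS selectors, you would select all the p elements inside .story-body: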

print soup.select('.story-body p')

http://www.crummy.com/software/BeautifulSoup/bs4/doc/index.html?highlight=select#css-selectors
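
As a rough illustration of the selector above, here is a minimal sketch run against a small inline document. The sample markup is made up for this example (the real BBC page may be structured differently), and it assumes Beautiful Soup 4 is installed as `bs4`. It also shows a scoped search that does not need `select`: find the container first, then look for paragraphs inside it.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the news page; the real markup may differ.
SAMPLE_HTML = """
<html><head><title>Sample story</title></head>
<body>
  <p>Navigation text outside the story</p>
  <div class="story-body">
    <h1>Headline</h1>
    <ul><li>Related link</li></ul>
    <p class="introduction">First paragraph.</p>
    <p>Second paragraph.</p>
    <p>Third paragraph.</p>
  </div>
</body></html>
"""

soup = BeautifulSoup(SAMPLE_HTML, "html.parser")

# Descendant selector: every <p> anywhere inside an element with class story-body.
via_select = [p.get_text() for p in soup.select(".story-body p")]

# Same idea without CSS selectors: locate the container, then search inside it.
# (In BeautifulSoup 3 the second call is spelled findAll rather than find_all.)
body = soup.find("div", attrs={"class": "story-body"})
via_find = [p.get_text() for p in body.find_all("p")]

print(via_select)
print(via_find)
```

Both lists contain only the three paragraphs inside the story body. The paragraph outside the div, the headline and the list are skipped. Note that `.story-body p` matches paragraphs at any depth, so if the container also holds nested widgets with their own `<p>` tags, those would be picked up too.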

Source

2012-05-09 18:57:50 thirtydot

You're not using Beautiful Soup 4, are you? – thirtydot

No. Thanks for the link. – aromamode

No, not in this example. I think I will move to it tomorrow. – aromamode
