文字列から単語の頻度を取得する方法は？

-1

ねえ、私は次の問題があります。私は文字列であるoutputHTMLの5つの最も頻繁な単語を見つけるためにC++を使ってウェブサイトを掻き集めました。現在私は次のコードを持っています。どんなヒントも素晴らしいだろう。文字列から単語の頻度を取得する方法は？

curl = curl_easy_init(); 
if(curl) { 
    curl_easy_setopt(curl, CURLOPT_URL, "http://example.com"); 
    curl_easy_setopt(curl, CURLOPT_WRITEFUNCTION, WriteCallback); 
    curl_easy_setopt(curl, CURLOPT_WRITEDATA, &htmlOutput); 
    res = curl_easy_perform(curl); 
    curl_easy_cleanup(curl); 

    std::cout << htmlOutput << std::endl; 
}

出典

2016-05-31 Zeid Selimovic

ヒント： 'のstd ::マップ<はstd ::文字列、unsigned int型>'を使用します。 –

ヒント：[頻度]（https://en.wikipedia.org/wiki/Frequency_（統計）） – lcs

は、ここでより多くの素晴らしさのためにいくつかのヒントがあります：

std::istringstream awsome_stream(web_text); 
std::string word; 
std::map<std::string, unsigned int> kewl_words; 
while (awsome_stream >> word) 
{ 
    kewl_words[word]++; 
} 
std::cout << "Occurances of 'div': " << kewl_words["div"] << "\n";

出典

2016-05-31 19:49:22

文字列から単語の頻度を取得する方法は？

答えて

関連する問題