2017-03-26 4 views
1
 
Whenever am trying to index an collection I get the following error 
> db.visits.ensureIndex({rawdata:"text"}) 
{ 
     "ok" : 0, 
     "errmsg" : "text contains invalid UTF-8", 
     "code" : 28755, 
     "codeName" : "Location28755" 
} 
> 
The rawdata contain Email:Time:Url 
Am using Mongo/Windows 
The data were inserted using mongoimport from 2 tsv files 
When I imported the 1st file alone in new collection , indexing was done normally 
The second file in new collection raised up the problem again while indexing it 
But honestly I can't find the wrong line 
Both file contain around +50 million lines 
Any help would be appreciated 

答えて

0

「テキストが無効なUTF-8を含む」私はいずれかがより良いsollutionを持っている場合、

を歓迎されるだろう、まだ010エディタ を使用してUTF形式に非UTF-8文字列を変換することによって、これを解決しました
関連する問題