次のデータフレームをフィルタリングする方法は？

こんにちは、私は最初の列に「0」と私はすべての値をしたい1つのデータフレームをフィルタリングしたいと思い、問題はMIのcsvファイルは、私が得たしかし私は次のデータフレームをフィルタリングする方法は？

import pandas as pd 

df = pd.read_csv('big-cluster.csv',delimiter=',',encoding='ISO-8859-15') 
print(df.iloc[:,0].isin([0]))

を試しヘッダていないということです。

0  False 
1  False 
2  False 
3  False 
4  False 
5  False 
6  False 
7  False 
8  False 
9  False 
10  False 
11  False 
12  False 
13  False 
14  False 
15  False 
16  False 
17  False 
18  False 
19  False 
20  False 
21  False 
22  False 
23  False

私のcsvファイルは、以下の構造を有する：私は取得したいと思い

10,hello this is a test 
4,just testing code 
3,this is fun 
4,you are good 
10,this is very good

出力は次のようになります。

10,hello this is a test 
10,this is very good

はここからのフィードバックの後、私が試した：

mask = df[0].astype(str).str.contains("0") 
print(df[mask])

私が得たしかし：

--------------------------------------------------------------------------- 
KeyError         Traceback (most recent call last) 
/usr/local/lib/python3.6/site-packages/pandas/core/indexes/base.py in get_loc(self, key, method, tolerance) 
    2392    try: 
-> 2393     return self._engine.get_loc(key) 
    2394    except KeyError: 

pandas/_libs/index.pyx in pandas._libs.index.IndexEngine.get_loc (pandas/_libs/index.c:5239)() 

pandas/_libs/index.pyx in pandas._libs.index.IndexEngine.get_loc (pandas/_libs/index.c:5085)() 

pandas/_libs/hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item (pandas/_libs/hashtable.c:20405)() 

pandas/_libs/hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item (pandas/_libs/hashtable.c:20359)() 

KeyError: 0 

During handling of the above exception, another exception occurred: 

KeyError         Traceback (most recent call last) 
<ipython-input-54-fe5bdde1bff6> in <module>() 
    65 ''' 
    66 
---> 67 mask = df[0].astype(str).str.contains("0") 
    68 print(df[mask]) 
    69 

/usr/local/lib/python3.6/site-packages/pandas/core/frame.py in __getitem__(self, key) 
    2060    return self._getitem_multilevel(key) 
    2061   else: 
-> 2062    return self._getitem_column(key) 
    2063 
    2064  def _getitem_column(self, key): 

/usr/local/lib/python3.6/site-packages/pandas/core/frame.py in _getitem_column(self, key) 
    2067   # get column 
    2068   if self.columns.is_unique: 
-> 2069    return self._get_item_cache(key) 
    2070 
    2071   # duplicate columns & possible reduce dimensionality 

/usr/local/lib/python3.6/site-packages/pandas/core/generic.py in _get_item_cache(self, item) 
    1532   res = cache.get(item) 
    1533   if res is None: 
-> 1534    values = self._data.get(item) 
    1535    res = self._box_item_values(item, values) 
    1536    cache[item] = res 

/usr/local/lib/python3.6/site-packages/pandas/core/internals.py in get(self, item, fastpath) 
    3588 
    3589    if not isnull(item): 
-> 3590     loc = self.items.get_loc(item) 
    3591    else: 
    3592     indexer = np.arange(len(self.items))[isnull(self.items)] 

/usr/local/lib/python3.6/site-packages/pandas/core/indexes/base.py in get_loc(self, key, method, tolerance) 
    2393     return self._engine.get_loc(key) 
    2394    except KeyError: 
-> 2395     return self._engine.get_loc(self._maybe_cast_indexer(key)) 
    2396 
    2397   indexer = self.get_indexer([key], method=method, tolerance=tolerance) 

pandas/_libs/index.pyx in pandas._libs.index.IndexEngine.get_loc (pandas/_libs/index.c:5239)() 

pandas/_libs/index.pyx in pandas._libs.index.IndexEngine.get_loc (pandas/_libs/index.c:5085)() 

pandas/_libs/hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item (pandas/_libs/hashtable.c:20405)() 

pandas/_libs/hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item (pandas/_libs/hashtable.c:20359)() 

KeyError: 0

出典

2017-06-01 neo33

は、私はあなたの問題はあなたの最初のフィールドに文字列をint型として入力していない取得されていることだと思います。だから、 'astype'と.str.containsを使ってゼロをチェックしてみましょう。

のは、試してみましょう：

mask = df[0].astype(str).str.contains("0") 
print(df[mask])

が出力：よさそうだが、私はエラーを得た、私は質問を更新してみましょう

0      1 
0 10 hello this is a test 
4 10  this is very good

出典

2017-06-01 16:22:02

@piRSquaredおかげで、あなたの列名は何neo33 @ – neo33

最初の列。あなたのファイルがないので私はゼロを使用します。あなたのカラム名をここで0の代わりに指定してください。 –

わからないときは、 'df.iloc [:, 0]'を使用してください。 – piRSquared

次のデータフレームをフィルタリングする方法は？

答えて

関連する問題