pandasライブラリのdata.groupby（cuts）.outcome.aggという関数は何ですか？

お願いします、この機能が何をしているのか分かりません。pandasライブラリのdata.groupby（cuts）.outcome.aggという関数は何ですか？

#group outcomes into bins of similar probability 
    bins = np.linspace(0, 1, 20) 
    cuts = pd.cut(prob, bins) 
    print(cuts) 
    binwidth = bins[1] - bins[0] 

    #freshness ratio and number of examples in each bin 
    cal = data.groupby(cuts).outcome.agg(['mean', 'count']) 
    print(cal['count']) 
    print(cal['mean']) 
    cal['pmid'] = (bins[:-1] + bins[1:])/2 
    cal['sig'] = np.sqrt(cal.pmid * (1 - cal.pmid)/cal['count']) 

    #the calibration plot 
    ax = plt.subplot2grid((3, 1), (0, 0), rowspan=2) 
    p = plt.errorbar(cal.pmid, cal['mean'], cal['sig']) 
    plt.plot(cal.pmid, cal.pmid, linestyle='--', lw=1, color='k') 
    plt.ylabel("Empirical Fraction")

出典

2017-01-27 mario jose

APIのドキュメントはありませんか？ – csmckelvey

dataがoutcomeという名前の列を含むDataFrameです：は、ここでは、コードのコンテキストです。あなたのコードの凸部は、次のとおりです。「カット」欄（further reference）のエントリに基づいて

グループデータ：これは何
```
cal = data.groupby(cuts).outcome.agg(['mean', 'count']) 
```
は順番に、です。
「結果」列に対応するSeriesGroupByを取得します。
SeriesGroupBy（例：hereを参照）の各グループに「平均」と「カウント」の2つの列が適用されたDataFrameを作成します。
変数をcalに割り当てます。

出典

2017-01-28 20:33:57

pandasライブラリのdata.groupby（cuts）.outcome.aggという関数は何ですか？

答えて

関連する問題