期間全体での集計

これらのデータを累積する方法は、秒で、それぞれのの唯一のIDです。私はできません期間全体での集計

## disregarding the loc grouping: 
df.test <- select(df, time, secs) 
df.test <- na.omit(df.test) ##xts with period.sum does not like NA 
df.test <- as.xts(df.test, order.by = df.test$time) 
df.test <- period.sum(df.test$secs, endpoints(df.test , "mins", k=15)) 
df.test <- align.time(df.test , 15*60)

しかし：私は秒を合計する成功したパッケージXTSを使用しました

> dput(df.out) structure(list(unique.id = c(3, 7, 2, 2, 4), loc = c("A", "A", "A", "B", "B"), time = structure(c(1425172501, 1425173400, 1425174300, 1425321900, 1425322800), class = c("POSIXct", "POSIXt"), tzone = "UTC"), secs = c(318, 380, 6, 43, 138)), class = c("tbl_df", "tbl", "data.frame"), row.names = c(NA, -5L), .Names = c("unique.id", "loc", "time", "secs"))

次のように

> dput(df) structure(list(id = c(131, 146, 160, 146, 160, 146, 160, 137, 157, 144, 124, 144, 119, 119, 242, 242, 235, 235, 145, 262, 258, 160, 145, 135, 148, 148, 143), loc = structure(c(1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 2L), .Label = c("A", "B"), class = "factor"), time = structure(c(1425197400, 1425197400, 1425197400, 1425197460, 1425197460, 1425197520, 1425197520, 1425197940, 1425198180, 1425198180, 1425198180, 1425198240, 1425198240, 1425198300, 1425198300, 1425198360, 1425198480, 1425198540, 1425198840, 1425198900, 1425346560, 1425346560, 1425347280, 1425347460, 1425347520, 1425347580, 1425347580), class = c("POSIXct", "POSIXt")), secs = c(35, 60, 60, 60, 60, 19, 24, 0, 0, 60, 0, 46, 60, 28, 60, 48, 60, 18, 6, 0, 0, 43, 0, 37, 60, 27, 14)), .Names = c("id", "loc", "time", "secs"), row.names = c(NA, 27L), class = "data.frame")

この例の出力は次のようになります一意のIDを数えるために同じことをします。誰かがよりエレガントな解決策を持っている場合はところで、私はここで（期間指標を準備してからちょうどdplyr::group_by()::summarise()にすべてを養うためにいいだろう）

おかげ

出典

2017-01-20 Thomas Speidel

を入力を歓迎したいdplyrを使用して一つの解決策です。時間を15分間隔に変換し、次にgroup_y/summarizeを実行します。

df$time<- as.POSIXct(ceiling(as.double(df$time)/(15*60)) * (15*60), 
         origin = '1970-01-01') 
df %>% 
    group_by(time, loc) %>% 
    summarise(unique.id = n_distinct(id), secs = sum(secs)) %>% 
    select(unique.id, loc, time, secs)

出力は次のとおりです。

Source: local data frame [5 x 4] 
Groups: time [5] 

    unique.id loc    time secs 
     <int> <fctr>    <dttm> <dbl> 
1   3  A 2015-03-01 03:15:00 318 
2   7  A 2015-03-01 03:30:00 380 
3   2  A 2015-03-01 03:45:00  6 
4   2  B 2015-03-02 20:45:00 43 
5   4  B 2015-03-02 21:00:00 138

出典

2017-01-20 19:22:41 Gopala

これはとても簡単です！私はちょうど天井を取ることができると信じることができない...ありがとう –

答えて

関連する問題