为每个数据组中的行创建一个序号(计数器

3回答

GCT1015

被误导的名字ave()函数，带参数FUN=seq_along，就能很好地完成这一任务-即使你personid列没有严格的排序。df <- read.table(text = "personid date measurement1         x     231         x     322         y     213         x     233         z     233         y     23", header=TRUE)## First with your data.frameave(df$personid, df$personid, FUN=seq_along)# [1] 1 2 1 1 2 3## Then with another, in which personid is *not* in orderdf2 <- df[c(2:6, 1),]ave(df2$personid, df2$personid, FUN=seq_along)# [1] 1 1 1 2 3 2

0 0

拉莫斯之舞

一些dplyr替代品，使用方便函数row_number和n.library(dplyr)df %>% group_by(personid) %>% mutate(id = row_number())df %>% group_by(personid) %>% mutate(id = 1:n())df %>% group_by(personid) %>% mutate(id = seq_len(n()))df %>% group_by(personid) %>% mutate(id = seq_along(personid))您也可以使用getanID从包装splitstackshape..注意，输入数据集作为data.table.getanID(data = df, id.vars = "personid")#    personid date measurement .id# 1:        1    x          23   1# 2:        1    x          32   2# 3:        2    y          21   1# 4:        3    x          23   1# 5:        3    z          23   2# 6:        3    y          23   3

0 0

温温酱

使用data.table，并假设您希望通过date在personid子集library(data.table)DT <- data.table(Data)DT[,id := order(date), by  = personid]##    personid date measurement id## 1:        1    x          23  1## 2:        1    x          32  2## 3:        2    y          21  1## 4:        3    x          23  1## 5:        3    z          23  3## 6:        3    y          23  2如果你不想dateDT[, id := 1:.N, by = personid]##    personid date measurement id## 1:        1    x          23  1## 2:        1    x          32  2## 3:        2    y          21  1## 4:        3    x          23  1## 5:        3    z          23  2## 6:        3    y          23  3以下任何一项都将有效DT[, id := seq_along(measurement), by =  personid]DT[, id := seq_along(date), by =  personid]使用的等效命令plyrlibrary(plyr)# ordering by dateddply(Data, .(personid), mutate, id = order(date))# in original orderddply(Data, .(personid), mutate, id = seq_along(date))ddply(Data, .(personid), mutate, id = seq_along(measurement))

0 0