在大熊猫的一个聚合中使用多个 idxmin() 和 idmax() 进行多重索引

3回答

慕神8447489

你可以试试这个，DF.groupby('id').agg(agg1=('col1',lambda x:x[DF.loc[x.index,'col2'].idxmax()]),                     agg2 = ('col2',lambda x:x[DF.loc[x.index,'col3'].idxmin()]),                     agg3 = ('col1',lambda x:x[DF.loc[x.index,'col3'].idxmax()]))    agg1  agg2  agg3id1      5     4     32      5     3     53      7     4     7

0 0

森栏

玩弄这个问题，主要是为了看看我是否可以提高原始解决方案的速度。这比命名聚合更快。grp = df.groupby("id")        pd.DataFrame({ "col1": df.col1[grp.col2.idxmax()].array,                       "col2": df.col2[grp.col3.idxmin()].array,                       "col3": df.col1[grp.col3.idxmax()].array},                       index=grp.indices)    col1    col2    col31   5       4       32   5       3       53   7       4       7加速~3x。

0 0

ABOUTYOU

tidyversepython中的一种方式怎么样：>>> from datar.all import f, tibble, group_by, which_max, which_min, summarise>>> >>> DF = tibble(...     id=[1,1,1,2,2,2,2,3,3,3], ...     col1=[1,3,5,2,5,3,6,3,67,7],...     col2=[4,6,8,3,65,3,5,4,4,7], ...     col3=[34,64,53,5,6,2,4,6,4,67]... )>>> >>> DF >> group_by(f.id) >> summarise(...     agg1=f.col1[which_max(f.col2)],...     agg2=f.col2[which_min(f.col3)],...     agg3=f.col1[which_max(f.col3)]... )       id    agg1    agg2    agg3  <int64> <int64> <int64> <int64>0       1       5       4       31       2       5       3       52       3       7       4       7我是datar包的作者。如果您有任何问题，请随时提交问题。

0 0