Find the maximum row per group in a Spark DataFrame
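Each Row contains name, id_sa and id_sb, where sa and sb are two different id systems. The goal is a mapping from id_sa to id_sb such that, for each id_sa, the corresponding id_sb is the most frequent one among the names attached to that id_sa. For example, given these rows: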
[Row(name='n1', id_sa='a1', id_sb='b1'), Row(name='n2', id_sa='a1', id_sb='b2'), Row(name='n3', id_sa='a1', id_sb='b2'), Row(name='n4', id_sa='a2', id_sb='b2')]
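The goal is to map a1 to b2: the names associated with a1 are n1, n2 and n3, which map to b1, b2 and b2 respectively, so b2 is the most frequent id_sb for a1. In the same way, a2 maps to b2.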
Starting from groupBy(df.id_sa), the aggregation should produce the following rows:
[Row(id_sa=a1, max_id_sb=b2), Row(id_sa=a2, max_id_sb=b2)]