假设我有两个 pandas DataFrame 即 df1, df2
df1 = {name : [tom, jerry, jennifer, hafiz, kitty]}
df2 = {name : [tom, jerry, alex, hafiz, samdin, unnar]}
从这两个数据集中,我想生成
good_boy = [tom, jerry] # present in both the datasets
bad_boy = [jenifer, hafiz, kitty] # present in df1 but not in df2
new_boy = [alex, samdin, unnar] # in df2 but not in df1
实际数据集非常大,有数百万行,我尝试进行迭代检查,但速度太慢了。Pandas 中是否已经存在任何 tric(并行处理)。请帮我解决这个问题,我的注意力是时间。谢谢
慕标琳琳
慕桂英3389331
相关分类