通过从右表中采样来填充左连接的 NaN 值

使用sample与fillnajoined_left = left.merge(right, how="left", left_on=[0], right_on=[0],indicator=True) # adding indicatorjoined_leftOut[705]:    0  1_x  2_x  1_y  2_y     _merge0  1    1    1  2.0  2.0       both1  1    1    1  2.0  3.0       both2  2    2    2  NaN  NaN  left_only3  3    3    3  2.0  2.0       both4  3    3    3  2.0  9.0       both5  3    3    3  2.0  2.0       both6  9    9    9  NaN  NaN  left_only7  1    3    2  2.0  2.0       both8  1    3    2  2.0  3.0       bothnnull=joined_left['_merge'].eq('left_only').sum() # find all many row miss match , at the mergedfs=right.sample(nnull)# rasmple from the dataframe after dropna s.index=joined_left.index[joined_left['_merge'].eq('left_only')] # reset the index of the subset fill df to the index of null value show up joined_left.fillna(s.rename(columns={1:'1_y',2:'2_y'})) Out[706]:    0  1_x  2_x  1_y  2_y     _merge0  1    1    1  2.0  2.0       both1  1    1    1  2.0  3.0       both2  2    2    2  2.0  2.0  left_only3  3    3    3  2.0  2.0       both4  3    3    3  2.0  9.0       both5  3    3    3  2.0  2.0       both6  9    9    9  2.0  3.0  left_only7  1    3    2  2.0  2.0       both8  1    3    2  2.0  3.0       both

通过从右表中采样来填充左连接的 NaN 值

1回答