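How to search the strings in a dictionary for phrases from a list, and create a dataframe with the phrases found and their counts. Duplicates should be counted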

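Set up the initial dataframe df from the corresponding dictionary:

    import pandas as pd

    df = pd.DataFrame({'urls': list(dictionary.keys()), 'strings': list(dictionary.values())})
    # Alternation pattern that matches any one of the phrases
    pattern = '|'.join(phrases)

Process the dataframe:

    # List of every phrase occurrence in each string (repeats included)
    s = df.pop('strings').str.findall(pattern)
    df = df.assign(phrasecount=s.str.len(), phrase=s.map(', '.join))
    # If nothing matched anywhere, keep a single zero-count row;
    # otherwise keep only the rows that have at least one match
    if df['phrasecount'].eq(0).all():
        df = df.drop_duplicates(subset='phrasecount')
    else:
        df = df[df['phrasecount'].ne(0)]

Result:

    # print(df)
                          urls  phrasecount                               phrase
    0  http://www.firsturl.com            2  going to the market, eating cookies
    2  http://www.thirdurl.com            1                            i am good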
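The question's `dictionary` and `phrases` are not shown here, so the sketch below uses made-up sample data to make the approach runnable end to end. The sample strings, the helper name `count_phrases`, and the use of `re.escape` (so that phrases containing regex metacharacters are matched literally) are assumptions for illustration, not part of the original answer. For brevity it keeps only the rows with at least one match and omits the all-zero fallback.

```python
import re

import pandas as pd

# Hypothetical sample data; the real dictionary and phrases come from the question
dictionary = {
    'http://www.firsturl.com': 'i am going to the market and eating cookies',
    'http://www.secondurl.com': 'nothing relevant here',
    'http://www.thirdurl.com': 'i am good, really, i am good',
}
phrases = ['going to the market', 'eating cookies', 'i am good']


def count_phrases(dictionary, phrases):
    """Return one row per URL whose string contains at least one phrase."""
    df = pd.DataFrame({'urls': list(dictionary.keys()),
                       'strings': list(dictionary.values())})
    # Escape each phrase so characters such as '.' or '?' are matched literally
    pattern = '|'.join(re.escape(p) for p in phrases)
    # findall returns every occurrence, so repeated phrases are counted each time
    found = df.pop('strings').str.findall(pattern)
    df = df.assign(phrasecount=found.str.len(), phrase=found.map(', '.join))
    # Keep only the rows with at least one match
    return df[df['phrasecount'].ne(0)]


result = count_phrases(dictionary, phrases)
print(result)
```

With this sample data the third URL gets a count of 2, because "i am good" occurs twice in its string and `findall` reports each occurrence separately.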
