循环数据框的每一行，并根据条件向数据框添加元素

无需循环。您可以使用.isin()withnp.select()根据条件返回结果。见下面的代码：import pandas as pdimport numpy as nplist_drinks=['Water','Juice','Tea']list_food=['Apple','Orange']data = {'Price':  ['1', '5','3'],    'Product': ['Juice','book','Pen']}df = pd.DataFrame(data)df['Category'] = np.select([(df['Product'].isin(list_drinks)),               (df['Product'].isin(list_food))],              ['drinks',              'food'], 'Other')dfOut[1]:   Price Product Category0     1   Juice   drinks1     5    book    Other2     3     Pen    Other下面，我将代码分解为更详细的内容，以便您可以了解它是如何工作的。我也根据你的评论略有改变。我使用列表理解和来检查列表中的值是否位于数据帧中的值的子字符串中in。为了提高匹配率，我还将 as 全部小写与进行比较.lower()：import pandas as pdimport numpy as nplist_drinks=['Water','Juice','Tea']list_food=['Apple','Orange']data = {'Price':  ['1', '5','3'],    'Product': ['green Juice','book','oRange you gonna say banana']}df = pd.DataFrame(data)c1 = (df['Product'].apply(lambda x: len([y for y in list_drinks if y.lower() in x.lower()]) > 0))c2 = (df['Product'].apply(lambda x: len([y for y in list_food if y.lower() in x.lower()]) > 0))r1 = 'drinks'r2 = 'food'conditions = [c1,c2]results= [r1,r2]df['Category'] = np.select(conditions, results, 'Other')dfOut[1]:   Price                      Product Category0     1                  green Juice   drinks1     5                         book    Other2     3  oRange you gonna say banana     food

循环数据框的每一行，并根据条件向数据框添加元素

2回答