How do I search a string for a keyword, extract that keyword, and put it in a new column?

4 Answers

GCT1015

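How about this solution? When you have a new category, all you have to do is add it to the cats list.

    import pandas as pd
    import numpy as np

    df = pd.DataFrame({'Product Name': ['Nike Zoom Pegasus', 'All New Nike Zoom Pegasus 4', 'Metcon 3', 'Nike Metcon 5']})
    cats = ["Pegasus", "Metcon"]
    # Split each name into words, intersect with cats, and take the first
    # (in sorted order) common value.
    # Note: [0] raises IndexError for a name that contains none of the categories.
    df["Category"] = df["Product Name"].apply(lambda x: np.intersect1d(x.split(" "), cats)[0])

Output:

                      Product Name Category
    0            Nike Zoom Pegasus  Pegasus
    1  All New Nike Zoom Pegasus 4  Pegasus
    2                     Metcon 3   Metcon
    3                Nike Metcon 5   Metcon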
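One caveat with the `[0]` lookup is that it fails on a product name that matches no category. A minimal sketch of a variant that returns `None` instead is shown here; the helper name `first_category` and the extra product "Air Max 90" are made up for illustration and are not part of the original answer.

```python
import pandas as pd

# "Air Max 90" is a hypothetical row added to show the no-match case.
df = pd.DataFrame({"Product Name": ["Nike Zoom Pegasus", "Metcon 3", "Air Max 90"]})
cats = ["Pegasus", "Metcon"]


def first_category(name, categories):
    """Return the first category that appears as a whole word in name, else None."""
    words = set(name.split())
    return next((c for c in categories if c in words), None)


# Rows without any known category get a missing value instead of raising.
df["Category"] = df["Product Name"].apply(lambda name: first_category(name, cats))
print(df)
```

Unlike `np.intersect1d`, this picks the first match in the order of `cats` rather than in sorted order, which only matters when a name contains more than one category.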


BIG阳

Use pandas.Series.str.extract:

    >>> import pandas as pd
    >>> df = pd.DataFrame({'Product Name': ['Nike Zoom Pegasus', 'All New Nike Zoom Pegasus 4', 'Metcon 3', 'Nike Metcon 5']})
    >>> cats = ["Pegasus", "Metcon"]
    >>> # Build the pattern "(Pegasus|Metcon)"; expand=False returns a Series
    >>> df['Category'] = df["Product Name"].str.extract("(%s)" % "|".join(cats), expand=False)
    >>> df
                      Product Name Category
    0            Nike Zoom Pegasus  Pegasus
    1  All New Nike Zoom Pegasus 4  Pegasus
    2                     Metcon 3   Metcon
    3                Nike Metcon 5   Metcon
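If the category names might contain regex metacharacters, or the product names vary in case, the pattern can be built a little more defensively. The following is only a sketch under those assumptions; the lowercase and "Air Max 90" rows are invented for illustration.

```python
import re

import pandas as pd

# Hypothetical data: a lowercase name and a name with no category.
df = pd.DataFrame({"Product Name": ["nike zoom pegasus", "Metcon 3", "Air Max 90"]})
cats = ["Pegasus", "Metcon"]

# re.escape guards against special characters in category names;
# \b restricts matches to whole words.
pattern = r"\b(" + "|".join(re.escape(c) for c in cats) + r")\b"

# flags=re.IGNORECASE matches regardless of case; rows without a match become NaN.
df["Category"] = df["Product Name"].str.extract(pattern, flags=re.IGNORECASE, expand=False)
print(df)
```

Note that the extracted value keeps the casing found in the product name, so the first row yields `pegasus` rather than `Pegasus`.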


FFIVE

How about:

    import pandas as pd

    # Note: df is a plain dict here, not a DataFrame.
    df = {'Product Name': ['Nike Zoom Pegasus', 'All New Nike Zoom Pegasus 4', 'Metcon 3', 'Nike Metcon 5']}
    c = {'Metcon', 'Pegasus'}
    # For each product name, keep the words that are also known categories.
    categories = [c.intersection(pn.split(' ')) for pn in df['Product Name']]
    df['Categories'] = categories
    print(df)

Output:

    {'Product Name': ['Nike Zoom Pegasus', 'All New Nike Zoom Pegasus 4', 'Metcon 3', 'Nike Metcon 5'], 'Categories': [{'Pegasus'}, {'Pegasus'}, {'Metcon'}, {'Metcon'}]}
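The same set-intersection idea can be applied to an actual DataFrame column, which is useful when a product may belong to several categories. This is a sketch only; the product names "Nike Metcon Pegasus Pack" and "Air Max 90" are hypothetical.

```python
import pandas as pd

# Hypothetical rows: one with two categories, one with none.
df = pd.DataFrame(
    {"Product Name": ["Nike Zoom Pegasus", "Nike Metcon Pegasus Pack", "Air Max 90"]}
)
cats = {"Metcon", "Pegasus"}

# Store a sorted list of every category word found in each name;
# names without a match get an empty list.
df["Categories"] = df["Product Name"].apply(
    lambda name: sorted(cats.intersection(name.split()))
)
print(df)
```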


守候你守候我

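The problems with your code are as follows: you pass in product, but the check uses df["Product Name"], which returns the whole Series. In addition, the function returns product, whereas the expected answer is either Pegasus or Metcon. I think you want something like this:

    def get_category(product):
        if "Pegasus" in product:
            return "Pegasus"
        elif "Metcon" in product:
            return "Metcon"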
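For completeness, here is one way the function could be applied to the column, assuming the same sample DataFrame used in the other answers. The explicit `return None` is an addition that makes the no-match case visible.

```python
import pandas as pd


def get_category(product):
    """Map a single product name (a string) to its category."""
    if "Pegasus" in product:
        return "Pegasus"
    elif "Metcon" in product:
        return "Metcon"
    return None  # no known category in this name


df = pd.DataFrame(
    {"Product Name": ["Nike Zoom Pegasus", "All New Nike Zoom Pegasus 4", "Metcon 3", "Nike Metcon 5"]}
)

# Series.apply calls get_category once per row, passing one string at a time.
df["Category"] = df["Product Name"].apply(get_category)
print(df)
```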
