如何在 pandas 中从列表中提取数据作为字符串，并按值选择数据？

根据问题末尾的解释，似乎两列都是str类型，并且需要转换为list类型.applymap与一起使用ast.literal_eval。如果只有一列是str类型，则使用df[col] = df[col].apply(literal_eval)每列中的数据列表必须使用以下方法提取pandas.DataFrame.explode外部explode将值从列表转换为标量（即[0.4]转换为0.4）。一旦值位于不同的行上，就可以使用布尔索引来选择所需范围内的数据。如果您想df与结合使用df_new，请使用df.join(df_new, rsuffix='_extracted')测试于python 3.10,pandas 1.4.3import pandas as pdfrom ast import literal_eval# setup the test data: this data is lists# data = {'c1': [['abc', 'bcd', 'dog'], ['cat', 'bcd', 'def']], 'c2': [[[.4], [.5], [.9]], [[.9], [.5], [.4]]]}# setup the test data: this data is stringsdata = {'c1': ["['abc', 'bcd', 'dog', 'cat']", "['cat', 'bcd', 'def']"], 'c2': ["[[.4], [.5], [.9], [1.0]]", "[[.9], [.5], [.4]]"]}# create the dataframedf = pd.DataFrame(data)# the description leads me to think the data is columns of strings, not lists# convert the columns from string type to list type# the following line is only required if the columns are stringsdf = df.applymap(literal_eval)# explode the lists in each column, and the explode the remaining lists in 'c2'df_new = df.explode(['c1', 'c2'], ignore_index=True).explode('c2')# use Boolean Indexing to select the desired datadf_new = df_new[df_new['c2'] >= 0.9]# display(df_new) c1 c22 dog 0.93 cat 1.04 cat 0.9

如何在 pandas 中从列表中提取数据作为字符串，并按值选择数据？

2回答