I am trying to extract names from a dataframe.
df['target_name'].head()
3 Minnie
4 Albert [unclear]Gles[/unclear]
5 Eliza [unclear]Gles[/unclear]
6 John Slaltery
7 [unclear]P.[/unclear] Slaltery
23 ? Stewart
34 John Maddison
35 Herbert Olney
36 William Iverach
37 [unclear][/unclear]
38 Peter Blacksmith
39 William Oliver
40 Emily
Name: target_name, dtype: object
This is the output. We just want to strip the unnecessary characters and keep the names. This is what I did:
import re
df['target_name'] = df['target_name'].astype(str) #converting it into a string.
I tried these two approaches, but both gave me the same output: NaN.
df['target_name'] = df['target_name'].str.extract('([a-zA-Z ]+)', expand=False).str.strip()
df['target_name3'] = df['target_name'].str.replace(r'\([^)]*\)', '').str.strip()
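One thing worth noting about the second attempt: the pattern `\([^)]*\)` matches text in round parentheses, while the data uses square-bracket tags such as `[unclear]`, so it has nothing to remove. Below is a minimal sketch of one possible cleanup, not a definitive fix. It assumes the text inside `[unclear]...[/unclear]` should be kept and only the tags dropped, and that names contain only ASCII letters, periods, apostrophes, and hyphens. The names `clean_name`, `TAG_RE`, and `NON_NAME_RE` are hypothetical helpers introduced for illustration.

```python
import re

# Matches an opening or closing markup tag: "[unclear]" or "[/unclear]".
TAG_RE = re.compile(r"\[/?unclear\]")
# Matches any character that is not an ASCII letter, period, apostrophe,
# hyphen, or whitespace (assumption: names use only these characters).
NON_NAME_RE = re.compile(r"[^A-Za-z.'\s-]")


def clean_name(value):
    """Return the name with markup removed, or None if nothing is left."""
    if not isinstance(value, str):
        return None  # e.g. a missing value such as NaN
    text = TAG_RE.sub("", value)      # drop the tags, keep the text inside
    text = NON_NAME_RE.sub("", text)  # drop stray characters such as "?"
    text = " ".join(text.split())     # trim and collapse whitespace
    return text or None


# Sample values copied from the output shown in the question.
samples = [
    "Minnie",
    "Albert [unclear]Gles[/unclear]",
    "[unclear]P.[/unclear] Slaltery",
    "? Stewart",
    "[unclear][/unclear]",
]
cleaned = [clean_name(s) for s in samples]
print(cleaned)
```

On the dataframe itself, this could be applied with `df['target_name'].map(clean_name)`. If a vectorized call is preferred, `df['target_name'].str.replace(r'\[/?unclear\]', '', regex=True)` removes only the tags; passing `regex=True` explicitly matters because recent pandas versions treat the pattern as a literal string by default.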