如何在Python中仅提取url的特定部分并将其值添加为df中每一行的另一列？

首页课程实战体系课手记专栏慕课教程

我有一个包含用户和 url 的 df 看起来像这样。

User Url

1 http://www.mycompany.com/Overview/Get

2 http://www.mycompany.com/News

3 http://www.mycompany.com/Accountinfo

4 http://www.mycompany.com/Personalinformation/Index

...

我想添加另一个仅包含 url 的第二部分的列页面，所以我会像这样使用它。

user url page

1 http://www.mycompany.com/Overview/Get Overview

2 http://www.mycompany.com/News News

3 http://www.mycompany.com/Accountinfo Accountinfo

4 http://www.mycompany.com/Personalinformation/Index Personalinformation

...

我下面的代码不起作用。

slashparts = df['url'].split('/')

df['page'] = slashparts[4]

我得到的错误

AttributeError Traceback (most recent call last)

<ipython-input-23-0350a98a788c> in <module>()

----> 1 slashparts = df['request_url'].split('/')

2 df['page'] = slashparts[1]

~\Anaconda\lib\site-packages\pandas\core\generic.py in __getattr__(self, name)

4370 if

self._info_axis._can_hold_identifiers_and_holds_name(name):

4371 return self[name]

-> 4372 return object.__getattribute__(self, name)

4373

4374 def __setattr__(self, name, value):

AttributeError: 'Series' object has no attribute 'split'

有只小跳蛙

浏览 243回答 2

随时随地看视频慕课网APP