如何获得每组 X 次以上相同单词的平均值?
但在这里,我想连续获得每组(group = name
)相同单词超过 4 次的平均值。
例子:
id | name | sentences
---------------------
1 | aa | david hi david david david
2 | aa | david david is at home
3 | bb | I'm king
4 | cc | where r u going
5 | dd | lol lol lol lol lol lol
6 | ee | abc abc cc abc abc abc abc cc
7 | ee | dd dd dd ee dd dd dd
我想得到以下结果:
name | avg
----------
aa | 0.0 (0 sentence contain the words 'david' continuously 4 times in ). total instances of 'aa' group is 2
bb | 0.0 (0 sentence contains same word continuously 4 times)
cc | 0.0 (0 sentence contains same word continuously 4 times)
dd | 1.0 (1 sentence contains same word 'lol' continuously 4 times). total instances of 'dd' group is 1
ee | 0.5 (1 sentence contains same word 'abc' continuously 4 times). total instances of 'dd' group is 2
I'm using python 3.6.8
汪汪一只猫
相关分类