打印使用 gensim 学习的二元组

我想使用 gensim 从语料库中学习 bigrams,然后打印学习到的 bigrams。我还没有看到这样做的例子。帮助赞赏


from gensim.models import Phrases

documents = ["the mayor of new york was there", "human computer interaction and machine learning has now become a trending research area","human computer interaction is interesting","human computer interaction is a pretty interesting subject", "human computer interaction is a great and new subject", "machine learning can be useful sometimes","new york mayor was present", "I love machine learning because it is a new subject area", "human computer interaction helps people to get user friendly applications"]

sentence_stream = [doc.split(" ") for doc in documents]


bigram = Phrases(sentence_stream)


# how can I print all bigrams learned and just the bigrams, including "new_york" and "human computer" ?enter code here


翻阅古今
浏览 167回答 2
2回答

慕桂英546537

import operatorsorted(    {k:v for k,v in bigram.vocab.items() if b'_' in k if v>=bigram.min_count}.items(),    key=operator.itemgetter(1),    reverse=True)
打开App,查看更多内容
随时随地看视频慕课网APP

相关分类

Python