关于counter：Python：获取字符串中形容词的计数

Python :getting the count for the adjectives in a string

我有根绳子S="X先生太棒了。他很了不起，Y先生也很了不起。"

我需要从字符串中提取所有形容词以及每个形容词的计数。例如这个字符串有形容词"棒极了"，"棒极了"，其中2个表示棒极了，1个表示棒极了。

为了提取形容词，我使用了nltk。这是提取形容词的代码，

1	adjectives =[token for token, pos in nltk.pos_tag(nltk.word_tokenize(b)) if pos.startswith('JJ')]

我需要代码为字符串中的每个形容词获取一个计数器。应该是这样的形容词：计数器

相关讨论

您可以使用collections.Counter：

1
2
3
4
5
6

>>> from collections import Counter

>>> adjectives = ['awesome', 'amazing', 'awesome']
>>> counts = Counter(adjectives)
>>> counts.items()
[('awesome', 2), ('amazing', 1)]

如果您愿意，可以将其转换为字典：

1 2	>>> dict(counts.items()) {'amazing': 1, 'awesome': 2}

号

或者您可以访问键和值：

1
2
3
4

>>> for key in counts.keys():
... print key, counts.get(key)
awesome 2
amazing 1

编辑：

对于列表列表，需要展开列表：

1
2
3
4
5
6

>>> adjectives = [['awesome', 'amazing'], ['good', 'nice' ]]
>>> counts = Counter(adjective
... for group in adjectives
... for adjective in group)
>>> counts
Counter({'awesome': 1, 'good': 1, 'amazing': 1, 'nice': 1})

。

或使用itertools.chain.from_iterable：

1
2
3

>>> from itertools import chain
>>> Counter(chain.from_iterable(adjectives))
Counter({'awesome': 1, 'good': 1, 'amazing': 1, 'nice': 1})