collections 模块：更多数据结构¶

collections 模块：更多数据结构¶

In [1]:

import collections

计数器¶

可以使用 Counter(seq) 对序列中出现的元素个数进行统计。

例如，我们可以统计一段文本中出现的单词及其出现的次数：

In [2]:

from string import punctuation
 
sentence = "One, two, three, one, two, tree, I come from China."
 
words_count = collections.Counter(sentence.translate(None, punctuation).lower().split())
 
print words_count

Counter({'two': 2, 'one': 2, 'from': 1, 'i': 1, 'tree': 1, 'three': 1, 'china': 1, 'come': 1})

双端队列¶

双端队列支持从队头队尾出入队：

In [3]:

dq = collections.deque()
 
for i in xrange(10):
    dq.append(i)
 
print dq
 
for i in xrange(10):
    print dq.pop(), 
 
print 
 
for i in xrange(10):
    dq.appendleft(i)
 
print dq
 
for i in xrange(10):
    print dq.popleft(),

deque([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
9 8 7 6 5 4 3 2 1 0
deque([9, 8, 7, 6, 5, 4, 3, 2, 1, 0])
9 8 7 6 5 4 3 2 1 0

与列表相比，双端队列在队头的操作更快：

In [4]:

lst = []
dq = collections.deque()
 
%timeit -n100 lst.insert(0, 10)
%timeit -n100 dq.appendleft(10)

100 loops, best of 3: 598 ns per loop
100 loops, best of 3: 291 ns per loop

有序字典¶

字典的 key 按顺序排列：

In [5]:

items = (
    ('A', 1),
    ('B', 2),
    ('C', 3)
)
 
regular_dict = dict(items)
ordered_dict = collections.OrderedDict(items)
 
print 'Regular Dict:'
for k, v in regular_dict.items():
    print k, v
 
print 'Ordered Dict:'
for k, v in ordered_dict.items():
    print k, v

Regular Dict:
A 1
C 3
B 2
Ordered Dict:
A 1
B 2
C 3

带默认值的字典¶

对于 Python 自带的词典 d，当 key 不存在的时候，调用 d[key] 会报错，但是 defaultdict 可以为这样的 key 提供一个指定的默认值，我们只需要在定义时提供默认值的类型即可，如果 key 不存在返回指定类型的默认值：

In [6]:

dd = collections.defaultdict(list)
 
print dd["foo"]
 
dd = collections.defaultdict(int)
 
print dd["foo"]
 
dd = collections.defaultdict(float)
 
print dd["foo"]

[]
0
0.0

原文: https://nbviewer.jupyter.org/github/lijin-THU/notes-python/blob/master/11-useful-tools/11.09-collections.ipynb

11.09 collections 模块：更多数据结构

collections 模块：更多数据结构¶

计数器¶

双端队列¶

有序字典¶

带默认值的字典¶