代码之家 › 专栏 › 技术社区 › R71

在python中实现高效的固定大小FIFO

numpy python-3.x python

R71 · 技术社区 · 6 年前

我需要在python或numpy中有效地实现固定大小的FIFO。我可能有不同的这样的FIFO,一些用于整数,一些用于字符串等。在这个FIFO中,我需要通过其索引访问每个元素。

对效率的关注是因为这些FIFOS将被用于一个计划的核心,该计划预计将连续运行几天,大量的数据预计将通过它们。因此,该算法不仅需要节省时间,而且还需要节省内存。

现在,在其他语言如C或Java中,我将有效地使用循环缓冲区和字符串指针(字符串FIFOS)来实现这一点。这是python/numpy中的有效方法,还是有更好的解决方案?

具体来说,以下哪种解决方案最有效:

(1)设置了maxlen值的dequeue:(垃圾收集对dequeue的效率有什么影响?)

import collections
l = collections.deque(maxlen=3)
l.append('apple'); l.append('banana'); l.append('carrot'); l.append('kiwi')
print(l, len(l), l[0], l[2])
> deque(['banana', 'carrot', 'kiwi'], maxlen=3) 3 banana kiwi

(2)列出子类解决方案(取自 Python, forcing a list to a fixed size ):

class L(list):
    def append(self, item):
        list.append(self, item)
        if len(self) > 3: self[:1]=[]
l2.append('apple'); l2.append('banana'); l2.append('carrot'); l2.append('kiwi')
print(l2, len(l2), l2[2], l2[0])
> ['banana', 'carrot', 'kiwi'] 3 kiwi banana

(3)一个普通的numpy数组。但这限制了字符串的大小,所以如何为其指定最大字符串大小?

a = np.array(['apples', 'foobar', 'cowboy'])
a[2] = 'bananadgege'
print(a)
> ['apples' 'foobar' 'banana']
# now add logic for manipulating circular buffer indices

(4)上述目标版本,但 python numpy array of arbitrary length strings 说使用对象会撤消numpy的好处

a = np.array(['apples', 'foobar', 'cowboy'], dtype=object)
a[2] = 'bananadgege'
print(a)
> ['apples' 'foobar' 'bananadgege']
# now add logic for manipulating circular buffer indices

(5)或者是否有比上述更有效的解决方案?

顺便说一句,如果有什么帮助的话,我的字符串在长度上有一个最大上限。

1 回复 | 直到 6 年前

John Zwinck 6 年前

dtype

np.zeros(128, (str, 32)) # 128 strings of up to 32 characters

推荐文章

serlingpa · 如何准备我的数据以避免无法推断频率

1 年前

Guillaume · 使用操作从Python列表创建numpy数组

2 年前

user19657580 · 在Python中打印两个numpy数组的列表

2 年前

user19657580 · Python中数组中具有相同元素的索引求和

2 年前

mikanim · 改进二维余弦函数的numpy功能

2 年前

Klimt865 · 在Python中将数组列表转换为列表列表

2 年前

theduker · 计算平均绝对误差时,If语句中赋值前引用的局部变量

2 年前

Lynn · 如果列包含Python中的特定字符串,则从列中删除值

2 年前

JasonX · 运行减法计算

2 年前

Jan Hrubec · 选择numpy数组的前n个元素

2 年前