关于python：有没有更快的方法将任意大整数转换为大端字节序列？

Is there a faster way to convert an arbitrary large integer to a big endian sequence of bytes?

我有这个python代码来做这个：

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29

from struct import pack as _pack

def packl(lnum, pad = 1):
if lnum < 0:
raise RangeError("Cannot use packl to convert a negative integer"
"to a string.")
count = 0
l = []
while lnum > 0:
l.append(lnum & 0xffffffffffffffffL)
count += 1
lnum >>= 64
if count <= 0:
return '\0' * pad
elif pad >= 8:
lens = 8 * count % pad
pad = ((lens != 0) and (pad - lens)) or 0
l.append('>' + 'x' * pad + 'Q' * count)
l.reverse()
return _pack(*l)
else:
l.append('>' + 'Q' * count)
l.reverse()
s = _pack(*l).lstrip('\0')
lens = len(s)
if (lens % pad) != 0:
return '\0' * (pad - lens % pad) + s
else:
return s

在我的机器上，将2**9700 - 1转换成一个字节串大约需要174个usc。如果我愿意使用python 2.7和python 3.x特定的bit_length方法，我可以通过预先将l数组分配到一开始的正确大小，并使用l[something] =语法而不是l.append来将其缩短到159个。

我能做些什么来加快速度吗？这将用于转换密码术中使用的大素数以及一些(但不是很多)小的数。

编辑

这是目前python<3.2中速度最快的选项，它需要大约一半的时间来作为公认的答案：

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28

def packl(lnum, padmultiple=1):
"""Packs the lnum (which must be convertable to a long) into a
byte string 0 padded to a multiple of padmultiple bytes in size. 0
means no padding whatsoever, so that packing 0 result in an empty
string. The resulting byte string is the big-endian two's
complement representation of the passed in long."""

if lnum == 0:
return b'\0' * padmultiple
elif lnum < 0:
raise ValueError("Can only convert non-negative numbers.")
s = hex(lnum)[2:]
s = s.rstrip('L')
if len(s) & 1:
s = '0' + s
s = binascii.unhexlify(s)
if (padmultiple != 1) and (padmultiple != 0):
filled_so_far = len(s) % padmultiple
if filled_so_far != 0:
s = b'\0' * (padmultiple - filled_so_far) + s
return s

def unpackl(bytestr):
"""Treats a byte string as a sequence of base 256 digits
representing an unsigned integer in big-endian format and converts
that representation into a Python integer."""

return int(binascii.hexlify(bytestr), 16) if len(bytestr) > 0 else 0

在python 3.2中，int类具有to_bytes和from_bytes函数，可以比上面给出的方法更快地完成这一任务。

相关讨论

这里有一个通过ctypes调用python/c api的解决方案。目前，它使用numpy，但如果numpy不是一个选项，它可以完全使用ctypes。

1
2
3
4
5
6
7
8
9
10
11
12
13

import numpy
import ctypes
PyLong_AsByteArray = ctypes.pythonapi._PyLong_AsByteArray
PyLong_AsByteArray.argtypes = [ctypes.py_object,
numpy.ctypeslib.ndpointer(numpy.uint8),
ctypes.c_size_t,
ctypes.c_int,
ctypes.c_int]

def packl_ctypes_numpy(lnum):
a = numpy.zeros(lnum.bit_length()//8 + 1, dtype=numpy.uint8)
PyLong_AsByteArray(lnum, a, a.size, 0, 1)
return a

在我的机器上，这比你的方法快15倍。

编辑：这里有相同的代码，只使用ctypes，返回字符串而不是numpy数组：

1
2
3
4
5
6
7
8
9
10
11
12

import ctypes
PyLong_AsByteArray = ctypes.pythonapi._PyLong_AsByteArray
PyLong_AsByteArray.argtypes = [ctypes.py_object,
ctypes.c_char_p,
ctypes.c_size_t,
ctypes.c_int,
ctypes.c_int]

def packl_ctypes(lnum):
a = ctypes.create_string_buffer(lnum.bit_length()//8 + 1)
PyLong_AsByteArray(lnum, a, len(a), 0, 1)
return a.raw

这又快了两倍，加起来我的机器的加速系数是30。

相关讨论

不过，这会不会使用系统本机的endianness？
@卡尔：不，不会的。PyLong_AsByteArray()的第四个参数表示要使用哪一个endianness：0表示big endians，其他的都表示little endians。
令人惊叹的。现在我希望这件事能直接暴露出来…：
这个API在不同版本的python上有很大的变化吗？
当然，你也有点作弊。：-)您不支持填充。但这是一个非常小的细节。
@无所不知：至少在python 2.4之后，函数_PyLong_AsByteArray()没有改变，我认为ctypes对于早期版本的python(从2.5开始随python提供)不可用。我用Python2.6、2.7、3.1和3.2成功地测试了它，所以它看起来相当健壮，尽管我甚至不确定它是否是官方接口的一部分。
@但是python 2.6似乎缺少bit_length函数。你是自己拼凑起来的吗？
@无所不知：关于填充：包含这个很简单——_PyLong_AsByteArray()使用给定大小的整个缓冲区。如果缓冲区太小，则返回-1(0表示成功)，表示出错。
@无所不知：不，对于2.6，我使用了i = 2**9700-1和硬编码的大小1213作为测试目的：)你也需要它用于2.6吗？
@Sven Marnach-不是真的。我在写一些在野外几个月都看不到的东西，所以我很乐意放弃2.6及更早的版本。
@万能的：好的——否则就可以访问long的内部字段来提取位长度。最后一句话：上面的代码会阻塞常规的int对象，它只适用于long对象。对于python 3.x，这种区别已经不存在了。
@SvenMarnach——事实上，如果不是long，它就会崩溃。：-)我把它修好了，这样就没问题了。此代码将进入GPLv3+工作。我想你不在乎。你希望归属吗？我想如果我想要2.6的兼容性，我会回到原来的方法。
@无所不能：很高兴听到它将是免费软件。当然，没有归属。
int(binascii.hexlify(stringbytes), 16)比ctypes.pythonapi._PyLong_FromByteArray快。谁会打雷？
@无所不知：我确实感到惊讶，尤其是在看了后者的源代码之后——看起来像是一个非常直接的C实现。
@Sven Marnach-请随意使用timeit进行测试。我也很惊讶。我猜这是ctypes的开销。
@无所不知：我相信你的话，而ctypes开销似乎是一个很好的解释。

为了完整性和将来的读者：

从python 3.2开始，有函数int.from_bytes()和int.to_bytes()，通过选择字节顺序来执行bytes和int对象之间的转换。

相关讨论

只是想发布一个对Sven答案的跟进(这很有效)。相反的操作-从任意长字节对象到python integer对象需要以下内容(因为我找不到pylong-frombytearray()c api函数)：

1
2
3
4
5
6
7

import binascii

def unpack_bytes(stringbytes):
#binascii.hexlify will be obsolete in python3 soon
#They will add a .tohex() method to bytes class
#Issue 3532 bugs.python.org
return int(binascii.hexlify(stringbytes), 16)

相关讨论

我想你真的应该只是使用numpy，我确信它有一些内置的功能。使用array模块进行黑客攻击也可能更快。但我还是要试试看。

imx，创建一个生成器并使用列表理解和/或内置求和比附加到列表的循环更快，因为附加可以在内部完成。哦，大绳子上的"lstrip"一定很贵。

此外，还有一些风格要点：特殊情况还不够特殊；而且您似乎没有收到有关新x if y else z构造的备忘录。：)尽管我们不需要它。；)

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25

from struct import pack as _pack

Q_size = 64
Q_bitmask = (1L << Q_size) - 1L

def quads_gen(a_long):
while a_long:
yield a_long & Q_bitmask
a_long >>= Q_size

def pack_long_big_endian(a_long, pad = 1):
if lnum < 0:
raise RangeError("Cannot use packl to convert a negative integer"
"to a string.")
qs = list(reversed(quads_gen(a_long)))
# Pack the first one separately so we can lstrip nicely.
first = _pack('>Q', qs[0]).lstrip('\x00')
rest = _pack('>%sQ' % len(qs) - 1, *qs[1:])
count = len(first) + len(rest)
# A little math trick that depends on Python's behaviour of modulus
# for negative numbers - but it's well-defined and documented
return '\x00' * (-count % pad) + first + rest

相关讨论