Optimize wcwidth() #35

avylove · 2020-03-22T22:38:03Z

Some minor optimizations for wcwidth(). Should result in ~30% performance gain.

>>> from timeit import timeit
>>> timeit('wcwidth("a")', setup='from wcwidth import wcwidth', number=10000000)

Before optimizations:
7.065002638002625

After optimizations:
4.69557189499028

The main speedups come from using a set instead of a chain of boolean comparisons and passing the upper bound of the table into the binary search instead of calling length on the table for each run.

coveralls · 2020-03-22T22:39:09Z

Coverage remained the same at ?% when pulling 69dc30c on avylove:optimize_wcwidth into 3b1a268 on jquast:master.

jquast · 2020-03-22T22:48:41Z

thank you!

jquast · 2020-06-01T15:41:42Z

I've had to unroll the ubound optimization in #23

avylove · 2020-06-01T16:19:32Z

Makes sense.
Do you think it would be worth adding lru_cache to _bisearch() and/or wcwidth() itself? My guess is, in the average use case, a small set of characters will be looked up repeatedly. Even in the case where the language itself is all unicode and the characters used exceed the size of the cache, there are going to be more common characters that will benefit from cached lookups.

jquast · 2020-06-01T16:26:15Z

You're probably right, I will do that, maybe a size of 1,000.

I've been mostly "view all characters, top-bottom" in my own tests, but considering real-world use cases, folks will probably stay within their own language or range of emoticons :-)

Optimize wcwidth()

69dc30c

jquast merged commit 4dac3f0 into jquast:master Mar 23, 2020

dependabot bot mentioned this pull request Mar 16, 2021

Bump wcwidth from 0.1.7 to 0.2.5 internetarchive/openlibrary#4816

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize wcwidth() #35

Optimize wcwidth() #35

avylove commented Mar 22, 2020

coveralls commented Mar 22, 2020

jquast commented Mar 22, 2020

jquast commented Jun 1, 2020

avylove commented Jun 1, 2020

jquast commented Jun 1, 2020

Optimize wcwidth() #35

Optimize wcwidth() #35

Conversation

avylove commented Mar 22, 2020

coveralls commented Mar 22, 2020

jquast commented Mar 22, 2020

jquast commented Jun 1, 2020

avylove commented Jun 1, 2020

jquast commented Jun 1, 2020