Re: Factor

Factor: the language, the theory, and the practice.

Really Big Numbers

Thursday, September 22, 2011

Factor supports both `fixnum` (fixed size integers, typically 32- or 64-bit values) and `bignum` (arbitrarily large integers). Recently, I discovered that Factor did not have support for calculating the logarithm of really big numbers (those larger than 21024).

You can define a simple factorial function:

``````: factorial ( n -- n! )
[ 1 ] [ [1..b] product ] if-zero ;
``````

But if you tried to calculate the logarithm of `1000 factorial`, it produces the wrong answer.

``````IN: scratchpad 1000 factorial log .
1/0.
``````

The reason for this is that Factor attempts to convert a `bignum` into a double-precision floating point number and take the logarithm of that. Unfortunately, the value in this case is too large. What do other languages do in this case?

We could look at Ruby, but it has the same problem as Factor:

``````>> Math::log((1..1000).inject(:*))
(irb):8: warning: Bignum out of Float range
=> Infinity
``````

However, you can get the right answer in Python:

``````>>> def factorial(n):
...    r = 1
...    while n > 0:
...        r *= n
...        n -= 1
...    return r
...
>>> math.log(factorial(1000))
5912.128178488163
``````

If you look under the covers, you will see that Python handles this case by calling frexp to split a value into a fraction (`x`) and a power of two (`exp`). The original value can be calculated as `x*2exp`. Using this, the logarithm can be computed as `log(x) + log(2) * exp`.

After discussing this on #concatenative, Joe Groff and I came up with a solution for this. I’m not going to go over all the details, but if you’re curious, you can look at the discussion.

First, we implemented a cross-platform version of `frexp`:

``````GENERIC: frexp ( x -- y exp )

M: float frexp
dup fp-special? [ dup zero? ] unless* [ 0 ] [
double>bits
[
0x800f,ffff,ffff,ffff bitand
0.5 double>bits bitor bits>double
] [ -52 shift 0x7ff bitand 1022 - ] bi
] if ; inline

M: integer frexp
[ 0.0 0 ] [
dup 0 > [ 1 ] [ abs -1 ] if swap dup log2 [
52 swap - shift 0x000f,ffff,ffff,ffff bitand
0.5 double>bits bitor bits>double
] [ 1 + ] bi [ * ] dip
] if-zero ; inline
``````

Next, we added support for `log` and `log10` of `bignum`. If the number can be represented as a float, we continue to process it as before, but if it is larger, we calculate it similar to Python (with some caching of the `log(2)` and `log10(2)` values for performance):

``````: most-negative-finite-float ( -- x )
-0x1.ffff,ffff,ffff,fp1023 >integer ; inline
: most-positive-finite-float ( -- x )
0x1.ffff,ffff,ffff,fp1023 >integer ; inline

CONSTANT: log-2   0x1.62e42fefa39efp-1
CONSTANT: log10-2 0x1.34413509f79ffp-2

: (representable-as-float?) ( x -- ? )
most-negative-finite-float
most-positive-finite-float between? ; inline

: (bignum-log) ( n log-quot: ( x -- y ) log-2 -- log )
[ dup ] dip '[
dup (representable-as-float?)
[ >float @ ] [ frexp [ @ ] [ _ * ] bi* + ] if
] call ; inline

M: bignum log [ log ] log-2 (bignum-log) ;

M: bignum log10 [ log10 ] log10-2 (bignum-log) ;
``````

And now, in the listener you can get the answer!

``````IN: scratchpad 1000 factorial log .
5912.128178488163
``````

This change is now in the Factor repository (if you’d like to update), and will be in the next release.