number theory Number theory (or arithmetic or higher arithmetic in older usage) is a branch of pure mathematics devoted primarily to the study of the integers and integer-valued functions. German mathematician Carl Friedrich Gauss (1777–1855) said, "Math ...

, Dixon's factorization method (also Dixon's random squares method or Dixon's algorithm) is a general-purpose integer factorization

algorithm In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for performing ...

; it is the prototypical

factor base In computational number theory, a factor base is a small set of prime numbers commonly used as a mathematical tool in algorithms involving extensive sieving for potential factors of a given integer. Usage in factoring algorithms A factor base is a ...

method. Unlike for other factor base methods, its run-time bound comes with a rigorous proof that does not rely on conjectures about the smoothness properties of the values taken by polynomial. The algorithm was designed by

John D. Dixon John is a common English name and surname: * John (given name) * John (surname) John may also refer to: New Testament Works * Gospel of John, a title often shortened to John * First Epistle of John, often shortened to 1 John * Seco ...

, a mathematician at

Carleton University Carleton University is an English-language public research university in Ottawa, Ontario, Canada. Founded in 1942 as Carleton College, the institution originally operated as a private, non-denominational evening college to serve returning Worl ...

, and was published in 1981.

Basic idea

Dixon's method is based on finding a

congruence of squares In number theory, a congruence of squares is a congruence commonly used in integer factorization algorithms. Derivation Given a positive integer ''n'', Fermat's factorization method relies on finding numbers ''x'' and ''y'' satisfying the equal ...

modulo the integer N which is intended to factor.

Fermat's factorization method Fermat's factorization method, named after Pierre de Fermat, is based on the representation of an odd integer as the difference of two squares: :N = a^2 - b^2. That difference is algebraically factorable as (a+b)(a-b); if neither factor equals one ...

finds such a congruence by selecting random or

pseudo-random A pseudorandom sequence of numbers is one that appears to be statistically random, despite having been produced by a completely deterministic and repeatable process. Background The generation of random numbers has many uses, such as for random ...

''x'' values and hoping that the integer ''x''² mod N is a perfect square (in the integers): :

x^2\equiv y^2\quad(\hboxN),\qquad x\not\equiv\pm y\quad(\hboxN).

For example, if , (by starting at 292, the first number greater than and counting up) the is 256, the square of 16. So . Computing the

greatest common divisor In mathematics, the greatest common divisor (GCD) of two or more integers, which are not all zero, is the largest positive integer that divides each of the integers. For two integers ''x'', ''y'', the greatest common divisor of ''x'' and ''y'' i ...

of and ''N'' using

Euclid's algorithm In mathematics, the Euclidean algorithm,Some widely used textbooks, such as I. N. Herstein's ''Topics in Algebra'' and Serge Lang's ''Algebra'', use the term "Euclidean algorithm" to refer to Euclidean division or Euclid's algorithm, is an ...

gives 163, which is a factor of ''N''. In practice, selecting random ''x'' values will take an impractically long time to find a congruence of squares, since there are only squares less than ''N''. Dixon's method replaces the condition "is the square of an integer" with the much weaker one "has only small prime factors"; for example, there are 292 squares smaller than 84923; 662 numbers smaller than 84923 whose prime factors are only 2,3,5 or 7; and 4767 whose prime factors are all less than 30. (Such numbers are called '' B-smooth'' with respect to some bound ''B''.) If there are many numbers

a_1 \ldots a_n

whose squares can be factorized as

a_i^2 \mod N = \prod_^m b_j^

for a fixed set

b_1 \ldots b_m

of small primes, linear algebra modulo 2 on the matrix

e_

will give a subset of the

a_i

whose squares combine to a product of small primes to an even power — that is, a subset of the

a_i

whose squares multiply to the square of a (hopefully different) number mod N.

Method

Suppose the composite number ''N'' is being factored. Bound ''B'' is chosen, and the ''

'' is identified (which is called ''P''), the set of all primes less than or equal to ''B''. Next, positive integers ''z'' are sought such that ''z''² mod ''N'' is ''B''-smooth. Therefore it can be written, for suitable exponents ''a_i'', :

z^2 \text N = \prod_ p_i^

When enough of these relations have been generated (it is generally sufficient that the number of relations be a few more than the size of ''P''), the methods of

linear algebra Linear algebra is the branch of mathematics concerning linear equations such as: :a_1x_1+\cdots +a_nx_n=b, linear maps such as: :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matric ...

can be used (for example,

Gaussian elimination In mathematics, Gaussian elimination, also known as row reduction, is an algorithm for solving systems of linear equations. It consists of a sequence of operations performed on the corresponding matrix of coefficients. This method can also be used ...

) to multiply together these various relations in such a way that the exponents of the primes on the right-hand side are all even: :

This yields a

of the form which can be turned into a factorization of ''N'', This factorization might turn out to be trivial (i.e. ), which can only happen if in which case another try has to be made with a different combination of relations; but a nontrivial pair of factors of ''N'' can be reached, and the algorithm will terminate.

Pseudocode

input: positive integer

N

output: non-trivial factor of

N

Choose bound

B

Let

P := \

be all primes

\leq B

repeat for

i=1

k+1

do Choose

0 such that z_i^2 \text N is B -smooth
         Let a_i := \ such that z_i^2 \text N = \prod_ p_j^end for
 
     Find non-empty T \subseteq \ such that \sum_ a_i \equiv \vec \pmod Let x := \left(\prod_ z_i\right) \text N y := \left(\prod_ p_j^ \right) \text N while x \equiv \pm y \pmod return \gcd(x+y, N)

Example

This example will try to factor ''N'' = 84923 using bound ''B'' = 7. The

is then ''P'' = . A search can be made for integers between

\left\lceil\sqrt \right\rceil = 292

and ''N'' whose squares mod ''N'' are ''B''-smooth. Suppose that two of the numbers found are 513 and 537: :

513^2 \mod 84923 = 8400 = 2^4 \cdot 3 \cdot 5^2 \cdot 7

537^2 \mod 84923 = 33600 = 2^6 \cdot 3 \cdot 5^2 \cdot 7

So :

(513 \cdot 537)^2 \mod 84923 = 2^ \cdot 3^2 \cdot 5^4 \cdot 7^2 \mod 84923

Then

\begin
&  (513 \cdot 537)^2 \mod 84923 \\
& = (275481)^2 \mod 84923 \\
& = (84923 \cdot 3 + 20712)^2 \mod 84923 \\
& =(84923 \cdot 3)^2 + 2\cdot(84923\cdot 3 \cdot 20712) + 20712^2 \mod 84923 \\
&  = 0 + 0 + 20712^2 \mod 84923
\end

That is,

20712^2 \mod 84923 = (2^5 \cdot 3 \cdot 5^2 \cdot 7)^2 \mod 84923 = 16800^2 \mod 84923.

The resulting factorization is 84923 = gcd(20712 − 16800, 84923) × gcd(20712 + 16800, 84923) = 163 × 521.

Optimizations

The

quadratic sieve The quadratic sieve algorithm (QS) is an integer factorization algorithm and, in practice, the second fastest method known (after the general number field sieve). It is still the fastest for integers under 100 decimal digits or so, and is considerab ...

is an optimization of Dixon's method. It selects values of ''x'' close to the square root of such that ''x²'' modulo ''N'' is small, thereby largely increasing the chance of obtaining a smooth number. Other ways to optimize Dixon's method include using a better algorithm to solve the matrix equation, taking advantage of the sparsity of the matrix: a number ''z'' cannot have more than

\log_2 z

factors, so each row of the matrix is almost all zeros. In practice, the

block Lanczos algorithm In computer science, the block Lanczos algorithm is an algorithm for finding the nullspace of a matrix over a finite field, using only multiplication of the matrix by long, thin matrices. Such matrices are considered as vectors of tuples of finite-f ...

is often used. Also, the size of the factor base must be chosen carefully: if it is too small, it will be difficult to find numbers that factorize completely over it, and if it is too large, more relations will have to be collected. A more sophisticated analysis, using the approximation that a number has all its prime factors less than

N^

with probability about

a^

(an approximation to the

Dickman–de Bruijn function In analytic number theory, the Dickman function or Dickman–de Bruijn function ''ρ'' is a special function used to estimate the proportion of smooth numbers up to a given bound. It was first studied by actuary Karl Dickman, who defined it in hi ...

), indicates that choosing too small a factor base is much worse than too large, and that the ideal factor base size is some power of

\exp\left(\sqrt\right)

. The optimal complexity of Dixon's method is :

O\left(\exp\left(2 \sqrt 2 \sqrt\right)\right)

big-O notation Big ''O'' notation is a mathematical notation that describes the limiting behavior of a function when the argument tends towards a particular value or infinity. Big O is a member of a family of notations invented by Paul Bachmann, Edmund Land ...

, or :

L_n /2, 2 \sqrt 2 /math>
in

L-notation ''L''-notation is an asymptotic notation analogous to big-O notation, denoted as L_n alpha,c/math> for a bound variable n tending to infinity. Like big-O notation, it is usually used to roughly convey the rate of growth of a function, such as the ...

References

{{Number theoretic algorithms Integer factorization algorithms Squares in number theory