HOME

TheInfoList



OR:

The Karatsuba algorithm is a fast multiplication algorithm for
integers An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
. It was discovered by
Anatoly Karatsuba Anatoly Alexeyevich Karatsuba (his first name often spelled Anatolii) (; Grozny, Soviet Union, 31 January 1937 – Moscow, Russia, 28 September 2008) was a Russian people, Russian mathematician working in the field of analytic number theory, p-ad ...
in 1960 and published in 1962. Knuth D.E. (1969) '' The Art of Computer Programming. v.2.'' Addison-Wesley Publ.Co., 724 pp. It is a divide-and-conquer algorithm that reduces the multiplication of two ''n''-digit numbers to three multiplications of ''n''/2-digit numbers and, by repeating this reduction, to at most n^\approx n^ single-digit multiplications. It is therefore asymptotically faster than the traditional algorithm, which performs n^2 single-digit products. The Karatsuba algorithm was the first multiplication algorithm asymptotically faster than the quadratic "grade school" algorithm. The Toom–Cook algorithm (1963) is a faster generalization of Karatsuba's method, and the Schönhage–Strassen algorithm (1971) is even faster, for sufficiently large ''n''.


History

The standard procedure for multiplication of two ''n''-digit numbers requires a number of elementary operations proportional to n^2\,\!, or O(n^2)\,\! in big-O notation.
Andrey Kolmogorov Andrey Nikolaevich Kolmogorov ( rus, Андре́й Никола́евич Колмого́ров, p=ɐnˈdrʲej nʲɪkɐˈlajɪvʲɪtɕ kəlmɐˈɡorəf, a=Ru-Andrey Nikolaevich Kolmogorov.ogg, 25 April 1903 – 20 October 1987) was a Soviet ...
conjectured that the traditional algorithm was '' asymptotically optimal,'' meaning that any algorithm for that task would require \Omega(n^2)\,\! elementary operations. In 1960, Kolmogorov organized a seminar on mathematical problems in
cybernetics Cybernetics is the transdisciplinary study of circular causal processes such as feedback and recursion, where the effects of a system's actions (its outputs) return as inputs to that system, influencing subsequent action. It is concerned with ...
at the
Moscow State University Moscow State University (MSU), officially M. V. Lomonosov Moscow State University,. is a public university, public research university in Moscow, Russia. The university includes 15 research institutes, 43 faculties, more than 300 departments, a ...
, where he stated the \Omega(n^2)\,\! conjecture and other problems in the complexity of computation. Within a week, Karatsuba, then a 23-year-old student, found an algorithm that multiplies two ''n''-digit numbers in O(n^) elementary steps, thus disproving the conjecture. Kolmogorov was very excited about the discovery; he communicated it at the next meeting of the seminar, which was then terminated. Kolmogorov gave some lectures on the Karatsuba result at conferences all over the world (see, for example, "Proceedings of the International Congress of Mathematicians 1962", pp. 351–356, and also "6 Lectures delivered at the International Congress of Mathematicians in Stockholm, 1962") and published the method in 1962, in the Proceedings of the USSR Academy of Sciences. The article had been written by Kolmogorov and contained two results on multiplication, Karatsuba's algorithm and a separate result by Yuri Ofman; it listed "A. Karatsuba and Yu. Ofman" as the authors. Karatsuba only became aware of the paper when he received the reprints from the publisher.


Algorithm


Basic step

The basic principle of Karatsuba's algorithm is divide-and-conquer, using a formula that allows one to compute the product of two large numbers x and y using three multiplications of smaller numbers, each with about half as many digits as x or y, plus some additions and digit shifts. This basic step is, in fact, a generalization of a similar complex multiplication algorithm, where the
imaginary unit The imaginary unit or unit imaginary number () is a mathematical constant that is a solution to the quadratic equation Although there is no real number with this property, can be used to extend the real numbers to what are called complex num ...
is replaced by a power of the base. Let x and y be represented as n-digit strings in some base B. For any positive integer m less than n, one can write the two given numbers as :x = x_1 B^m + x_0, :y = y_1 B^m + y_0, where x_0 and y_0 are less than B^m. The product is then \begin xy &= (x_1 B^m + x_0)(y_1 B^m + y_0) \\ &= x_1 y_1 B^ + (x_1 y_0 + x_0 y_1) B^m + x_0 y_0 \\ &= z_2 B^ + z_1 B^m + z_0, \\ \end where :z_2 = x_1 y_1, :z_1 = x_1 y_0 + x_0 y_1, :z_0 = x_0 y_0. These formulae require four multiplications and were known to
Charles Babbage Charles Babbage (; 26 December 1791 – 18 October 1871) was an English polymath. A mathematician, philosopher, inventor and mechanical engineer, Babbage originated the concept of a digital programmable computer. Babbage is considered ...
. Karatsuba observed that xy can be computed in only three multiplications, at the cost of a few extra additions. With z_0 and z_2 as before and z_3=(x_1 + x_0) (y_1 + y_0), one can observe that : \begin z_1 &= x_1 y_0 + x_0 y_1 \\ &= (x_1 + x_0) (y_0 + y_1) - x_1 y_1 - x_0 y_0 \\ &= z_3 - z_2 - z_0. \\ \end Thus only three multiplications are required for computing z_0, z_1 and z_2.


Example

To compute the product of 12345 and 6789, where ''B'' = 10, choose ''m'' = 3. We use ''m'' right shifts for decomposing the input operands using the resulting base (''B''''m'' = ''1000''), as: : 12345 = 12 · ''1000'' + 345 : 6789 = 6 · ''1000'' + 789 Only three multiplications, which operate on smaller integers, are used to compute three partial results: : ''z''2 = 12 × 6 = 72 : ''z''0 = 345 × 789 = 272205 : ''z''1 = (12 + 345) × (6 + 789) − ''z''2 − ''z''0 = 357 × 795 − 72 − 272205 = 283815 − 72 − 272205 = 11538 We get the result by just adding these three partial results, shifted accordingly (and then taking carries into account by decomposing these three inputs in base ''1000'' as for the input operands): : result = ''z''2 · (''B''''m'')''2'' + ''z''1 · (''B''''m'')''1'' + ''z''0 · (''B''''m'')''0'', i.e. : result = 72 · ''1000''2 + 11538 · ''1000'' + 272205 = 83810205. Note that the intermediate third multiplication operates on an input domain which is less than two times larger than for the two first multiplications, its output domain is less than four times larger, and base-''1000'' carries computed from the first two multiplications must be taken into account when computing these two subtractions.


Recursive application

If ''n'' is four or more, the three multiplications in Karatsuba's basic step involve operands with fewer than ''n'' digits. Therefore, those products can be computed by recursive calls of the Karatsuba algorithm. The recursion can be applied until the numbers are so small that they can (or must) be computed directly. In a computer with a full 32-bit by 32-bit multiplier, for example, one could choose ''B'' = 231 and store each digit as a separate 32-bit binary word. Then the sums ''x''1 + ''x''0 and ''y''1 + ''y''0 will not need an extra binary word for storing the carry-over digit (as in carry-save adder), and the Karatsuba recursion can be applied until the numbers to multiply are only one digit long.


Time complexity In theoretical computer science, the time complexity is the computational complexity that describes the amount of computer time it takes to run an algorithm. Time complexity is commonly estimated by counting the number of elementary operations ...
analysis

Karatsuba's basic step works for any base ''B'' and any ''m'', but the recursive algorithm is most efficient when ''m'' is equal to ''n''/2, rounded up. In particular, if ''n'' is 2''k'', for some integer ''k'', and the recursion stops only when ''n'' is 1, then the number of single-digit multiplications is 3''k'', which is ''n''''c'' where ''c'' = log23. Since one can extend any inputs with zero digits until their length is a power of two, it follows that the number of elementary multiplications, for any ''n'', is at most 3^ \leq 3 n^\,\!. Since the additions, subtractions, and digit shifts (multiplications by powers of ''B'') in Karatsuba's basic step take time proportional to ''n'', their cost becomes negligible as ''n'' increases. More precisely, if ''T''(''n'') denotes the total number of elementary operations that the algorithm performs when multiplying two ''n''-digit numbers, then :T(n) = 3 T(\lceil n/2\rceil) + cn + d for some constants ''c'' and ''d''. For this
recurrence relation In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a parameter ...
, the master theorem for divide-and-conquer recurrences gives the
asymptotic In analytic geometry, an asymptote () of a curve is a line such that the distance between the curve and the line approaches zero as one or both of the ''x'' or ''y'' coordinates Limit of a function#Limits at infinity, tends to infinity. In pro ...
bound T(n) = \Theta(n^)\,\!. It follows that, for sufficiently large ''n'', Karatsuba's algorithm will perform fewer shifts and single-digit additions than longhand multiplication, even though its basic step uses more additions and shifts than the straightforward formula. For small values of ''n'', however, the extra shift and add operations may make it run slower than the longhand method.


Implementation

Here is the pseudocode for this algorithm, using numbers represented in base ten. For the binary representation of integers, it suffices to replace everywhere 10 by 2. The second argument of the split_at function specifies the number of digits to extract from the ''right'': for example, split_at("12345", 3) will extract the 3 final digits, giving: high="12", low="345". function karatsuba(num1, num2) if (num1 < 10 or num2 < 10) return num1 × num2 /* fall back to traditional multiplication */ /* Calculates the size of the numbers. */ m = max(size_base10(num1), size_base10(num2)) m2 = floor(m / 2) /* m2 = ceil (m / 2) will also work */ /* Split the digit sequences in the middle. */ high1, low1 = split_at(num1, m2) high2, low2 = split_at(num2, m2) /* 3 recursive calls made to numbers approximately half the size. */ z0 = karatsuba(low1, low2) z1 = karatsuba(low1 + high1, low2 + high2) z2 = karatsuba(high1, high2) return (z2 × 10 ^ (m2 × 2)) + ((z1 - z2 - z0) × 10 ^ m2) + z0 An issue that occurs when implementation is that the above computation of (x_1 + x_0) and (y_1 + y_0) for z_1 may result in overflow (will produce a result in the range B^m \leq \text < 2 B^m), which require a multiplier having one extra bit. This can be avoided by noting that :z_1 = (x_0 - x_1)(y_1 - y_0) + z_2 + z_0. This computation of (x_0 - x_1) and (y_1 - y_0) will produce a result in the range of -B^m < \text < B^m. This method may produce negative numbers, which require one extra bit to encode signedness, and would still require one extra bit for the multiplier. However, one way to avoid this is to record the sign and then use the absolute value of (x_0 - x_1) and (y_1 - y_0) to perform an unsigned multiplication, after which the result may be negated when both signs originally differed. Another advantage is that even though (x_0 - x_1)(y_1 - y_0) may be negative, the final computation of z_1 only involves additions.


References


External links


Karatsuba's Algorithm for Polynomial Multiplication
* * Bernstein, D. J.,
Multidigit multiplication for mathematicians
. Covers Karatsuba and many other multiplication algorithms. {{Number-theoretic algorithms Computer arithmetic algorithms Multiplication Articles with example pseudocode Divide-and-conquer algorithms