Exponentiation by squaring

In mathematics and computer programming, exponentiating by squaring is a general method for fast computation of large positive integer powers of a number, or more generally of an element of a semigroup, like a polynomial or a square matrix. Some variants are commonly referred to as square-and-multiply algorithms or binary exponentiation. These can be of quite general use, for example in modular arithmetic or powering of matrices. For semigroups for which additive notation is commonly used, like elliptic curves used in cryptography, this method is also referred to as double-and-add.

Basic method

Recursive version

The method is based on the observation that, for any integer <math>n > 0</math>, one has
<math display="block"> x^n =
  \begin{cases}
  x \, (x^2)^{(n-1)/2} & \text{if } n \text{ is odd}, \\
  (x^2)^{n/2} & \text{if } n \text{ is even}.
  \end{cases}
</math>

If the exponent n is zero, then the answer is 1. If the exponent is negative then we can reuse the previous formula by rewriting the value using a positive exponent. That is,
<math display="block">x^n = \left(\frac{1}{x}\right)^{-n}.</math>

Together, these may be implemented directly as the following recursive algorithm:

Inputs: a real number x; an integer n
Output: x<sup>n</sup>

function exp_by_squaring(x, n) is
    if n < 0 then
        return exp_by_squaring(1 / x, −n)
    else if n = 0 then
        return 1
    else if n is even then
        return exp_by_squaring(x × x, n / 2)
    else if n is odd then
        return x × exp_by_squaring(x × x, (n − 1) / 2)
end function
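
The pseudocode above translates almost directly into executable code. The following Python sketch is only an illustration of that translation (the function name mirrors the pseudocode; Python's arbitrary-precision integers are an incidental convenience, not part of the method):

<syntaxhighlight lang="python">
def exp_by_squaring(x, n):
    """Recursive square-and-multiply, following the pseudocode above."""
    if n < 0:
        return exp_by_squaring(1 / x, -n)
    elif n == 0:
        return 1
    elif n % 2 == 0:                    # n is even
        return exp_by_squaring(x * x, n // 2)
    else:                               # n is odd
        return x * exp_by_squaring(x * x, (n - 1) // 2)

assert exp_by_squaring(2, 10) == 1024   # 2^10, using 4 squarings and 2 multiplications
</syntaxhighlight>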

In each recursive call, the least-significant digit of the binary representation of n is removed. It follows that the number of recursive calls is <math>\lceil \log_2 n\rceil,</math> the number of bits of the binary representation of n. So this algorithm computes this number of squares and a lower number of multiplications, which is equal to the number of 1s in the binary representation of n. This logarithmic number of operations is to be compared with the trivial algorithm which requires n − 1 multiplications.

This algorithm is not tail-recursive. This implies that it requires an amount of auxiliary memory that is roughly proportional to the number of recursive calls, or perhaps higher if the amount of data per iteration is increasing.

The algorithms of the next section use a different approach, and the resulting algorithms need the same number of operations, but use auxiliary memory that is roughly the same as the memory required to store the result.

With constant auxiliary memory

The variants described in this section are based on the formula
<math display="block"> y x^n =
  \begin{cases}
  yx \, (x^2)^{(n-1)/2} & \text{if } n \text{ is odd}, \\
  y \, (x^2)^{n/2} & \text{if } n \text{ is even}.
  \end{cases}
</math>

If one applies this formula recursively, starting with y = 1, one eventually gets an exponent equal to 0, and the desired result is then the left factor.

This may be implemented as a tail-recursive function:

Function exp_by_squaring(x, n)
    return exp_by_squaring2(1, x, n)

Function exp_by_squaring2(y, x, n)
    if n < 0 then return exp_by_squaring2(y, 1 / x, -n);
    else if n = 0 then return y;
    else if n is even then return exp_by_squaring2(y, x * x, n / 2);
    else if n is odd then return exp_by_squaring2(x * y, x * x, (n - 1) / 2).
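
For illustration, the same two-function structure can be sketched in Python (note that Python does not perform tail-call elimination, so this transcription shows the control flow but not the constant-space behaviour):

<syntaxhighlight lang="python">
def exp_by_squaring(x, n):
    return exp_by_squaring2(1, x, n)

def exp_by_squaring2(y, x, n):
    """Accumulator version: the product y * x^n is kept invariant."""
    if n < 0:
        return exp_by_squaring2(y, 1 / x, -n)
    elif n == 0:
        return y
    elif n % 2 == 0:                    # n is even
        return exp_by_squaring2(y, x * x, n // 2)
    else:                               # n is odd
        return exp_by_squaring2(x * y, x * x, (n - 1) // 2)
</syntaxhighlight>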

The iterative version of the algorithm also uses a bounded auxiliary space, and is given by

Function exp_by_squaring_iterative(x, n)
    if n < 0 then
        x := 1 / x;
        n := -n;
    if n = 0 then return 1
    y := 1;
    while n > 1 do
        if n is odd then
            y := x * y;
            n := n - 1;
        x := x * x;
        n := n / 2;
    return x * y
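
A Python transcription of the iterative pseudocode above, given here purely as an illustrative sketch:

<syntaxhighlight lang="python">
def exp_by_squaring_iterative(x, n):
    """Iterative square-and-multiply using a bounded amount of auxiliary memory."""
    if n < 0:
        x = 1 / x
        n = -n
    if n == 0:
        return 1
    y = 1
    while n > 1:
        if n % 2 == 1:      # n is odd: move one factor of x into the accumulator y
            y = x * y
            n = n - 1
        x = x * x           # square the base
        n = n // 2
    return x * y

assert exp_by_squaring_iterative(2, 10) == 1024
</syntaxhighlight>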

The correctness of the algorithm results from the fact that <math>y x^n</math> is invariant during the computation; it is <math>1 \cdot x^n = x^n</math> at the beginning; and it is <math>y x^1 = xy</math> at the end.

These algorithms use exactly the same number of operations as the algorithm of the preceding section, but the multiplications are done in a different order.

Computational complexity

A brief analysis shows that such an algorithm uses <math>\lfloor \log_2 n \rfloor</math> squarings and at most <math>\lfloor \log_2 n \rfloor</math> multiplications, where <math>\lfloor\,\rfloor</math> denotes the floor function. More precisely, the number of multiplications is one less than the number of ones present in the binary expansion of n. For n greater than about 4 this is computationally more efficient than naively multiplying the base with itself repeatedly.

Each squaring results in approximately double the number of digits of the previous, and so, if multiplication of two d-digit numbers is implemented in <math>O(d^k)</math> operations for some fixed k, then the complexity of computing <math>x^n</math> is given by

<math display="block">\sum_{i=0}^{O(\log n)} \left(2^i O(\log x)\right)^k = O\left((n \log x)^k\right).</math>

2<sup>k</sup>-ary method

This algorithm calculates the value of x<sup>n</sup> after expanding the exponent in base 2<sup>k</sup>. It was first proposed by Brauer in 1939. In the algorithm below we make use of the following function f(0) = (k, 0) and f(m) = (s, u), where m = u·2<sup>s</sup> with u odd.

Algorithm:

Input
An element x of G, a parameter k > 0, a non-negative integer <math>n = (n_{l-1}, n_{l-2}, \dots, n_0)_{2^k}</math> and the precomputed values <math>x^3, x^5, \dots, x^{2^k - 1}</math>.
Output
The element x<sup>n</sup> in G
y := 1; i := l - 1
while i ≥ 0 do
    (s, u) := f(n<sub>i</sub>)
    for j := 1 to k - s do
        y := y<sup>2</sup>
    y := y * x<sup>u</sup>
    for j := 1 to s do
        y := y<sup>2</sup>
    i := i - 1
return y
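
The following Python sketch shows one possible realization of the method for a non-negative integer exponent; the helper names, the computation of the base-2<sup>k</sup> digits on the fly and the default k = 4 are illustrative assumptions, not part of Brauer's description:

<syntaxhighlight lang="python">
def exp_2k_ary(x, n, k=4):
    """2^k-ary exponentiation for n >= 0 (illustrative sketch)."""
    if n == 0:
        return 1
    # precomputed odd powers x^1, x^3, ..., x^(2^k - 1)
    odd_pow = {1: x}
    x2 = x * x
    for u in range(3, 2 ** k, 2):
        odd_pow[u] = odd_pow[u - 2] * x2

    # digits of n in base 2^k, most significant first
    digits = []
    m = n
    while m:
        digits.append(m % 2 ** k)
        m //= 2 ** k
    digits.reverse()

    def f(m):
        # f(0) = (k, 0); otherwise m = u * 2^s with u odd
        if m == 0:
            return k, 0
        s = 0
        while m % 2 == 0:
            m //= 2
            s += 1
        return s, m

    y = 1
    for d in digits:
        s, u = f(d)
        for _ in range(k - s):
            y = y * y
        if u:                   # multiplying by x^0 = 1 is skipped
            y = y * odd_pow[u]
        for _ in range(s):
            y = y * y
    return y

assert exp_2k_ary(3, 1000) == 3 ** 1000
</syntaxhighlight>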

For optimal efficiency, k should be the smallest integer satisfying[1]

<math display="block">\lg n < \frac{k(k+1) \cdot 2^{2k}}{2^{k+1} - k - 2} + 1.</math>

Sliding-window method

This method is an efficient variant of the 2<sup>k</sup>-ary method. For example, to calculate the exponent 398, which has binary expansion (110 001 110)<sub>2</sub>, we take a window of length 3 using the 2<sup>k</sup>-ary method algorithm and calculate 1, x<sup>3</sup>, x<sup>6</sup>, x<sup>12</sup>, x<sup>24</sup>, x<sup>48</sup>, x<sup>49</sup>, x<sup>98</sup>, x<sup>99</sup>, x<sup>198</sup>, x<sup>199</sup>, x<sup>398</sup>. But, we can also compute 1, x<sup>3</sup>, x<sup>6</sup>, x<sup>12</sup>, x<sup>24</sup>, x<sup>48</sup>, x<sup>96</sup>, x<sup>192</sup>, x<sup>199</sup>, x<sup>398</sup>, which saves one multiplication and amounts to evaluating (110 001 110)<sub>2</sub>.

Here is the general algorithm:

Algorithm:

Input
An element x of G, a non-negative integer <math>n = (n_{l-1}, n_{l-2}, \dots, n_0)_2</math>, a parameter k > 0 and the pre-computed values <math>x^3, x^5, \dots, x^{2^k - 1}</math>.
Output
The element <math>x^n \in G</math>.

Algorithm:

y := 1; i := l - 1
while i > -1 do
    if n<sub>i</sub> = 0 then
        y := y<sup>2</sup>
        i := i - 1
    else
        s := max{i - k + 1, 0}
        while n<sub>s</sub> = 0 do
            s := s + 1[notes 1]
        for h := 1 to i - s + 1 do
            y := y<sup>2</sup>
        u := (n<sub>i</sub>, n<sub>i-1</sub>, ..., n<sub>s</sub>)<sub>2</sub>
        y := y * x<sup>u</sup>
        i := s - 1
return y
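
A Python sketch of the sliding-window loop, with k = 3 as in the worked example above (the helper names are illustrative):

<syntaxhighlight lang="python">
def exp_sliding_window(x, n, k=3):
    """Sliding-window exponentiation for n >= 0 (illustrative sketch)."""
    if n == 0:
        return 1
    # precomputed odd powers x^1, x^3, ..., x^(2^k - 1)
    odd_pow = {1: x}
    x2 = x * x
    for u in range(3, 2 ** k, 2):
        odd_pow[u] = odd_pow[u - 2] * x2

    def bit(i):                    # n_i, the i-th binary digit of n
        return (n >> i) & 1

    y = 1
    i = n.bit_length() - 1         # i = l - 1
    while i > -1:
        if bit(i) == 0:
            y = y * y
            i = i - 1
        else:
            s = max(i - k + 1, 0)
            while bit(s) == 0:     # slide the window so that it ends in a 1 bit
                s = s + 1
            for _ in range(i - s + 1):
                y = y * y
            u = (n >> s) & ((1 << (i - s + 1)) - 1)   # u = (n_i, ..., n_s)_2, always odd
            y = y * odd_pow[u]
            i = s - 1
    return y

assert exp_sliding_window(3, 398) == 3 ** 398    # the exponent from the example above
</syntaxhighlight>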

Montgomery's ladder technique

Many algorithms for exponentiation do not provide defence against side-channel attacks. Namely, an attacker observing the sequence of squarings and multiplications can (partially) recover the exponent involved in the computation. This is a problem if the exponent should remain secret, as with many public-key cryptosystems. A technique called "Montgomery's ladder"[2] addresses this concern.

Given the binary expansion of a positive, non-zero integer n = (n<sub>k−1</sub>...n<sub>0</sub>)<sub>2</sub> with n<sub>k−1</sub> = 1, we can compute x<sup>n</sup> as follows:

x<sub>1</sub> = x; x<sub>2</sub> = x<sup>2</sup>
for i = k - 2 to 0 do
    if n<sub>i</sub> = 0 then
        x<sub>2</sub> = x<sub>1</sub> * x<sub>2</sub>; x<sub>1</sub> = x<sub>1</sub><sup>2</sup>
    else
        x<sub>1</sub> = x<sub>1</sub> * x<sub>2</sub>; x<sub>2</sub> = x<sub>2</sub><sup>2</sup>
return x<sub>1</sub>
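
Transcribed into Python as a sketch, the ladder looks as follows; note that a direct transcription like this one still branches on the secret bits, so it illustrates the data flow only and does not by itself give the protection against the side channels discussed in this section:

<syntaxhighlight lang="python">
def montgomery_ladder(x, n):
    """Montgomery ladder for n > 0; the invariant x2 == x1 * x is maintained."""
    bits = bin(n)[2:]              # binary expansion, most significant bit (always 1) first
    x1, x2 = x, x * x
    for b in bits[1:]:             # i = k - 2 down to 0
        if b == '0':
            x2 = x1 * x2
            x1 = x1 * x1
        else:
            x1 = x1 * x2
            x2 = x2 * x2
    return x1

assert montgomery_ladder(7, 13) == 7 ** 13
</syntaxhighlight>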

The algorithm performs a fixed sequence of operations (up to log n): a multiplication and squaring takes place for each bit in the exponent, regardless of the bit's specific value. A similar algorithm for multiplication by doubling exists.

This specific implementation of Montgomery's ladder is not yet protected against cache timing attacks: memory access latencies might still be observable to an attacker, as different variables are accessed depending on the value of bits of the secret exponent. Modern cryptographic implementations use a "scatter" technique to make sure the processor always misses the faster cache.[3]

Fixed-base exponent

There are several methods which can be employed to calculate x<sup>n</sup> when the base is fixed and the exponent varies. As one can see, precomputations play a key role in these algorithms.

Yao's method

Yao's method is orthogonal to the 2<sup>k</sup>-ary method where the exponent is expanded in radix b = 2<sup>k</sup> and the computation is as performed in the algorithm above. Let n, n<sub>i</sub>, b, and b<sub>i</sub> be integers.

Let the exponent Template:Mvar be written as

<math display="block">n = \sum_{i=0}^{w-1} n_i b_i,</math>

where <math>0 \leqslant n_i < h</math> for all <math>i \in [0, w-1]</math>.

Let <math>x_i = x^{b_i}</math>.

Then the algorithm uses the equality

<math display="block">x^n = \prod_{i=0}^{w-1} x_i^{n_i} = \prod_{j=1}^{h-1} \left[\prod_{n_i = j} x_i\right]^j.</math>

Given the element x of G, and the exponent n written in the above form, along with the precomputed values <math>x^{b_0}, \dots, x^{b_{w-1}}</math>, the element <math>x^n</math> is calculated using the algorithm below:

y = 1, u = 1, j = h - 1
while j > 0 do
    for i = 0 to w - 1 do
        if n<sub>i</sub> = j then
            u = u × x<sup>b<sub>i</sub></sup>
    y = y × u
    j = j - 1
return y
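
A Python sketch of Yao's method with the particular choice b<sub>i</sub> = h<sup>i</sup>, so that the n<sub>i</sub> are simply the base-h digits of n; here the values x<sup>b<sub>i</sub></sup> are computed naively on the spot, whereas in the fixed-base setting they would already be stored:

<syntaxhighlight lang="python">
def exp_yao(x, n, h=16):
    """Yao's method: x^n = prod_j (prod of the x_i with n_i = j)^j (illustrative sketch)."""
    # base-h digits of n, least significant first: n = sum of n_i * h**i
    digits = []
    m = n
    while m:
        digits.append(m % h)
        m //= h
    if not digits:
        return 1
    # precomputed values x_i = x^(b_i) with b_i = h**i
    xb = [x]
    for _ in range(len(digits) - 1):
        xb.append(xb[-1] ** h)          # fixed-base precomputation
    y, u = 1, 1
    for j in range(h - 1, 0, -1):       # j = h - 1 down to 1
        for i, ni in enumerate(digits):
            if ni == j:
                u = u * xb[i]
        y = y * u
    return y

assert exp_yao(5, 12345) == 5 ** 12345
</syntaxhighlight>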

If we set h = 2<sup>k</sup> and b<sub>i</sub> = h<sup>i</sup>, then the n<sub>i</sub> values are simply the digits of n in base h. Yao's method collects in u first those x<sub>i</sub> that appear to the highest power h − 1; in the next round those with power h − 2 are collected in u as well, etc. The variable y is multiplied h − 1 times with the initial u, h − 2 times with the next highest powers, and so on. The algorithm uses w + h − 2 multiplications, and w + 1 elements must be stored to compute x<sup>n</sup>.[1]

Euclidean method

The Euclidean method was first introduced in Efficient exponentiation using precomputation and vector addition chains by P. de Rooij.

This method for computing x<sup>n</sup> in a group G, where n is a natural integer and whose algorithm is given below, uses the following equality recursively:

<math display="block">x_0^{n_0} x_1^{n_1} = \left(x_0 x_1^q\right)^{n_0} x_1^{n_1 \bmod n_0},</math>

where <math>q = \left\lfloor \frac{n_1}{n_0} \right\rfloor</math>. In other words, a Euclidean division of the exponent n<sub>1</sub> by n<sub>0</sub> is used to return a quotient q and a rest n<sub>1</sub> mod n<sub>0</sub>.

Given the base element x in group G, and the exponent n written as in Yao's method, the element x<sup>n</sup> is calculated using l precomputed values <math>x^{b_0}, \dots, x^{b_{l-1}}</math> and then the algorithm below.

Begin loop
    Find <math>M \in [0, l-1]</math> such that <math>\forall i \in [0, l-1], n_M \geqslant n_i</math>.
    Find <math>N \in [0, l-1] \setminus \{M\}</math> such that <math>\forall i \in [0, l-1] \setminus \{M\}, n_N \geqslant n_i</math>.
    Break loop if <math>n_N = 0</math>.
    Let <math>q = \lfloor n_M / n_N \rfloor</math>, and then let <math>n_M = (n_M \bmod n_N)</math>.
    Compute recursively <math>x_M^q</math>, and then let <math>x_N = x_N \cdot x_M^q</math>.
End loop;
Return <math>x^n = x_M^{n_M}</math>.
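
The following Python sketch makes the loop concrete for the particular choice b<sub>i</sub> = h<sup>i</sup>, so that the n<sub>i</sub> are the base-h digits of n as in Yao's method; the power x<sub>M</sub><sup>q</sup>, which the algorithm computes recursively, is obtained here with Python's built-in exponentiation, and any of the methods above could be substituted for it:

<syntaxhighlight lang="python">
def exp_euclidean(x, n, h=16):
    """Euclidean (vector addition chain) exponentiation (illustrative sketch)."""
    # base-h digits of n and the precomputed powers x_i = x^(h**i)
    digits = []
    m = n
    while m:
        digits.append(m % h)
        m //= h
    if not digits:
        return 1
    xb = [x]
    for _ in range(len(digits) - 1):
        xb.append(xb[-1] ** h)                       # fixed-base precomputation
    nv = digits[:]                                   # the exponents n_i, reduced in place
    while True:
        M = max(range(len(nv)), key=lambda i: nv[i])      # index of the largest exponent
        others = [i for i in range(len(nv)) if i != M]
        if not others:
            break
        N = max(others, key=lambda i: nv[i])              # index of the second largest
        if nv[N] == 0:
            break
        q, r = divmod(nv[M], nv[N])
        xb[N] = xb[N] * xb[M] ** q                   # x_N := x_N * x_M^q
        nv[M] = r                                    # n_M := n_M mod n_N
    return xb[M] ** nv[M]                            # x^n = x_M^(n_M)

assert exp_euclidean(3, 1000) == 3 ** 1000
</syntaxhighlight>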

The algorithm first finds the largest value among the n<sub>i</sub> and then the supremum within the set of { n<sub>i</sub> \ i ≠ M }. Then it raises x<sub>M</sub> to the power q, multiplies this value with x<sub>N</sub>, and then assigns x<sub>N</sub> the result of this computation and n<sub>M</sub> the value n<sub>M</sub> modulo n<sub>N</sub>.

Further applications

The approach also works with semigroups that are not of characteristic zero, for example allowing fast computation of large exponents modulo a number. Especially in cryptography, it is useful to compute powers in a ring of integers modulo q. For example, the evaluation of

<math display="block">13789^{722341} \pmod{2345}</math>

would take a very long time and much storage space if the naïve method of computing 13789<sup>722341</sup> and then taking the remainder when divided by 2345 were used. Even using a more effective method will take a long time: square 13789, take the remainder when divided by 2345, multiply the result by 13789, and so on.

Applying the above exp-by-squaring algorithm, with "*" interpreted as x * y = xy mod 2345 (that is, a multiplication followed by a division with remainder) leads to only 27 multiplications and divisions of integers, which may all be stored in a single machine word. Generally, any of these approaches will take fewer than 2 log<sub>2</sub>(722341) ≤ 40 modular multiplications.
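
As a concrete sketch, square-and-multiply with every multiplication immediately reduced modulo 2345 can be written in Python as follows (the right-to-left bit scan used here is a minor variation of the algorithms above; the result agrees with Python's built-in pow(13789, 722341, 2345)):

<syntaxhighlight lang="python">
def mod_exp(x, n, m):
    """Square-and-multiply in which every product is reduced modulo m."""
    result = 1
    x %= m
    while n > 0:
        if n & 1:                    # current least-significant bit of n is 1
            result = (result * x) % m
        x = (x * x) % m              # square, then take the remainder
        n >>= 1
    return result

assert mod_exp(13789, 722341, 2345) == pow(13789, 722341, 2345)
</syntaxhighlight>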

The approach can also be used to compute integer powers in a group, using either of the rules

Power(x, −n) = Power(x<sup>−1</sup>, n),
Power(x, −n) = (Power(x, n))<sup>−1</sup>.

The approach also works in non-commutative semigroups and is often used to compute powers of matrices.

More generally, the approach works with positive integer exponents in every magma for which the binary operation is power associative.

Signed-digit recoding

In certain computations it may be more efficient to allow negative coefficients and hence use the inverse of the base, provided inversion in G is "fast" or has been precomputed. For example, when computing <math>x^{2^k - 1}</math>, the binary method requires k − 1 multiplications and k − 1 squarings. However, one could perform k squarings to get <math>x^{2^k}</math> and then multiply by <math>x^{-1}</math> to obtain <math>x^{2^k - 1}</math>.

To this end we define the signed-digit representation of an integer n in radix b as

<math display="block">n = \sum_{i=0}^{l-1} n_i b^i \quad \text{with } |n_i| < b.</math>

Signed binary representation corresponds to the particular choice b = 2 and <math>n_i \in \{-1, 0, 1\}</math>. It is denoted by <math>(n_{l-1} \dots n_0)_s</math>. There are several methods for computing this representation. The representation is not unique. For example, take n = 478: two distinct signed-binary representations are given by <math>(1 0 \bar 1 1 1 0 0 \bar 1 1 0)_s</math> and <math>(1 0 0 \bar 1 1 0 0 0 \bar 1 0)_s</math>, where <math>\bar 1</math> is used to denote −1. Since the binary method computes a multiplication for every non-zero entry in the base-2 representation of n, we are interested in finding the signed-binary representation with the smallest number of non-zero entries, that is, the one with minimal Hamming weight. One method of doing this is to compute the representation in non-adjacent form, or NAF for short, which is one that satisfies <math>n_i n_{i+1} = 0</math> for all <math>i \geqslant 0</math> and denoted by <math>(n_{l-1} \dots n_0)_\text{NAF}</math>. For example, the NAF representation of 478 is <math>(1 0 0 0 \bar 1 0 0 0 \bar 1 0)_\text{NAF}</math>. This representation always has minimal Hamming weight. A simple algorithm to compute the NAF representation of a given integer <math>n = (n_l n_{l-1} \dots n_0)_2</math> with <math>n_l = n_{l-1} = 0</math> is the following:

<math>c_0 = 0</math>
for i = 0 to l − 1 do
  <math>c_{i+1} = \left\lfloor \tfrac{1}{2}(c_i + n_i + n_{i+1}) \right\rfloor</math>
  <math>n_i = c_i + n_i - 2c_{i+1}</math>
return <math>(n_{l-1} \dots n_0)_\text{NAF}</math>
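
A Python sketch of this carry-based recoding, producing the digits least-significant first; the assertions check it against the representation of 478 given above:

<syntaxhighlight lang="python">
def to_naf(n):
    """Non-adjacent form of n >= 0, least significant digit first."""
    bits = [(n >> i) & 1 for i in range(n.bit_length())] + [0, 0]   # append n_l = n_{l-1} = 0
    l = len(bits) - 1
    c = 0                                         # c_0 = 0
    naf = []
    for i in range(l):
        c_next = (c + bits[i] + bits[i + 1]) // 2
        naf.append(c + bits[i] - 2 * c_next)      # NAF digit, in {-1, 0, 1}
        c = c_next
    return naf

digits = to_naf(478)                              # 0, -1, 0, 0, 0, -1, 0, 0, 0, 1 (from n_0 up)
assert sum(d * 2 ** i for i, d in enumerate(digits)) == 478
assert all(digits[i] * digits[i + 1] == 0 for i in range(len(digits) - 1))
</syntaxhighlight>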

Another algorithm by Koyama and Tsuruoka does not require the condition that <math>n_i = n_{i+1} = 0</math>; it still minimizes the Hamming weight.

Alternatives and generalizations

Exponentiation by squaring can be viewed as a suboptimal addition-chain exponentiation algorithm: it computes the exponent by an addition chain consisting of repeated exponent doublings (squarings) and/or incrementing exponents by one (multiplying by x) only. More generally, if one allows any previously computed exponents to be summed (by multiplying those powers of x), one can sometimes perform the exponentiation using fewer multiplications (but typically using more memory). The smallest power where this occurs is for n = 15:

<math>x^{15} = x \times (x \times [x \times x^2]^2)^2</math> (squaring, 6 multiplies),
<math>x^{15} = x^3 \times ([x^3]^2)^2</math> (optimal addition chain, 5 multiplies if x<sup>3</sup> is re-used).

In general, finding the optimal addition chain for a given exponent is a hard problem, for which no efficient algorithms are known, so optimal chains are typically used for small exponents only (e.g. in compilers where the chains for small powers have been pre-tabulated). However, there are a number of heuristic algorithms that, while not being optimal, have fewer multiplications than exponentiation by squaring at the cost of additional bookkeeping work and memory usage. Regardless, the number of multiplications never grows more slowly than Θ(log n), so these algorithms improve asymptotically upon exponentiation by squaring by only a constant factor at best.
