Integer Square Root Algorithm (sqrt64, Newton–Raphson)

2025-12-31

Takayuki HOSODA

Overview

This is a 64-bit integer square root algorithm and its C implementation, designed for 18–24-bit signal processing systems, withminimum rounding error.

Since the square root of a 64-bit integer results in a 32-bit value, the computation is performed around the lower part of the upper 8 bits of the MSB in order to avoid overflow and underflow during coefficient multiplications and right shifts. Values smaller than 0x8000 are scaled up to the upper 16 bits before computation.

The initial value is obtained using a first-order Min?Max?like approximation. Then, the Newton?Raphson method is applied three times to reduce the error to within 1 LSB. Finally, an LSB adjustment is performed to confine the error within±1/2 LSB.

Algorithm

x₀ = (22 a + 11) / 32 | a ∈ (0.25, 1) … (1)
x_i+1 ← (x_i + a / x_i ) / 2 … (2)

This algorithm is centered around equations (1) and (2).

sqrt arpproximation errors — **Fig. 1** Relative error of square root approximations (`x` ∈ (0.25, 1))

Initial Approximation

As an initial approximation of the square root over the interval x ∈ (0.25, 1), the first-order approximation (1) is used, where the denominator is chosen to be a power of two.
With this approximation, an accuracy within ± 0.31 % is achieved.

Error Reduction

Error reduction is performed using the Newton–Raphson iteration given by equation (2).
When the function has no multiple roots, the Newton&endash;Raphson method exhibits quadratic convergence in the vicinity of the solution.
The theoretical accuracy after each iteration is as follows:

1st: 4.8×10^-4
2nd: 1.2×10^-7
3rd: 6.5×10^-15

Since

1 / 2³² ≈ 2.33×10^-10 ≫ 6.5×10^-15

three iterations of the Newton–Raphson method are sufficient.

The Newton–Raphson update formula (2) can be derived by defining

f (x) = x_i² - a = 0

which leads to

x_i+1 = x_i - f (x_i) / f '(x_i)
= x_i - (x_i² - a) / (2 x_i)
= (x_i + a / x_i) / 2

Alternatively, this can be interpreted as follows.
Let the error term be

h = 1 - a / x_i² … (3)

Then, the series expansion of √(1 - h) up to the 4th order is

h = 1 - a / x_i² … (3)
series(√(1 - h), h, 4) = 1 - h / 2 - h² / 8 - h³ / 16 + O(h⁴) … (4)

Using only the first-order term of this expansion to correct x_i, we obtain

x_i+1 = x_i (1 - h / 2)
= (x_i + a / x_i) / 2

which is identical to equation (2). Focusing on the second-order term in (4), we observe that -h / 8 is negative. Therefore, when h is sufficiently small, the sum of the remaining correction terms is negative. As a result, equation (2) always under-corrects, and the approximation error is therefore always positive.

Using the expansion up to the second-order term yields an iteration equivalent to Halley's method, and using up to the third-order term yields Householder's method.

Adjustment to ±1/2 LSB Error

Let the given integer be x, and the computed integer result be y.
There exists a y such that

y² < x < (y + 1)² … (5)

The value of x at which the error becomes the maximum of 1/2 LSB is

x == (y + 1/2)² … (6)

However, since x is an integer, this occurs when

x == y² + y

Therefore, to confine the error within }1/2 LSB, it is sufficient to return y + 1 when

y² + y < x

Error Distribution

The error histogram obtained by exhaustive testing over the full range
0 ≤ x ≤ 2³⁸ - 1
(= 274877906943, or 0x3fffffffff) in a 38-bit system is as follows:

error(-1/4..-1/2 LSB) =  68719476736
error(-1/4..+1/4 LSB) = 137438953472
error(+1/4..+1/2 LSB) =  68719476736

C Implementation

sqrt64 returns the integer value that minimizes the error for the square root of a given 64-bit integer. For negative inputs, the function returns the value as-is without raising an exception.

/* sqrt64 - calculate squre root of given integer value
 * Rev.1.4 (2025-12-31) (c) 2003-2025 Takayuki HOSODA
 * SPDX-License-Identifier: BSD-3-Clause
 */
#include <inttypes.h>
int64_t sqrt64(int64_t x);
int64_t sqrt64(int64_t x) {
    int64_t a;                          // approximation register
    int64_t t;                          // scaled '1'
    int     sr;                         // scaling (for relative domain of (0.25, 1))
    int     sx;                         // scaling (for small values)
    int     n_bit = sizeof(x) * 8;      // number of digits of given x (e.g. 64 for int64_t)
    int     n_room = 8;                 // head room
    if (x <= 0) return x;               // netative values are returned unchanged.
    sx = 0;
    a  = x;
    if (x < (1LL << (n_bit / 2 - 1))) { // If x is in the lower half of the digits,
        sx = n_bit / 4;                 // note that square root halves the digits.
        a  = x = (x << (n_bit / 2));    // enlarge to the upper half of the didits.
    }
    a >>= n_room;                       // scale down to get head-room
    sr  = n_bit / 2 - n_room;
    t = 1LL << (n_bit - n_room - 2);
    while (a < t) {                     // find operating point
        t >>=2;
        sr--;
    }
    a >>= sr;                           // now, a / t is in (0.25, 1)
    t = 1LL << (sr + n_room);           // scaled '1'
    a = (22 * a + 11 * t) / 32;         // -3.1e-2 < err < +2.9e-2, a / t in (0.25, 1)
    a = (x / a + a) >> 1;               // 0 <= err < 4.8e-4
    a = (x / a + a) >> 1;               // 0 <= err < 1.2e-7
    a = (x / a + a) >> 1;               // 0 <= err < 6.5e-15
    if ((x - a) > (a * a))              // the approximation error is always zero or negative
        a++;                            // adjust LSB so that the error is within 1/2 LSB
    if (sx) {
        a += (1LL << (sx - 1)) - 1;     // for round half
        a >>= sx;
    }
    return a;
}
#ifdef DEBUG
#include <stdio.h>
#include <math.h>
int test_sqrt(int64_t x) {
    int64_t y;
    double  ref;
    ref = round(sqrt((double)x));
    y = sqrt64(x);
    fprintf(stdout, "sqrt64(%"PRId64") = %"PRId64" %s %.f\n",
        x, y, y != ref ? "!=" : "==", ref);
    return (y != ref ? 1 : 0);
}
int main(int argc, char *argv[]) {
    int     i;
    int64_t x, y;
    if (argc == 2) {
        sscanf(&argv[1][0], "%" SCNd64 "\n", &x);
        y = sqrt64(x);
        fprintf(stdout, "sqrt64(%"PRId64") = %"PRId64"\n", x, y);
    } else {
        x = 1LL;
        for (i = 0; i < 63; i++) {
            test_sqrt(1LL << i);
            test_sqrt(x);
            x = (x << 1) | 1;
        }
    }
    return 0;
}
#endif

¦ The square root of INT64_MAX (0x7fffffffffffffff) is 0xb504f334 (= 3037000500 ) , which exceeds INT32_MAX (0x7fffffff). Therefore, the return type cannot be int32_t .

Q&A: Design and Implementation of Integer Square Root

Q1:Halleyfs method converges cubically. Why not use it?

A1: Because intermediate values become excessively large in integer arithmetic.

In one example of the Halley update formula:

x ← x (x + 3 a / x ) / ( 3 x + a / x )

the intermediate term

x (x + 3 a / x) ≈ 4a

temporarily reaches a magnitude comparable to the original value a.
While this is harmless in floating-point arithmetic, such transient growth is fatal in fixed-width integer arithmetic.

⛔ Halley's method is fast, but consumes too much dynamic range for integer arithmetic.

Q2. Can we reduce the number of Newton?Raphson iterations by using a second-order initial approximation?

A2. The initial error is reduced, but intermediate values become much larger.

For example, using the following second-order Min?Max approximation over
x ∈ (0.25, 1):

g(x) = (66 + 270x - 81x²) / 256

the initial approximation error can be reduced to about 0.6 %, which is approximately 0.7 decimal digits better than the first-order approximation.

However, this expression contains an x² term. In integer arithmetic, this temporarily requires more than double the bit width.

⛔ Increasing the approximation order drastically increases the required dynamic range.

Q3. Canft we work around that?

A3. It is possible, but at that point, the approach becomes counterproductive.

Possible workarounds include:

Fine-grained scaling
Splitting intermediate evaluations
Additional shifts and branches

However, branches disrupt pipeline efficiency and slow down execution, and systematic error analysis becomes difficult.
In practice, running one more Newton–Raphson iteration is faster and safer.
The Newton–Raphson update formula

x ← (x + a/x) / 2

handles only values on the order of √a throughout the iteration.

✅ Newton?Raphson is not flashy, but it is slim, robust, and predictable.

Q4. What was ultimately prioritized?

A4. Safely covering the entire value range in a fully explainable manner.

This implementation adopts:

First-order initial approximation
Three Newton?Raphson iterations
Intermediate values always on the order of √a
One-sided error sign
Final LSB error correction

As a result:

The full 64-bit input range is covered
Error is confined within ± 1/2 LSB
No special workarounds are required

✅ This is a highly robust design for integer arithmetic.

Summary

Methods that are gfasth in floating-point arithmetic
may become gdangeroush in integer arithmetic.

In this implementation, dynamic range was the highest priority in all design decisions.

✅ As a result, the algorithm is also numerically reliable, simple, and highly portable.