In numerical analysis, the ITP method (short for Interpolate, Truncate, and Project) is the first root-finding algorithm that achieves the superlinear convergence of the secant method while retaining the optimal worst-case performance of the bisection method. It is also the first method whose average performance is guaranteed to be strictly better than that of the bisection method under any continuous distribution. In practice it outperforms traditional interpolation-based and hybrid strategies (Brent's method, Ridders' method, the Illinois algorithm): it not only converges superlinearly on well-behaved functions but also guarantees fast worst-case performance on ill-behaved functions where interpolation fails.
The ITP method follows the same structure as standard bracketing strategies, which keep track of upper and lower bounds for the location of the root; but it also keeps track of the region within which worst-case performance remains upper-bounded. As a bracketing strategy, in each iteration the ITP method queries the value of the function at one point and discards the part of the interval between the two points whose function values share the same sign. The queried point is calculated in three steps: it interpolates, computing the regula falsi estimate; it then perturbs/truncates this estimate (similar to Regula falsi § Improvements in regula falsi); and finally it projects the perturbed estimate onto an interval in the neighbourhood of the bisection midpoint. The neighbourhood around the bisection point is recalculated in each iteration in order to guarantee minmax optimality (Theorem 2.1 of ). The method depends on three hyper-parameters $\kappa_1 \in (0,\infty)$, $\kappa_2 \in [1, 1+\phi)$ and $n_0 \in [0,\infty)$, where $\phi$ is the golden ratio $\tfrac{1}{2}(1+\sqrt{5})$: the first two control the size of the truncation, and the third is a slack variable that controls the size of the interval for the projection step.
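As an aside, the admissible ranges of these hyper-parameters can be checked directly in code; the small Python helper below is illustrative only (its name is an assumption, not part of the method).

import math

PHI = (1 + math.sqrt(5)) / 2  # golden ratio; the upper limit for kappa_2 is 1 + PHI

def validate_hyperparameters(kappa1: float, kappa2: float, n0: float) -> None:
    # kappa_1 in (0, infinity): scales the truncation size
    if not kappa1 > 0:
        raise ValueError("kappa1 must be strictly positive")
    # kappa_2 in [1, 1 + phi): exponent of the truncation term
    if not (1 <= kappa2 < 1 + PHI):
        raise ValueError("kappa2 must lie in [1, 1 + phi)")
    # n_0 in [0, infinity): slack on the worst-case iteration count
    if not n0 >= 0:
        raise ValueError("n0 must be non-negative")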
Root-finding problem
Given a continuous function $f$ defined from $[a,b]$ to $\mathbb{R}$ such that $f(a)f(b) \leq 0$, where at the cost of one query one can access the value of $f(x)$ at any given $x$, and given a pre-specified target precision $\epsilon > 0$, a root-finding algorithm is designed to solve the following problem with as few queries as possible:
Problem Definition: Find $\hat{x}$ such that $|\hat{x} - x^*| \leq \epsilon$, where $x^*$ satisfies $f(x^*) = 0$.
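The bisection method, against which the ITP method is measured throughout this article, is the simplest solver satisfying this contract; a minimal Python baseline is sketched below (the function name and signature are illustrative assumptions, not taken from the source).

from typing import Callable

def bisection(f: Callable[[float], float], a: float, b: float, eps: float) -> float:
    """Baseline bracketing solver: return x_hat with |x_hat - x*| <= eps.

    Precondition: f is continuous on [a, b] and f(a) * f(b) <= 0.
    """
    ya, yb = f(a), f(b)
    assert ya * yb <= 0, "the interval [a, b] must bracket a root"
    while b - a > 2 * eps:
        m = (a + b) / 2
        ym = f(m)
        if ya * ym > 0:   # f(a) and f(m) share a sign, so the root lies in [m, b]
            a, ya = m, ym
        else:             # otherwise the root lies in [a, m]
            b, yb = m, ym
    return (a + b) / 2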
This problem is very common in numerical analysis, computer science and engineering, and root-finding algorithms are the standard approach to solving it. Often, the root-finding procedure is called by more complex parent algorithms within a larger context, so solving root problems efficiently is of great importance: an inefficient approach may come at a high computational cost when the larger context is taken into account. The ITP method addresses this by simultaneously exploiting interpolation guarantees and the minmax optimal guarantee of the bisection method, which terminates in at most $n_{1/2} \equiv \lceil \log_2((b_0 - a_0)/2\epsilon) \rceil$ iterations when initiated on an interval $[a_0, b_0]$.
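For instance, with a starting interval of unit width such as $[a_0, b_0] = [1, 2]$ and the precision $\epsilon = 0.0005$ used in the polynomial example below, this bound is $n_{1/2} = \lceil \log_2 \tfrac{1}{0.001} \rceil = \lceil 9.97 \rceil = 10$ iterations.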
The method
Given $\kappa_1 \in (0,\infty)$, $\kappa_2 \in [1, 1+\phi)$, $n_{1/2} \equiv \lceil \log_2((b_0 - a_0)/2\epsilon) \rceil$ and $n_0 \in [0,\infty)$, where $\phi$ is the golden ratio $\tfrac{1}{2}(1+\sqrt{5})$, in each iteration $j = 0, 1, 2, \dots$ the ITP method calculates the point $x_{\text{ITP}}$ following three steps:
[Interpolation Step] Calculate the bisection and the regula falsi points: $x_{1/2} \equiv \tfrac{a+b}{2}$ and $x_f \equiv \tfrac{b f(a) - a f(b)}{f(a) - f(b)}$;
[Truncation Step] Perturb the estimator towards the center: $x_t \equiv x_f + \sigma \delta$ where $\sigma \equiv \operatorname{sign}(x_{1/2} - x_f)$ and $\delta \equiv \min\{\kappa_1 |b-a|^{\kappa_2},\ |x_{1/2} - x_f|\}$;
[Projection Step] Project the estimator onto the minmax interval: $x_{\text{ITP}} \equiv x_{1/2} - \sigma \rho_j$ where $\rho_j \equiv \min\left\{\epsilon\, 2^{n_{1/2}+n_0-j} - \tfrac{b-a}{2},\ |x_t - x_{1/2}|\right\}$.
The value of the function $f(x_{\text{ITP}})$ at this point is queried, and the interval is then reduced to bracket the root by keeping the sub-interval whose endpoints have function values of opposite sign.
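As an illustration, the computation of the query point in a single iteration can be sketched in Python as below; the helper name and argument list are assumptions made for exposition, with $y_a = f(a)$ and $y_b = f(b)$ passed in so the function is evaluated only once per iteration.

import math

def itp_query_point(a, b, ya, yb, eps, k1, k2, n_half, n0, j):
    """Compute the ITP query point x_ITP for iteration j on the bracket [a, b]."""
    # Interpolation: bisection midpoint and regula falsi estimate.
    x_half = (a + b) / 2
    x_f = (b * ya - a * yb) / (ya - yb)
    # Truncation: perturb the regula falsi point towards the midpoint.
    sigma = math.copysign(1.0, x_half - x_f)
    delta = min(k1 * (b - a) ** k2, abs(x_half - x_f))
    x_t = x_f + sigma * delta
    # Projection: pull the point into the minmax interval around the midpoint.
    r = eps * 2 ** (n_half + n0 - j) - (b - a) / 2
    if abs(x_t - x_half) <= r:
        return x_t
    return x_half - sigma * r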
= The algorithm =
The following algorithm (written in pseudocode) assumes that the initial values of $y_a$ and $y_b$ are given and satisfy $y_a < 0 < y_b$, where $y_a \equiv f(a)$ and $y_b \equiv f(b)$; it returns an estimate $\hat{x}$ that satisfies $|\hat{x} - x^*| \leq \epsilon$ in at most $n_{1/2} + n_0$ function evaluations.
Input: $a, b, \epsilon, \kappa_1, \kappa_2, n_0, f$
Preprocessing: $n_{1/2} = \lceil \log_2 \tfrac{b-a}{2\epsilon} \rceil$, $n_{\max} = n_{1/2} + n_0$, and $j = 0$;
While ( $b - a > 2\epsilon$ )
  Calculating Parameters: $x_{1/2} = \tfrac{a+b}{2}$, $r = \epsilon\, 2^{n_{\max}-j} - (b-a)/2$, $\delta = \kappa_1 (b-a)^{\kappa_2}$;
  Interpolation: $x_f = \tfrac{y_b a - y_a b}{y_b - y_a}$;
  Truncation: $\sigma = \operatorname{sign}(x_{1/2} - x_f)$;
    If $\delta \leq |x_{1/2} - x_f|$ then $x_t = x_f + \sigma \delta$,
    Else $x_t = x_{1/2}$;
  Projection:
    If $|x_t - x_{1/2}| \leq r$ then $x_{\text{ITP}} = x_t$,
    Else $x_{\text{ITP}} = x_{1/2} - \sigma r$;
  Updating Interval: $y_{\text{ITP}} = f(x_{\text{ITP}})$;
    If $y_{\text{ITP}} > 0$ then $b = x_{\text{ITP}}$ and $y_b = y_{\text{ITP}}$,
    Elseif $y_{\text{ITP}} < 0$ then $a = x_{\text{ITP}}$ and $y_a = y_{\text{ITP}}$,
    Else $a = x_{\text{ITP}}$ and $b = x_{\text{ITP}}$;
  $j = j + 1$;
Output: $\hat{x} = \tfrac{a+b}{2}$
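For reference, a direct Python transcription of this pseudocode might look as follows. This is an illustrative sketch rather than a reference implementation: the function name, the sign normalisation of the bracket, the rounding guard on the projection radius, and the default hyper-parameters (taken from the example below) are assumptions.

import math

def itp(f, a, b, eps, k1=0.1, k2=2.0, n0=1):
    """Return x_hat with |x_hat - x*| <= eps for a root x* of f in [a, b].

    Assumes f is continuous and f(a), f(b) have opposite (or zero) signs.
    """
    ya, yb = f(a), f(b)
    if ya == 0:
        return a
    if yb == 0:
        return b
    if ya > 0:  # normalise so that ya < 0 < yb, swapping the roles of a and b
        a, b, ya, yb = b, a, yb, ya
    n_half = math.ceil(math.log2(abs(b - a) / (2 * eps)))
    n_max = n_half + n0
    j = 0
    while abs(b - a) > 2 * eps:
        # Calculating parameters
        x_half = (a + b) / 2
        r = max(eps * 2 ** (n_max - j) - abs(b - a) / 2, 0.0)  # clamp rounding noise
        delta = k1 * abs(b - a) ** k2
        # Interpolation: regula falsi point
        x_f = (yb * a - ya * b) / (yb - ya)
        # Truncation: pull the estimate towards the midpoint
        sigma = math.copysign(1.0, x_half - x_f)
        x_t = x_f + sigma * delta if delta <= abs(x_half - x_f) else x_half
        # Projection onto the minmax interval
        x_itp = x_t if abs(x_t - x_half) <= r else x_half - sigma * r
        # Updating the bracketing interval
        y_itp = f(x_itp)
        if y_itp > 0:
            b, yb = x_itp, y_itp
        elif y_itp < 0:
            a, ya = x_itp, y_itp
        else:
            a = b = x_itp
        j += 1
    return (a + b) / 2

For instance, itp(lambda x: x**3 - x - 2, 1, 2, 0.0005) should return a value within $0.0005$ of the root of the polynomial treated in the next section.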
Example: Finding the root of a polynomial
Suppose that the ITP method is used to find a root of the polynomial $f(x) = x^3 - x - 2$ on the interval $[1, 2]$, using $\epsilon = 0.0005$, $\kappa_1 = 0.1$, $\kappa_2 = 2$ and $n_0 = 1$.
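As an illustration of the three steps on this problem, the first iteration ($j = 0$, with $n_{1/2} = 10$ and $n_{\max} = 11$) proceeds roughly as follows. Since $f(1) = -2$ and $f(2) = 4$, the midpoint is $x_{1/2} = 1.5$ and the regula falsi point is $x_f = \tfrac{2 \cdot (-2) - 1 \cdot 4}{-2 - 4} = \tfrac{4}{3} \approx 1.333$. The truncation step gives $\sigma = +1$ and $\delta = \min\{0.1 \cdot 1^2,\ 0.167\} = 0.1$, so $x_t \approx 1.433$; and since $|x_t - x_{1/2}| \approx 0.067$ lies well within the projection radius $\epsilon\, 2^{11} - 0.5 = 0.524$, the queried point is $x_{\text{ITP}} = x_t \approx 1.433$. Because $f(1.433) < 0$, the bracket is reduced to $[1.433, 2]$, and the iterations continue in the same fashion until $b - a \leq 2\epsilon$.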
This example can be compared to Bisection method § Example: Finding the root of a polynomial. The ITP method requires fewer than half as many iterations as the bisection method to obtain a more precise estimate of the root, with no cost to the minmax guarantees. Other methods might attain a similar speed of convergence (such as Ridders' method or Brent's method), but without the minmax guarantees provided by the ITP method.
Analysis
The main advantage of the ITP method is that it is guaranteed to require no more iterations than the bisection method when $n_0 = 0$. Hence, its average performance is guaranteed to be better than that of the bisection method even when interpolation fails. Furthermore, if interpolation does not fail (as on smooth functions), it is guaranteed to attain the high order of convergence of interpolation-based methods.
= Worst case performance =
Because the ITP method projects the estimator onto the minmax interval with an $n_0$ slack, it requires at most $n_{1/2} + n_0$ iterations (Theorem 2.1 of ). This is minmax optimal, like the bisection method, when $n_0$ is chosen to be $n_0 = 0$.
= Average performance =
Because it takes no more than $n_{1/2} + n_0$ iterations, the average number of iterations is always less than that of the bisection method, for any distribution considered, when $n_0 = 0$ (Corollary 2.2 of ).
= Asymptotic performance =
If the function $f(x)$ is twice differentiable and the root $x^*$ is simple, then the intervals produced by the ITP method converge to 0 with an order of convergence of $\sqrt{\kappa_2}$ if $n_0 \neq 0$, or if $n_0 = 0$ and $(b-a)/\epsilon$ is not a power of 2 with the term $\tfrac{\epsilon 2^{n_{1/2}}}{b-a}$ not too close to zero (Theorem 2.3 of ).
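For example, with the choice $\kappa_2 = 2$ used in the example above, the order of convergence is $\sqrt{2} \approx 1.41$; and as $\kappa_2$ approaches its upper limit $1 + \phi$, the order approaches $\sqrt{1+\phi} = \phi \approx 1.62$ (since $\phi^2 = \phi + 1$), which is the order of convergence of the secant method.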
Software
The itp contributed package in R.
See also
Bisection method
Ridders' method
Regula falsi
Brent's method
Notes
References
External links
An Improved Bisection Method, by Kudos