- Source: Phase retrieval
Phase retrieval is the process of algorithmically finding solutions to the phase problem. Given a complex spectrum
F
(
k
)
{\displaystyle F(k)}
, of amplitude
|
F
(
k
)
|
{\displaystyle |F(k)|}
, and phase
ψ
(
k
)
{\displaystyle \psi (k)}
:
F
(
k
)
=
|
F
(
k
)
|
e
i
ψ
(
k
)
=
∫
−
∞
∞
f
(
x
)
e
−
2
π
i
k
⋅
x
d
x
{\displaystyle F(k)=|F(k)|e^{i\psi (k)}=\int _{-\infty }^{\infty }f(x)\ e^{-2\pi ik\cdot x}\,dx}
where x is an M-dimensional spatial coordinate and k is an M-dimensional spatial frequency coordinate. Phase retrieval consists of finding the phase that satisfies a set of constraints for a measured amplitude. Important applications of phase retrieval include X-ray crystallography, transmission electron microscopy and coherent diffractive imaging, for which
M
=
2
{\displaystyle M=2}
. Uniqueness theorems for both 1-D and 2-D cases of the phase retrieval problem, including the phaseless 1-D inverse scattering problem, were proven by Klibanov and his collaborators (see References).
Problem formulation
Here we consider 1-D discrete Fourier transform (DFT) phase retrieval problem. The DFT of a complex signal
f
[
n
]
{\displaystyle f[n]}
is given by
F
[
k
]
=
∑
n
=
0
N
−
1
f
[
n
]
e
−
j
2
π
k
n
N
,
=
|
F
[
k
]
|
⋅
e
j
ψ
[
k
]
k
=
0
,
1
,
…
,
N
−
1
{\displaystyle F[k]=\sum _{n=0}^{N-1}f[n]e^{-j2\pi {\frac {kn}{N}},}=|F[k]|\cdot e^{j\psi [k]}\quad k=0,1,\ldots ,N-1}
,
and the oversampled DFT of
x
{\displaystyle x}
is given by
F
[
k
]
=
∑
n
=
0
N
−
1
f
[
n
]
e
−
j
2
π
k
n
M
,
k
=
0
,
1
,
…
,
M
−
1
{\displaystyle F[k]=\sum _{n=0}^{N-1}f[n]e^{-j2\pi {\frac {kn}{M}},}\quad k=0,1,\ldots ,M-1}
,
where
M
>
N
{\displaystyle M>N}
.
Since the DFT operator is bijective, this is equivalent to recovering the phase
ψ
[
k
]
{\displaystyle \psi [k]}
. It is common recovering a signal from its autocorrelation sequence instead of its Fourier magnitude. That is, denote by
f
^
{\displaystyle {\hat {f}}}
the vector
f
{\displaystyle f}
after padding with
N
−
1
{\displaystyle N-1}
zeros. The autocorrelation sequence of
f
^
{\displaystyle {\hat {f}}}
is then defined as
g
[
m
]
=
∑
i
=
max
{
1
,
m
+
1
}
N
f
^
i
f
^
i
−
m
¯
,
m
=
−
(
N
−
1
)
,
…
,
N
−
1
{\displaystyle g[m]=\sum _{i=\max\{1,m+1\}}^{N}{\hat {f}}_{i}{\overline {{\hat {f}}_{i-m}}},\quad m=-(N-1),\ldots ,N-1}
,
and the DFT of
g
[
m
]
{\displaystyle g[m]}
, denoted by
G
[
k
]
{\displaystyle G[k]}
, satisfies
G
[
k
]
=
|
F
[
k
]
|
2
{\displaystyle G[k]=|F[k]|^{2}}
.
Methods
= Error reduction algorithm
=The error reduction is a generalization of the Gerchberg–Saxton algorithm. It solves for
f
(
x
)
{\displaystyle f(x)}
from measurements of
|
F
(
u
)
|
{\displaystyle |F(u)|}
by iterating a four-step process. For the
k
{\displaystyle k}
th iteration the steps are as follows:
Step (1):
G
k
(
u
)
{\displaystyle G_{k}(u)}
,
ϕ
k
{\displaystyle \phi _{k}}
, and
g
k
(
x
)
{\displaystyle g_{k}(x)}
are estimates of, respectively,
F
(
u
)
{\displaystyle F(u)}
,
ψ
{\displaystyle \psi }
and
f
(
x
)
{\displaystyle f(x)}
. In the first step we calculate the Fourier transform of
g
k
(
x
)
{\displaystyle g_{k}(x)}
:
G
k
(
u
)
=
|
G
k
(
u
)
|
e
i
ϕ
k
(
u
)
=
F
(
g
k
(
x
)
)
.
{\displaystyle G_{k}(u)=|G_{k}(u)|e^{i\phi _{k}(u)}={\mathcal {F}}(g_{k}(x)).}
Step (2): The experimental value of
|
F
(
u
)
|
{\displaystyle |F(u)|}
, calculated from the diffraction pattern via the signal equation, is then substituted for
|
G
k
(
u
)
|
{\displaystyle |G_{k}(u)|}
, giving an estimate of the Fourier transform:
G
k
′
(
u
)
=
|
F
(
u
)
|
e
i
ϕ
k
(
u
)
,
{\displaystyle G'_{k}(u)=|F(u)|e^{i\phi _{k}(u)},}
where the ' denotes an intermediate result that will be discarded later on.
Step (3): the estimate of the Fourier transform
G
k
′
(
u
)
{\displaystyle G'_{k}(u)}
is then inverse Fourier transformed:
g
k
′
(
x
)
=
|
g
k
′
(
x
)
|
e
i
θ
k
′
(
x
)
=
F
−
1
(
G
k
′
(
u
)
)
.
{\displaystyle g'_{k}(x)=|g'_{k}(x)|e^{i\theta '_{k}(x)}={\mathcal {F}}^{-1}(G'_{k}(u)).}
Step (4):
g
k
′
(
x
)
{\displaystyle g'_{k}(x)}
then must be changed so that the new estimate of the object,
g
k
+
1
(
x
)
{\displaystyle g_{k+1}(x)}
, satisfies the object constraints.
g
k
+
1
(
x
)
{\displaystyle g_{k+1}(x)}
is therefore defined piecewise as:
g
k
+
1
(
x
)
≡
{
g
k
′
(
x
)
,
x
∉
γ
,
0
,
x
∈
γ
,
{\displaystyle g_{k+1}(x)\equiv {\begin{cases}g'_{k}(x),&x\notin \gamma ,\\0,&x\in \gamma ,\end{cases}}}
where
γ
{\displaystyle \gamma }
is the domain in which
g
k
′
(
x
)
{\displaystyle g'_{k}(x)}
does not satisfy the object constraints. A new estimate
g
k
+
1
(
x
)
{\displaystyle g_{k+1}(x)}
is obtained and the four step process is repeated.
This process is continued until both the Fourier constraint and object constraint are satisfied. Theoretically, the process will always lead to a convergence, but the large number of iterations needed to produce a satisfactory image (generally >2000) results in the error-reduction algorithm by itself being unsuitable for practical applications.
= Hybrid input-output algorithm
=The hybrid input-output algorithm is a modification of the error-reduction algorithm - the first three stages are identical. However,
g
k
(
x
)
{\displaystyle g_{k}(x)}
no longer acts as an estimate of
f
(
x
)
{\displaystyle f(x)}
, but the input function corresponding to the output function
g
k
′
(
x
)
{\displaystyle g'_{k}(x)}
, which is an estimate of
f
(
x
)
{\displaystyle f(x)}
. In the fourth step, when the function
g
k
′
(
x
)
{\displaystyle g'_{k}(x)}
violates the object constraints, the value of
g
k
+
1
(
x
)
{\displaystyle g_{k+1}(x)}
is forced towards zero, but optimally not to zero. The chief advantage of the hybrid input-output algorithm is that the function
g
k
(
x
)
{\displaystyle g_{k}(x)}
contains feedback information concerning previous iterations, reducing the probability of stagnation. It has been shown that the hybrid input-output algorithm converges to a solution significantly faster than the error reduction algorithm. Its convergence rate can be further improved through step size optimization algorithms.
g
k
+
1
(
x
)
≡
{
g
k
′
(
x
)
,
x
∉
γ
,
g
k
(
x
)
−
β
g
k
′
(
x
)
,
x
∈
γ
.
{\displaystyle g_{k+1}(x)\equiv {\begin{cases}g'_{k}(x),&x\notin \gamma ,\\g_{k}(x)-\beta {g'_{k}(x)},&x\in \gamma .\end{cases}}}
Here
β
{\displaystyle \beta }
is a feedback parameter which can take a value between 0 and 1. For most applications,
β
≈
0.9
{\displaystyle \beta \approx 0.9}
gives optimal results.{Scientific Reports volume 8, Article number: 6436 (2018)}
= Shrinkwrap
=For a two dimensional phase retrieval problem, there is a degeneracy of solutions as
f
(
x
)
{\displaystyle f(x)}
and its conjugate
f
∗
(
−
x
)
{\displaystyle f^{*}(-x)}
have the same Fourier modulus. This leads to "image twinning" in which the phase retrieval algorithm stagnates producing an image with features of both the object and its conjugate. The shrinkwrap technique periodically updates the estimate of the support by low-pass filtering the current estimate of the object amplitude (by convolution with a Gaussian) and applying a threshold, leading to a reduction in the image ambiguity.
Semidefinite relaxation-based algorithm for short time Fourier transform
The phase retrieval is an ill-posed problem. To uniquely identify the underlying signal, in addition to the methods that adds additional prior information like Gerchberg–Saxton algorithm, the other way is to add magnitude-only measurements like short time Fourier transform (STFT).
The method introduced below mainly based on the work of Jaganathan et al.
= Short time Fourier transform
=Given a discrete signal
x
=
(
f
[
0
]
,
f
[
1
]
,
.
.
.
,
f
[
N
−
1
]
)
T
{\displaystyle \mathbf {x} =(f[0],f[1],...,f[N-1])^{T}}
which is sampled from
f
(
x
)
{\displaystyle f(x)}
. We use a window of length W:
w
=
(
w
[
0
]
,
w
[
1
]
,
.
.
.
,
w
[
W
−
1
]
)
T
{\displaystyle \mathbf {w} =(w[0],w[1],...,w[W-1])^{T}}
to compute the STFT of
f
{\displaystyle \mathrm {f} }
, denoted by
Y
{\displaystyle \mathbf {Y} }
:
Y
[
m
,
r
]
=
∑
n
=
0
N
−
1
f
[
n
]
w
[
r
L
−
n
]
e
−
i
2
π
m
n
N
{\displaystyle Y[m,r]=\sum _{n=0}^{N-1}{f[n]w[rL-n]e^{-i2\pi {\frac {mn}{N}}}}}
for
0
≤
m
≤
N
−
1
{\displaystyle 0\leq m\leq N-1}
and
0
≤
r
≤
R
−
1
{\displaystyle 0\leq r\leq R-1}
, where the parameter
L
{\displaystyle L}
denotes the separation in time between adjacent short-time sections and the parameter
R
=
⌈
N
+
W
−
1
L
⌉
{\displaystyle R=\left\lceil {\frac {N+W-1}{L}}\right\rceil }
denotes the number of short-time sections considered.
The other interpretation (called sliding window interpretation) of STFT can be used with the help of discrete Fourier transform (DFT). Let
w
r
[
n
]
=
w
[
r
L
−
n
]
{\displaystyle w_{r}[n]=w[rL-n]}
denotes the window element obtained from shifted and flipped window
w
{\displaystyle \mathbf {w} }
. Then we have
Y
=
[
Y
0
,
Y
1
,
.
.
.
,
Y
R
−
1
]
{\displaystyle \mathbf {Y} =[\mathbf {Y} _{0},\mathbf {Y} _{1},...,\mathbf {Y} _{R-1}]}
, where
Y
r
=
x
∘
w
r
{\displaystyle \mathbf {Y} _{r}=\mathbf {x} \circ \mathbf {w} _{r}}
.
= Problem definition
=Let
Z
w
[
m
,
r
]
=
|
Y
[
m
,
r
]
|
2
{\displaystyle {Z}_{w}[m,r]=|Y[m,r]|^{2}}
be the
N
×
R
{\displaystyle N\times R}
measurements corresponding to the magnitude-square of the STFT of
x
{\displaystyle \mathbf {x} }
,
W
r
{\displaystyle \mathbf {W} _{r}}
be the
N
×
N
{\displaystyle N\times N}
diagonal matrix with diagonal elements
(
w
r
[
0
]
,
w
r
[
1
]
,
…
,
w
r
[
N
−
1
]
)
.
{\displaystyle \left(w_{r}[0],w_{r}[1],\ldots ,w_{r}[N-1]\right).}
STFT phase retrieval can be stated as:
Find
x
{\displaystyle \mathbf {x} }
such that
Z
w
[
m
,
r
]
=
|
⟨
f
m
,
W
r
x
⟩
|
2
{\displaystyle Z_{w}[m,r]=\left|\left\langle \mathbf {f} _{m},\mathbf {W} _{r}\mathbf {x} \right\rangle \right|^{2}}
for
0
≤
m
≤
N
−
1
{\displaystyle 0\leq m\leq N-1}
and
0
≤
r
≤
R
−
1
{\displaystyle 0\leq r\leq R-1}
, where
f
m
{\displaystyle \mathbf {f} _{m}}
is the
m
{\displaystyle m}
-th column of the
N
{\displaystyle N}
-point inverse DFT matrix.
Intuitively, the computational complexity growing with
N
{\displaystyle N}
makes the method impractical. In fact, however, for the most cases in practical we only need to consider the measurements corresponding to
0
≤
m
≤
M
{\displaystyle 0\leq m\leq M}
, for any parameter
M
{\displaystyle M}
satisfying
2
W
≤
M
≤
N
{\displaystyle 2W\leq M\leq N}
.
To be more specifically, if both the signal and the window are not vanishing, that is,
x
[
n
]
≠
0
{\displaystyle x[n]\neq 0}
for all
0
≤
n
≤
N
−
1
{\displaystyle 0\leq n\leq N-1}
and
w
[
n
]
≠
0
{\displaystyle w[n]\neq 0}
for all
0
≤
{\displaystyle 0\leq }
n
≤
W
−
1
{\displaystyle n\leq W-1}
, signal
x
{\displaystyle \mathbf {x} }
can be uniquely identified from its STFT magnitude if the following requirements are satisfied:
L
<
W
≤
N
/
2
{\displaystyle L
,
2
W
≤
M
≤
N
{\displaystyle 2W\leq M\leq N}
.
The proof can be found in Jaganathan' s work, which reformulates STFT phase retrieval as the following least-squares problem:
min
x
∑
r
=
0
R
−
1
∑
m
=
0
N
−
1
(
Z
w
[
m
,
r
]
−
|
⟨
f
m
,
W
r
x
⟩
|
2
)
2
{\displaystyle \min _{\mathbf {x} }\sum _{r=0}^{R-1}\sum _{m=0}^{N-1}\left(Z_{w}[m,r]-\left|\left\langle \mathbf {f} _{m},\mathbf {W} _{r}\mathbf {x} \right\rangle \right|^{2}\right)^{2}}
.
The algorithm, although without theoretical recovery guarantees, empirically able to converge to the global minimum when there is substantial overlap between adjacent short-time sections.
= Semidefinite relaxation-based algorithm
=To establish recovery guarantees, one way is to formulate the problems as a semidefinite program (SDP), by embedding the problem in a higher dimensional space using the transformation
X
=
x
x
∗
{\displaystyle \mathbf {X} =\mathbf {x} \mathbf {x} ^{\ast }}
and relax the rank-one constraint to obtain a convex program. The problem reformulated is stated below:
Obtain
X
^
{\displaystyle \mathbf {\hat {X}} }
by solving:
m
i
n
i
m
i
z
e
t
r
a
c
e
(
X
)
s
u
b
j
e
c
t
t
o
Z
[
m
,
r
]
=
t
r
a
c
e
(
W
r
∗
f
m
f
m
∗
W
r
X
)
X
⪰
0
{\displaystyle {\begin{aligned}&\mathrm {minimize} ~~~\mathrm {trace} (\mathbf {X} )\\[6pt]&\mathrm {subject~to} ~~Z[m,r]=\mathrm {trace} (\mathbf {W} _{r}^{\ast }\mathbf {f} _{m}\mathbf {f} _{m}^{\ast }\mathbf {W} _{r}\mathbf {X} )\\[0pt]&~~~~~~~~~~~~~~~~~~~\mathbf {X} \succeq 0\end{aligned}}}
for
1
≤
m
≤
M
{\displaystyle 1\leq m\leq M}
and
0
≤
r
≤
R
−
1
{\displaystyle 0\leq r\leq R-1}
Once
X
^
{\displaystyle \mathbf {\hat {X}} }
is found, we can recover signal
x
{\displaystyle \mathbf {x} }
by best rank-one approximation.
Applications
Phase retrieval is a key component of coherent diffraction imaging (CDI). In CDI, the intensity of the diffraction pattern scattered from a target is measured. The phase of the diffraction pattern is then obtained using phase retrieval algorithms and an image of the target is constructed. In this way, phase retrieval allows for the conversion of a diffraction pattern into an image without an optical lens.
Using phase retrieval algorithms, it is possible to characterize complex optical systems and their aberrations. For example, phase retrieval was used to diagnose and repair the flawed optics of the Hubble Space Telescope.
Other applications of phase retrieval include X-ray crystallography and transmission electron microscopy.
See also
Phase problem
Crystallography
X-ray crystallography
Coherent diffraction imaging
Transport-of-Intensity Equation
Phase correlation
References
Klibanov, M. V. (1985). "On uniqueness of the determination of a compactly supported function from the modulus of its Fourier transform". Soviet Mathematics - Doklady. 32: 668–670.
Klibanov, M.V. (1987). "Determination of a function with compact support from the absolute value of its Fourier transform and an inverse scattering problem". Differential Equations. 22: 1232–1240.
Klibanov, M.V. (1987). "Inverse scattering problems and restoration of a function from the modulus of its Fourier transform". Siberian Math. J. 27 (5): 708–719. doi:10.1007/bf00969199. S2CID 120840929.
Klibanov, M. V. (1989). "Uniqueness of the determination of distortions of a crystal lattice by the X-ray diffraction in a continuous dynamical model". Differential Equations. 25: 520–527.
Klibanov, M.V. & Sacks, P.E. (1992). "Phaseless inverse scattering and the phase problem in optics". J. Math. Phys. 33 (11): 2813–3821. Bibcode:1992JMP....33.3813K. doi:10.1063/1.529990.
Klibanov, M. V.; Sacks, P.E. (1994). "Use of partial knowledge of the potential in the phase problem of inverse scattering". J. Comput. Phys. 112 (2): 273–281. Bibcode:1994JCoPh.112..273K. doi:10.1006/jcph.1994.1099.
Klibanov, M. V.; Sacks, P.E.; Tikhonravov, A.V. (1995). "The phase retrieval problem". Inverse Problems. 11 (1): 1–28. Bibcode:1995InvPr..11....1K. doi:10.1088/0266-5611/11/1/001. S2CID 250916850.
Klibanov, M. V. (2006). "On the recovery of a 2-D function from the modulus of its Fourier transform". J. Math. Anal. Appl. 323 (2): 818–843. doi:10.1016/j.jmaa.2005.10.079.
Kata Kunci Pencarian:
- Daftar merger dan akuisisi oleh Microsoft
- Daftar istilah fotografi
- Phase retrieval
- Phase problem
- Spacing effect
- Gerchberg–Saxton algorithm
- Difference-map algorithm
- Fourier ptychography
- Distributed practice
- Cauchy wavelet
- Phase correlation
- Retrieval-augmented generation