Next: Note:
Up: Globalizing Newton's method: Descent
Previous: A secant update for
I now return to the question of finding a secant update for the Hessian
that preserves symmetry and positive definiteness. That is, given
a symmetric and positive definite matrix Hk and
I want to find a symmetric positive definite matrix Hk+1 that satisfies
Such an Hk+1 does not necessarily exist. If (6) holds,
where Hk+1 is positive definite, then
Therefore, if
fails, it will be impossible to find Hk+1.
The condition
is equivalent to
 |
(7) |
A strictly convex function f would satisfy (7) for any points
x(k) and x(k+1). The condition (7) therefore means that
and
are consistent with f's having
positive curvature on the line segment between x(k) and x(k+1). If this
condition were to fail, it would be impossible to find a positive definite
matrix Hk+1 satisfying the secant equation.
Next I wish to point out the following: No rank-one update can produce
Hk+1 that is symmetric, positive definite, and satisfies the
secant equation, at least not in every case. Indeed, to
preserve symmetry and positive definiteness, a rank-one update would
have to take the form
Hk+1=Hk+uuT.
However, it is easy to show that, in some cases (even when
holds), no vector u causes Hk+1 to satisfy the secant
equation.4
Since a rank one update cannot be found, I will try a different
approach. To simplify the notation in the following derivation, I will
write H=Hk,
H+=Hk+1, s=s(k), and y=y(k). The positive definite
matrix H has a Cholesky factorization H=LLT. I will look for H+ in
the form H+=JJT, where J is nonsingular and close to L in some sense.
The reader should notice that, if J is nonsingular, then JJT is
necessarily symmetric and positive definite.
If H+ is to satisfy the secant equation, then
JJTs=y
must hold. The matrix J will be found by a two-step process:
- 1.
- Given any vector v, choose J so that Jv=y and J is as
close as possible to L. Broyden's method shows how to do this:
 |
(8) |
- 2.
- Choose v so that JTs=v also holds (then JJT will satisfy
the secant equation (6)). Using (8),
This last equation is quite nonlinear in v, and to solve it directly would
be difficult or impossible. However, it shows that the vectors v and
LTs must point in the same direction. Therefore, there exists
such that
Substituting this formula for v into (8) and simplifying yields
(to arrive at this formula for J, I used the fact that H=LLT).
I now use the equation JTs=v, where
,
to solve for
:
There are two solutions for
;
the positive solution is the correct one
to take, since then J=L if H already satisfies the secant equation.
(The reader should notice that the requirement that
be positive
appears here.)
Therefore, if
 |
(9) |
where
then H+=JJT is symmetric, positive definite, and satisfies
H+s=y.
Since J, as given by (9), is not necessarily lower triangular
(even though L is), it is more convenient to express the update
in terms of H and H+ rather than in terms of L and J.
A tedious calculation, which simplifies nicely in the end, shows that
The resulting update is known as the BFGS update:5
 |
(10) |
Next: Note:
Up: Globalizing Newton's method: Descent
Previous: A secant update for
Mark S. Gockenbach
2003-02-17