Numerical methods
leads to an incremental least squares problem $\min_x \| A x - b \|_2$
($A$ is a large dense matrix of several Gbytes)
can be solved by:
the normal equations method ($A^T A x = A^T b$)
or orthogonal transformations (e.g. QR factorization)
requirements:
Performance (daily computation)
Memory storage
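As an illustration of the two approaches, here is a minimal sketch on a small random problem (plain numpy/scipy; the sizes, variable names and data are illustrative assumptions only, not the actual application):

import numpy as np
from scipy.linalg import cho_factor, cho_solve, qr, solve_triangular

rng = np.random.default_rng(0)
m, n = 1000, 50                      # illustrative sizes; the real problem is far larger
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)

# normal equations: solve (A^T A) x = A^T b via a Cholesky factorization
x_ne = cho_solve(cho_factor(A.T @ A), A.T @ b)

# orthogonal transformations: economic QR, then a triangular solve
Q, R = qr(A, mode='economic')
x_qr = solve_triangular(R, Q.T @ b)

print(np.allclose(x_ne, x_qr))       # both solutions agree on this well-conditioned problem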
Normal equations
steps for solving $A^T A x = A^T b$:
updating $A^T A$ and $A^T b$ ($\approx m n^2$ flops)
Cholesky factorization $A^T A = L L^T$ ($n^3/3$ flops)
triangular solve: $L y = A^T b$ followed by $L^T x = y$ ($2 n^2$ flops)
computational cost: $\approx m n^2 + n^3/3$ flops ($\approx m n^2$ if $m \gg n$)
accuracy: the forward error is governed by $\kappa(A)^2 = \kappa(A^T A)$
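A minimal sketch of these steps for the incremental case (plain numpy/scipy; the block sizes and per-block loop are illustrative assumptions): each new observation block only updates $A^T A$ and $A^T b$, and the factorization and solves are performed once at the end.

import numpy as np
from scipy.linalg import cholesky, solve_triangular

rng = np.random.default_rng(1)
n = 50
C = np.zeros((n, n))                     # accumulates A^T A
d = np.zeros(n)                          # accumulates A^T b

for day in range(5):                     # incremental update: one observation block per step
    A_i = rng.standard_normal((200, n))
    b_i = rng.standard_normal(200)
    C += A_i.T @ A_i                     # ~ m_i * n^2 flops per block
    d += A_i.T @ b_i

L = cholesky(C, lower=True)              # Cholesky factorization, ~ n^3/3 flops
y = solve_triangular(L, d, lower=True)   # forward substitution: L y = A^T b
x = solve_triangular(L.T, y)             # back substitution:    L^T x = y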
QR factorization
general QR: $A = QR$ with $Q$ $m$-by-$m$ orthogonal and $R$ $m$-by-$n$ upper triangular; compute $c = Q^T b$ and solve $R(1{:}n,1{:}n)\, x = c(1{:}n)$ ($c(1{:}n)$ denotes the first $n$ elements of $c$)
incremental QR: only $R$ is stored and updated
we perform the QR factorization of $\begin{pmatrix} R \\ A_i \end{pmatrix}$, which is overwritten by the updated triangular factor $R$
computational cost: $2 m n^2 - \frac{2}{3} n^3$ flops ($\approx 2 m n^2$ if $m \gg n$)
accuracy: governed by $\kappa(A)$
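A minimal sketch of the incremental QR idea (plain numpy/scipy; sizes and names are illustrative assumptions): only the $n \times n$ factor $R$ and the reduced right-hand side $c = Q^T b$ are kept, and each new observation block is stacked under the current $R$ and re-triangularized.

import numpy as np
from scipy.linalg import solve_triangular

rng = np.random.default_rng(2)
n = 50
R = np.zeros((n, n))                      # current triangular factor
c = np.zeros(n)                           # current reduced right-hand side Q^T b

for day in range(5):                      # one new observation block per step
    A_i = rng.standard_normal((200, n))
    b_i = rng.standard_normal(200)
    Q, R = np.linalg.qr(np.vstack([R, A_i]))     # re-triangularize [R; A_i]
    c = Q.T @ np.concatenate([c, b_i])           # update the reduced right-hand side

x = solve_triangular(R, c)                # triangular solve R x = c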
Normal equations vs QR factorization
QR costs twice as much ($2 m n^2$ vs $m n^2$ flops)
by forming the normal equations we square the condition number ($\kappa(A^T A) = \kappa(A)^2$)
if $\kappa(A)$ is large and the residual is small, then the normal equations may be less accurate
when applied to large-residual, ill-conditioned problems, both methods give comparably inaccurate results
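A minimal sketch illustrating the condition-number squaring (plain numpy; the graded matrix is a hypothetical example, not the actual data):

import numpy as np

rng = np.random.default_rng(3)
# graded, ill-conditioned test matrix: columns scaled from 1 down to 1e-5
A = rng.standard_normal((500, 30)) @ np.diag(np.logspace(0, -5, 30))

print(np.linalg.cond(A))          # kappa(A), roughly 1e5
print(np.linalg.cond(A.T @ A))    # kappa(A^T A), roughly 1e10: squared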
Need for parallelism
supposing a peak performance of 4.2 Gflops for the matrix-matrix product, the sequential in-core computation cannot meet the daily requirement
in-core storage of the full matrix is impossible with 32 procs and 2 Gbytes of memory per proc
assembling time (sequential)
result within a day using 32 procs depends on scalability of the algorithm
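A quick back-of-the-envelope check of the storage constraint, a sketch using the 65 / 32.5 Gbytes figures quoted in the Objectives below:

procs, mem_per_proc_gb = 32, 2.0
full_matrix_gb, packed_gb = 65.0, 32.5     # figures from the Objectives slide

total_mem_gb = procs * mem_per_proc_gb     # 64 Gbytes aggregate memory
print(total_mem_gb >= full_matrix_gb)      # False: the full matrix does not fit in-core
print(total_mem_gb >= packed_gb)           # True: the packed (half) matrix does fit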
Existing libraries
Standard parallel libraries for dense linear algebra computations on parallel distributed machines:
ScaLAPACK (University of Tennessee - J. Dongarra)
PLAPACK (University of Texas, Austin - R. Van de Geijn)
There is no packed format for symmetric ($A^T A$) or triangular ($R$) matrices in ScaLAPACK or PLAPACK (the whole matrix is stored)
Objectives
exploit parallelism
take advantage of the structure by storing only half of the matrix (32.5 Gbytes instead of 65 Gbytes), as sketched below,
obtain Gflops performance similar to ScaLAPACK
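A minimal sketch of the packed-storage idea (plain numpy, not the distributed format targeted here; sizes and helper names are illustrative assumptions): only the upper triangle of the symmetric matrix is kept in a one-dimensional array, roughly halving the memory.

import numpy as np

def pack_upper(C):
    """Keep only the upper triangle of a symmetric matrix as a 1-D array."""
    return C[np.triu_indices(C.shape[0])]        # n*(n+1)/2 entries instead of n*n

def packed_index(i, j, n):
    """Position of entry (i, j), with i <= j, in the row-major packed array."""
    return i * n - i * (i + 1) // 2 + j

rng = np.random.default_rng(4)
A = rng.standard_normal((200, 50))
C = A.T @ A                                      # symmetric normal-equations matrix
packed = pack_upper(C)
i, j = 3, 17
assert np.isclose(packed[packed_index(i, j, C.shape[0])], C[i, j])
print(C.nbytes, packed.nbytes)                   # memory roughly halved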