HLIBpro
2.1
|
Table of contents |
In this example, an integral equation is to be solved by representing the discretised operator with an 𝓗-matrix, whereby the 𝓗-matrix is constructed using a matrix coefficient function for entries of the equation system. Furthermore, for the iterative solver a preconditioner is computed using 𝓗-LU factorisation.
Given the integral equation
with being sought for a given right hand side , the Galerkin discretisation with constant ansatz functions
leads to a linear equation system where contains the coefficients of the discretised and is defined by
The right hand side is given by
The code starts with the standard initialisation:
Together with the initialisation, the verbosity level of HLIBpro was increased to have additional output during execution. The macro CHECK_INFO
is defined as
and tests the result of a 𝓗𝖫𝖨𝖡𝗉𝗋𝗈 function call. If an error occured, the description of this error is queried using hlib_error_desc
and printed to the screen.
For clustering the unknowns, coordinate information is to be defined. For this 1d example, the indices are placed in the middle of equally sized intervals.
Coordinates in HLIBpro are vectors of the corresponding dimension, e.g. one in for this problem.
Having coordinates for each index, the cluster tree and block cluster tree can be constructed.
For the cluster tree, the partitioning strategy for the indices is determined automatically by choosing HLIB_BSP_AUTO
. For the block cluster tree, the standard admissibility condition (see {eqn:std_adm}) is used as defined by HLIB_ADM_STD_MIN
with .
Using the block cluster tree, finally the 𝓗-matrix can be constructed. For this, adaptive cross approximation (ACA, see ???) is applied to compute low rank approximations of admissibly matrix blocks. In HLIBpro, an improved version of ACA, ACA+ is implemented.
ACA only needs access to the matrix coefficients of the dense matrix. Those are provided by a coefficient function kernel
with an optional parameter h
, which will be discussed below. Furthermore, the block wise accuracy of the 𝓗-matrix approximation has to be defined. In this case, an accuracy of is used and stored in acc
. Finally, a flag indicates, whether the matrix is symmetric or not.
The coefficient function used during matrix construction comes in the form of a callback function:
It is called, whenever matrix coefficients are needed. Usually, a complete block of coefficients is requested by the internal matrix constructor, e.g. a full dense block in case of an inadmissible leaf or a full row/column in case of ACA. The row (column) indices for the requested coefficients are stored in rowidx
(colidx
) and are of size n
(m
). In the array matrix
the coefficients have to be stored in column-wise order, e.g. the coefficient with the i'th row index and the j'th column index at position (i,j) or at matrix
[j*n+i]. The final argument arg
is the additional parameter to hlib_matrix_build_coeff
at can be used to provided necessary data for the coefficient function. Here, it is the stepwidth of the problem.
For a complex coefficient functions the callback function is identical except the value type of matrix
, which is of type hlib_complex_t
.
Having constructed the 𝓗-matrix, the corresponding equation system shall be solved. In most cases, a standard iterative solve will not be sufficient, e.g. either the convergence rate is to bad or there is no convergence at all. Therefore, a (good) preconditioner is needed, with the inverse of being the best possible. Since iterative schemes, only need matrix-vector multiplication, the corresponding functionality for the inverse is sufficient. This is provided by an (𝓗-) LU factorisation of , or for the symmetric case, a LDL factorisation:
Since A
is modified during the factorisation, a copy B
is constructed. For the factorisation the accuracy is chosen to be equal as during matrix construction. The result of the factorisation is linear operator represented by the type hlib_linearoperator_t
, which can be used to perform matrix vector multiplication. In this case, the evaluation of the inverse of is provided.
Still missing is the right hand side of the equation system. In this example, it was chosen, such that :
Here, hlib_vector_import
copies the coefficients from b_arr
to b
and rhs
is defined by
b_arr
, the coefficients of b
could directly be set using hlib_vector_entry_set
.For the iterative solver the special auto solver is used, which automatically chooses a suitable solver for the given combination of matrix and preconditioner. Furthermore, statistics about the solution process is stored in solve_info
, e.g. convergence rate, number of steps.
hlib_linearoperator_t
type.To check the accuracy of the computed solution, we compare it with the known exact solution:
Finally, all resources have to be freed and 𝓗𝖫𝖨𝖡𝗉𝗋𝗈 be finished:
A_inv
and the copy B
of A
are explicitly freed. By default, A_inv
will only use the provided matrix object and does not perform memory management, e.g. deletion.