COMSOL 6.4 - Direct

The node (

) is an attribute that handles settings for direct linear system solvers. Use it together with a Stationary Solver, Eigenvalue Solver, and Time-Dependent Solver, for example. The attribute can also be used together with the Coarse Solver attribute when using multigrid linear system solvers.

An alternative to the direct linear system solvers is given by iterative linear system solvers which are handled via the Iterative attribute node. Several attribute subnodes for solving linear systems can be attached to an operation node at once. However, if a solver is used only one can be active at any given time.

Also see Choosing the Right Linear System Solver, which describes The MUMPS Solver, The PARDISO Solver, The SPOOLES Solver, the The Dense Matrix Solver, and The NVIDIA cuDSS® Solver.

General

Select a linear system . Select:

•

(multifrontal massively parallel sparse direct solver) (the default). Also see MUMPS below for more information.

•

(parallel sparse direct solver). See Ref. 2 for more information about this solver. Also see PARDISO below for more information.

•

(sparse object-oriented linear equations solver). See Ref. 3 for more information about this solver. Also see SPOOLES below for more information.

•

to use a dense matrix solver. The dense matrix solver stores the LU factors in a filled matrix format. It is mainly useful for boundary element (BEM) computations. This solver does not have any additional settings.

•

(NVIDIA cuDSS CUDA direct sparse solver). See Ref. 4 for more information about this solver. Also see cuDSS below for more information.

MUMPS

For it estimates how much memory the unpivoted system requires. Enter a to tell MUMPS how much more memory the pivoted system requires. The default is 1.2.

Select a : (the default automatically selected by the MUMPS solver), , , , , or .

Select the checkbox (selected by default) to control whether the solver should use a maximum weight matching strategy or not. Click to clear the checkbox to turn off the weight matching strategy.

Select the checkbox (selected by default) to reuse the reordering of the system, which speeds up the computation but leads to a higher memory peak. If you have selected that checkbox, the checkbox is available. It is selected by default to store the sparsity patterns of the assembled matrices and try to reuse them for successive assembly processes within the same solution process. In many cases, the sparsity pattern of the system matrices does not change from one nonlinear iteration or time step to the others. Reusing the sparsity pattern from the previous iteration or step can then improve the solution performance at the cost of a usually small amount of memory.


	The Reuse preordering option has a weak dependence on the system matrix. In extreme cases, this can cause the solvers to fail. If you suspect this is the problem, make sure that the Check error estimate setting is not set to No in the Error section below. Then, if the linear solvers fail and the preordering is old, a new preordering will be done.

Select the checkbox (selected by default) to use multithreading during matrix factorization. If you have selected that checkbox, the checkbox is available. It is selected by default so that the backward and forward solves run multithreaded. These two settings mainly improve performance for larger problems when there are many cores. Click to clear the checkboxes to not run the solver multithreaded.

The default is , which controls whether or not pivoting should be used.

•

If the default is kept (), enter a number between 0 and 1. The default is 0.01. This means that in any given column, the algorithm accepts an entry as a pivot element if its absolute value is greater than or equal to the specified pivot threshold times the largest absolute value in the column.

•

For , enter a value for the , which controls the minimum size of pivots (the pivot threshold). The default is 10−8.


	The perturbation strategy is not as robust as ordinary pivoting. In order to improve the solution, MUMPS uses iterative refinements.

Select the checkbox to make a low rank approximation of the LU factors, both when computing them and when storing them. The value for the controls the quality of the approximation. The block low rank factorization is an approximate but accurate LU-factorization method that can provide a faster factorization and potentially save memory in your models. For the compression, select a type from the list: (the default) or . The aggressive compression can potentially use less memory and be faster, at the expense of a slightly lower accuracy for the approximations of the LU factors.

The MUMPS out-of-core solver stores the LU factors on the hard drive. This storage minimizes the internal memory usage. The cost is longer solution times because it takes longer time to read and write to disk than it takes using physical memory.


	You can specify the temporary directory where MUMPS stores the LU factors using the -tmpdir option. See the Running COMSOL Multiphysics chapter for details.

From the list, choose to store all matrix factorizations (LU factors) as blocks on disk rather than in the computer’s memory. The solver reads some of the blocks into memory and performs the LU-factorization on the part that is currently in memory. The blocks of data are then written back to disc and new blocks are read into memory. The size of the blocks that the solver reads from disc is controlled by the in-core memory setting. Choose to not store the matrix factorizations on disk. The default setting is , which switches the storage to disk (out-of-core) if the estimated memory (for the LU factors) is exhausting the physically available memory. For the automatic option, you can specify the fraction of the physically available memory that will be used before switching to out-of-core storage in the (a value between 0 an 1). The default is 0.99; that is, the switch occurs when 99% of the physically available memory is used.

When the list is set to or , you can choose to specify how to compute the in-core memory to control the maximum amount of internal memory allowed for the blocks (stored in RAM and not on disk) using the list:

•

Choose (the default) to derive the in-core memory from system data and a given formula:

(20-62)

where you can specify Mmin in the field (default 512 MB), fuse in the field (default: 0.8; that is, 80% of currently available memory), and Kint in the field (default: 3). Mtot is the total physical memory on the computer, and Muse is the physical memory used on the computer before the solver starts.

•

Choose to specify the in-core memory directly in the field. The default is 512 MB.

PARDISO

Select a : (the default to perform the nested dissection faster when COMSOL Multiphysics runs multithreaded), , or .

Select a to use when factorizing the matrix:

•

(the default): Selects one of the two algorithms based on the type of matrix.

•

: Choose this when you have many cores as it is usually faster.

Select the checkbox (selected by default) to control whether the solver should use a maximum weight matching strategy or not. Click to clear the checkbox to turn off the weight matching strategy.


	The Reuse preordering option has a weak dependence on the system matrix. In extreme cases, this can cause the solvers to fail. If you suspect this is the problem, make sure that the Check error estimate setting is not set to No in the Error section below. Then, if the linear solvers fail and the preordering is old, a new preordering will be done.

By default, the checkbox is not selected. Click to select it and control whether to use 2-by-2 Bunch–Kaufman partial pivoting instead of 1-by-1 diagonal pivoting.

By default, the checkbox is selected so that the backward and forward solves run multithreaded. This mainly improves performance when there are many cores and the problem is solved several times, such as in eigenvalue computations and iterative methods. Click to clear this checkbox to not run the solver multithreaded.

The field controls the minimum size of pivots (the pivot threshold ε).


	To avoid pivoting, PARDISO uses a pivot perturbation strategy that tests the magnitude of the potential pivot against a constant threshold of ε = α\|PPMPSDrADcP\|∞, where P and PMPS are permutation matrices, Dr and Dc are diagonal scaling matrices, and \| · \|∞ is the infinity norm (maximum norm). If the solver encounters a tiny pivot during elimination, it sets it to sign(lii)ε\|PPMPSDrADcP\|∞. The perturbation strategy is not as robust as ordinary pivoting. In order to improve the solution, PARDISO uses iterative refinements.

Select the checkbox to use the Parallel Direct Sparse Solver (PARDISO) for Clusters from the Intel® oneMKL (Intel® oneAPI Math Kernel Library) when running COMSOL Multiphysics in a distributed mode.

The PARDISO out-of-core solver stores the LU factors on the hard drive. This storage minimizes the internal memory usage. The cost is longer solution times because it takes longer time to read and write to disk than it takes using physical memory.


	You can specify the temporary directory where PARDISO stores the LU factors using the -tmpdir option. See the Running COMSOL Multiphysics chapter for details.

From the list, choose to store all matrix factorizations (LU factors) as blocks on disk rather than in the computer’s memory. The solver reads some of the blocks into memory and performs the LU-factorization on the part that is currently in memory. The blocks of data are then written back to disc and new blocks are read into memory. The size of the blocks that the solver reads from disc is controlled by the in-core memory setting. Choose to not store the matrix factorizations on disk. The default setting is , which switches the storage to disk (out-of-core) if the estimated memory (for the LU factors) is exhausting the physically available memory. For the automatic option, you can specify the fraction to be stored on disk in the (a value between 0 an 1; the default is 0.99). If needed, the out-of-core PARDISO solver automatically increases the in-core memory that is required.

•

Choose (the default) to derive the in-core memory from system data and a given formula:

(20-63)

•

Choose to specify the in-core memory directly in the field. The default is 512 MB.


	The PARDISO solver is not supported on ARM machines. Instead the COMSOL software will automatically switch to the MUMPS solver and use settings that are similar to the settings set up for PARDISO. In particular, the MUMPS solver will use Pivoting perturbation instead of the default pivoting setting.

SPOOLES

Select a : (the best of nested dissection and multisection), , , or .

Enter a number between 0 and 1. The default is 0.1. This means that in any given column the algorithm accepts an entry as a pivot element if its absolute value is greater than or equal to the specified pivot threshold times the largest absolute value in the column.

cuDSS


	The NVIDIA cuDSS direct sparse solver is only available for Windows and Linux. A compatible CUDA driver is also needed. cuDSS and CUDA are installed with the COMSOL software. If you do not have an NVIDIA GPU or an incompatible driver you will get an error when you try to compute a solution.


	It is possible to use an external installation of cuDSS by pointing to it in the Computing > GPU Acceleration preference settings (select the Use system cuDSS library checkbox and specify an alternative cuDSS installation in the Custom cuDSS directory field). Note, however, that it is not guaranteed that any other version of cuDSS than the one distributed by COMSOL will be compatible.

Select the checkbox (selected by default) to reuse the reordering of the system, which speeds up the computation but leads to a higher memory peak.


	The Reuse preordering option has a weak dependence on the system matrix. In extreme cases, this can cause the solvers to fail. If you suspect that this is the problem, make sure that the Check error estimate setting is not set to No in the Error section below. Then, if the linear solvers fail and the preordering is old, a new preordering will be done.

Select a : (the default), , , , or .

If , , or is selected from the list, select the checkbox to control whether the solver should use a maximum weight matching strategy or not. Click to clear the checkbox to turn off the weight matching strategy.

If is selected, choose an algorithm to use for weight matching from the list: (the default automatically selected by the cuDSS solver), , , , , or (see Ref. 4).

Choose an algorithm to use during the factorization phase from the list: (the default automatically selected by the cuDSS solver), , or .

Enter a number. The default is 1. The cuDSS algorithm accepts an entry as a pivot element if its absolute value is greater than or equal to the specified pivot threshold times the largest absolute value in the column.

From the list, choose (the default) or .

Enter a value for the , which controls the minimum size of pivots (the pivot threshold ε). The default is 10−13.

From the list, choose (the default) to let the memory limit be decided by the cuDSS solver, or choose or . Hybrid host device memory is useful when there is insufficient memory on the GPU. When you have chosen , also specify a value between 0 and 1 in the (default: 0.8). For and , you can also select the checkbox beside the text field and enter a limit as an integer in the field. The default is 4096 MB.

Select the checkbox to allow the solver to use both GPU and CPU for solving. Hybrid execution mode is useful when the matrix is too small to benefit from GPU computing; some of the computations are then carried out on the CPU.

Select the checkbox to run the computation on all GPUs on the machine on which the COMSOL process is running that meet the capability requirements.


	For more fine-grained control over which GPUs to run the computation on, you can use the -gpuid option. See the Running COMSOL Multiphysics chapter for details.

Error

You can control the accuracy of the solution of the linear system from the list:

•

The default is , meaning that the main solver is responsible for error management. The solver checks for errors for every linear system that is solved. To avoid false termination, the main solver continues iterating until the error check passes or until the step size is smaller than about 2.2·10−14. With this setting, linear solver errors are either added to the error description if the nonlinear solver does not converge, or added as a warning if the errors persist for the converged solution.

•

Choose to check for errors for every linear system that is solved. If an error occurs in the main solver, warnings originating from the error checking in the direct solver appear. The error check asserts that the relative error times a constant ρ is sufficiently small. This setting is useful for debugging problems with singular or near singular formulations.

•

Choose for no error checking.

Use the field to manually set the constant ρ. The default is 1. See Convergence Criteria for Linear Solvers for more information.

The checkbox is selected by default (except for the eigenvalue solver) so that iterative refinement is used for direct and iterative linear solvers. For linear problems (or when a nonlinear solver is not used), this means that iterative refinement is performed when the computed solution is not good enough (that is, the error check returned an error). It is possible that the refined solution is better. Iterative refinement can be a remedy for instability when solving linear systems with a solver where convergence is slow and errors might be too large, due to ill-conditioned system matrices, for example. If a nonlinear solver is used, iterative refinement is not used by default. You can often get away with intermediate linear solver steps, but if that is not the case, select the checkbox to use an iterative refinement. The default value in the field is 15; you can change it if needed. By default, the is set to 0.5. For both linear and nonlinear solver runs where iterative refinements are used, iterative refinements should be terminated when the error is not decreasing along the iteration. Therefore, the error ratio bound should always be smaller than 1. Because the error is computed in the L2 norm, the error ratio bound should always be greater than 0. Setting the error ratio bound to 0.5, as suggested by Ref. 7, is a more cautious approach that rarely yields significant underestimations or overestimations of errors, and it always terminates quickly compared to an error ratio bound much closer to 1. When is set to , a single warning, Iterative refinement triggered, appears in the window if the iterative refinement is triggered. When is set to , the same warning and the number of iterative refinements applied in each linear solver call are shown in the window.


	For an example using a Stationary Solver, The Blasius Boundary Layer: Application Library path COMSOL_Multiphysics/Fluid_Dynamics/blasius_boundary_layer. For an example using an Eigenvalue Solver, Isospectral Drums: Application Library path COMSOL_Multiphysics/Equation_Based/isospectral_drums.