COMSOL 6.2 - Surrogate Model Training

Use a study node (

) to add functionality for training a surrogate model. A surrogate model is a simpler and computationally cheaper model, which you can use to approximate the behavior of a more complex and computationally expensive model (the full finite element model). In COMSOL apps, for example, faster model evaluation using the surrogate model provides users of the apps with a more interactive user experience. You create surrogate models by training a deep neural network (DNN), for example. The surrogate model training is typically based on output data from a large parametric sweep of the model for which you want to create a surrogate model. The training of a surrogate model needs a number of inputs and outputs, where the outputs are considered as functions of the inputs, which replace the full finite element solution. A large number of data points are needed in order to fully describe how the inputs map to the outputs. The outputs can be captured using domain point probes, for example.

A straightforward parametric sweep could densely and uniformly distribute input points, but such an approach would be inefficient. Random sampling is another alternative but with inherent drawbacks such as nonuniform sampling and potential failure to cover the entire input space. Instead, a more strategic approach is to use a design of experiments (DOE) method, carefully sampling within the parameter space. The study uses Latin hypercube sampling (LHS), which is a DOE method that generates a dataset that uniformly covers the input space without requiring an excessive number of finite element computations. This property of LHS makes it an efficient method for data generation intended for the purpose of training a surrogate model.

The more data points that are available, the more accurately the surrogate model will be able to represent the actual solution. However, generating a large number of data points requires a large number of simulations to be run, so there is a tradeoff between the time it takes to generate all the data points and the desired accuracy the surrogate model.

	Tubular Reactor Surrogate Model Application: Application Library path COMSOL_Multiphysics/Applications/tubular_reactor_surrogate. If you have the Battery Design Module, see Surrogate Model Training of a Battery Rate Capability Model: Application Library path Battery_Design_Module/Applications/lib_rate_capability_surrogate.

For additional information about surrogate models in connection with uncertainty quantification, see the Uncertainty Quantification Module User’s Guide.

To add an study, right-click a node and choose . You can only have one node in each study.

Click the button (

) to initiate the surrogate model training and create a surrogate model.

If you choose from the node, the study generates and sets up a job and its subsequence node (see the Uncertainty Quantification Module User’s Guide). The job is controlled by and synchronized to the study. If you click from the node, the study identifies and runs its job if it exists; else it first creates a job.

The window for the study include the following sections:

From the list, choose one of the following options: (the default) or . will discard data from the previous run (if the same table is used) or start a new run using a new table. will add more data to an existing table, which can be useful if you realize after having examined the trained SM function that it does not seem accurate enough. The study outputs training data tables and trained surrogate models for each quantity of interest. If you choose for the surrogate model in the list below, a training data table will be the only result.

From the list, choose (the default), , , , , or . The solution to use will be the last solution for time-dependent and parametric solutions, while for eigenvalue and eigenfrequency solutions, it will be the first solution. You can override this mechanism by selecting any of the other methods. For , the quantity of interest (QoI) is defined as the summation of the over all the solutions. For (or ), the QoI is defined as the maximum (or minimum) of the expression taken over all the solutions. Also note that evaluation operators like at() and with() can be used in the expression, making it possible to evaluate even more general quantities from dynamic solutions.

From the list, choose (the default) to use a DOE model, or choose , , or : and require a license for the Uncertainty Quantification Module.

•

For , also, from the list, choose to create a new output table group or choose any existing table group. A group is created automatically.

•

For , also specify a covariance and a mean. From the list, choose , (the default), , , or . From the list, choose (the default), , or . Also, from the list, choose to create a new function or choose any existing function.

•

For , also specify (the default) or . If you chose , specify a value in the field (default value: 30), which terminates the increase of order for the PCE construction. You can also specify a value in the field (default: 0.5), which determines the truncation level of the polynomial basis. Also, from the list, choose to create a new function or choose any existing function.

•

For , under , define the layers in the DNN. In the column, choose (the default). In the column the current layer, you can see its settings. You can edit the table using the buttons under the table: Use the (

), (), and (

) buttons and the fields under tables to edit the table contents. Or right-click a table cell and select , , or . The button (

) adds a new layer to the list. Use the button (

) to clear the entire table. Underneath the table of layers, in the field, specify the number of output features, which are the neural networks (default: 1). Choosing the number of layers and nodes in a neural network is often an iterative process that involves a combination of knowledge about the specific problem and data, empirical testing, and a bit of trial and error.

You can also specify an activation function using the list. An activation function in a DNN defines how the weighted sum of the input is transformed into an output from a node or nodes in a layer of the network. The default activation is , for a hyperbolic tangent function, which is an S-shaped function. You can also choose for a linear (that is, no) activation function; for a rectified linear unit, an activation function defined as the positive part of its argument; for an exponential linear unit; or for a sigmoid function. The default activation, , is usually a good choice. and are less smooth than the other functions, so avoid them if the trained function later needs to be differentiated. Choosing is only useful for the last layer.

In the table under , define the outputs from the surrogate model (quantities of interest), which could be defined as point probe expressions, for example.

In the column, type the expression for the output (comp1.ppb1, for example, for a in ). In the column, type a description for the output (Temperature, for example). In the Individual solution to use column, the default is to use the solution specified in the list above. You can also choose , , , , , or . Use the (

), (), and (

) buttons and the fields under tables to edit the table contents. Or right-click a table cell and select , , or . The button (

) adds a new output to the list. Use the button (

) to clear the entire table.

In this section you define the input parameters for the surrogate model training.

For the input parameters, in the column, choose the parameter from a list of all global parameters in the model. In column, choose (the default) or .

If you chose , you can define the following properties for the selected parameter underneath the table:

•

In the list, choose the distribution for the input parameter: (the default), μσ, μσ, θ, αβ, λ, or μβ. All distributions except the uniform distribution have two distribution parameters shown under the list, such as the and for a normal distribution and and for a gamma distribution. You specify the distribution parameters in the corresponding text field. All distributions except the uniform distribution and beta distribution have , shown under the list.

•

In the list, choose the cumulative distribution function level for your lower bound: , , , (the default), , , , , or . These bounds automatically compute a lower bound by using the inverse cumulative distribution function.

•

In the list, choose the cumulative distribution function level for your upper bound: , , , (the default), , , , , or . These bounds automatically compute an upper bound by using the inverse cumulative distribution function.

•

For Manual bounds, you can enter bounds and units for the input parameter in the , , and columns. For the bounds, you can use unit syntax such as 0.45[mm], and for the unit, add its abbreviation, such as Pa for pascal.

The chosen analytic distribution appears in the column.

If you chose in the column, you can instead specify the following properties:

If the source type for the parameter is , you can add correlations groups. Doing so allows sampling of parameters that are not statistically independent by specifying a correlation matrix. You could select a subset of all input parameters in the same correlation group, and set the correlation matrix to specify the correlation between each pair of two parameters. Multiple correlation groups can be added in one study, and one parameter can only appears in one correlation group. The correlation matrix is a symmetric semidefinite matrix where all the diagonal elements equal to 1 and all the off-diagonal elements are between [−1, 1].

Under , specify the following setting if the source type is :

•

If is set to , then in the field, if is set to , or , if is set to , specify the number of input points to use (default: 20 or 10). You may want to use a larger number to train the surrogate model to a sufficient degree of accuracy. The number of input points must be chosen empirically.

•

From the list, choose (the default), , or . If you chose , enter a seed in the field. The random seed affects which sampling points you get from the Latin hypercube sampling.

You can edit the table using the buttons under the table:

Select the check box to add accumulated probe tables for the result from the surrogate model training. From the list, choose to create a new table, or choose any existing table. The check box is selected by default to accumulate all the model probes in the table. If you clear this check box, the probes selected from the list are used.

From the list, choose (the default) or .

From the list, choose (the default) to only keep the last model evaluation in memory, or choose to keep all model evaluations.

From the list choose (the default) or . Use this setting to control whether a solver sequence should be generated using global parameters or each parameter tuple.

Select the check box if you want the surrogate model training algorithm to reuse the solution from the previous step. It is useful if you are using an iterative solver and the solutions for different parameter values are similar enough so that it is faster to start from the previous solution instead of starting the solver from scratch.