Glossary¶
A–M¶
Alignment-
A mapping from the features of a dataset to the state variables, parameters, and initial conditions of a model. The alignment supports constructing configurations of a simulation. The SIR compartmental model and a training dataset with features
truth-incident_cases,truth-incident_deaths,truth-incident_hospitalizationcan have the following model-data alignment:{ "S": null, "I": null, "R": null, "I_obs": null, "N": null, "R_frac": null, "ℜ₀": null, "inc_I_obs": "truth-incident_cases", "inc_D": "truth-incident_deaths", "inc_H": "truth-incident_hospitalization" }Note
Incident refers to new occurrences, while "prevalent" refers to current (new and preexisting) occurrences.
Calibration-
A form of statistical inference for determining or updating the value (point estimate or posterior distribution) of model parameters given a reference dataset of observations. The result is typically selected to balance trade-offs between consistency with the modeler's expert knowledge and the "fit" of model observables to the dataset. In advanced cases, other selection criteria can include robustness to model misspecification, interpretability, focus on one statistical quantity of interest, and data privacy and security.
Configuration-
Any set of values used as input for a simulator. A configuration is a simulation and model-specific representation of a scenario.
Downscaling-
In climate science, the process of combining high-resolution observational data with low-resolution models to generate high-resolution simulation results that would otherwise be too coarse to accurately capture the dynamics of large-scale phenomena such as hurricanes.
Fitting-
See Calibration.
Gene regulatory network
See RegNet.
GNR
See RegNet.
Hyperparameter-
A quantity that's an input of a simulator. Hyperparameters can't be inferred from data and can impact the precision and accuracy of the resulting simulation.
loss,penalty, andtolare hyperparameters of the stochastic gradient descent algorithm . Inference-
See Calibration.
Initial condition-
A parameter that corresponds to the value of a state variable at a starting time point. In a model, there are as many initial conditions as state variables.
S₀,I₀, andR₀are initial conditions of the SIR compartmental model. Lineage graph-
A subgraph of the provenance graph. A lineage graph tracks the versioning of an artifact, containing all the data processing steps that lead to its creation.
Model-
An abstract representation that approximates the behavior of a system. For example, a set of ordinary differential equations can approximate the course of an epidemic.
Model template-
A template is a piece of a model that depicts a common transition for a variable or group of variables. Model templates can be used to quickly edit or create a model. For each model framework, the available model templates make up a list of all the possible states and transitions.
Depending on the modeling framework, available model templates may include:
- Natural conversion
- Natural production
- Natural degradation
- Controlled conversion
- Controlled production
- Controlled degradation
- Grouped controlled conversion
- Grouped controlled production
- Grouped controlled degradation
- Natural replication
- Controlled replication
Modeling-
The process of building a model or a simulation.
Modeling framework-
Examples include PetriNets, gene regulatory networks (regnets), stock and flow diagrams, and agent-based models (ABMs). Terarium uses the JSON serialization schemas defined in the ASKEM Model Representations repository.
N–Z¶
Observable-
A quantity of a system (and corresponding model) that's measurable as an "observation" data point. The SIR compartmental model has the following observables:
I_obs(observed infected population)N(total population)R_frac(recovered population fraction)ℜ₀(basic reproduction ratio )inc_I_obs(observed incident infection rate)
Observation function-
A function that maps state variables to an observable, capturing knowledge such as the physics of the observation or measurement process and expert heuristics. For example:
I_obs = 0.50 * I N = S + I + R R_frac = R / N ℜ₀ = β * S / γ inc_I_ob = diff(I_obs(t), t)) * Heaviside(diff(I_obs(t), t))For convenience, an observation function may refer to observables in many cases.
Operator-
An operator in the workflow graph is a node that can represent a model, dataset, or document or a scientific modeling process that modifies or executes project resources.
Operators that perform scientific modeling processes have two configuration methods. A wizard view exposes the most common configuration options, while a notebook view provides a direct interface to writing and editing executable code. Additionally, an integrated AI assistant can generate code based on natural language input.
Optimization-
The process of determining the values of variables that minimize or maximize some objectives subject to constraints. These variables typically represent possible
interventionsto achieve an outcome (for example, adjusting the duration of a masking policy to reduce the number of hospitalized individuals). In risk-based optimization under uncertainty (RBOUU), the objectives and constraints can be functions of distributions of model parameters and outputs. For example, the constraint that the probability of a super-spreader event never exceeds a threshold value.Note
Fittingcan involve optimization ("optimal fitting" and "constrained optimal fitting") but not necessarily (approximation with a "particle filter"). Parameter-
A fixed quantity of a model. Parameters consist of the constants internal to the model. They can be inferred from data made from observations of the underlying system.
βandγare parameters of the SIR compartmental model and weights of an artificial neural network model. PetriNet-
A PetriNet---or place/transition network---is a modeling framework that represents the dynamic behavior of a system. Circular nodes represent variables or places taken at states or compartments. Square nodes represent transitions. Edges between nodes show the flow of variables or places through various transitions.
A PetriNet could, for example, represent populations of people transitioning between the different states in an SIR (susceptible/infected/recovered) model.
Provenance graph-
A directed graph constructed from all the artifacts created by the workflow (as nodes) and the relations between them (as links):
Relation Type Source Node Type Target Node Type 0 COPIED_FROM Model Model 1 COPIED_FROM ModelRevision ModelRevision 2 GLUED_FROM Model Model ... ... ... 32 IS_CONCEPT_OF Concept Dataset RegNet-
A RegNet---or gene regulatory network (GNR)---is a modeling framework commonly used in systems biology. Nodes (or vertices) represent genes, proteins, or chemicals, while edges represent regulatory relationships or interactions between them.
Result-
The output of a simulation (partial or complete).
Run-
An execution of a simulation.
Scenario-
A natural-language description of the context, problems, or questions that require a modeling and simulation process.
Given a scenario, you can construct a simulation and a configuration through modeling to execute a run and generate a result. During a run, the simulator has states and values (running time, memory usage) incidental to the simulation that are not considered part of the result, but that may influence future model and simulation selection.
Simulating-
The process of executing a simulation on a simulator.
Simulation-
An executed instance of a model. A simulation suggests the behavior of the underlying system under specific conditions. What makes a model executable depends on the details of the model, the simulator, and the goals of the people involved.
Simulator-
A program that runs a model with specific input values and generates output values.
State variable-
A varying quantity of a system and corresponding model. In combination with others, these variables can fully determine the "state" of the underlying system.
S,I, andRare state variables of the SIR compartmental model. Stock and flow-
Stock and flow is a modeling framework commonly used in system dynamics. Nodes represent stocks or reservoirs that accumulate over time. Edges between stocks flow into valves that represent how accumulations change.
Strata model-
A model that captures the fine-grained interactions between the different strata state variables. Examples include infectious contact between subpopulations of different age groups and travel by individuals between different locations.
Stratification-
The process of dividing the populations of a model into subsets (subpopulations or strata), often along demographic characteristics such as age and location. The goal is to include more fine-grained interactions—those between the strata—into the model.
It is herein implemented as a kind of "typed" Cartesian product between the graph representation of a model
Pand that of one or many "strata models"Q. The stratified modelGhas:- A node for every pair of nodes in
PandQof the same type. - A link for every link in
PorQwith the same pair of node types.
- A node for every pair of nodes in
Training-
See Calibration.
Workflow graph-
A data-flow diagram made up of high-level operators that represent resources and scientific modeling processes such as configure model, calibrate model, and simulate model.