# What Is a Digital Twin? Experimental Design for a Data-Centric Machine Learning Perspective in Health

## Abstract

## 1. Introduction

The digital twin is a set of virtual information constructs that fully describes a potential or actual physical manufactured product from the micro atomic level to the macro geometrical level.

A digital twin is a virtual representation that serves as the real-time digital counterpart of a physical object or process and addresses every instance for its total life cycle.

## 2. What Is a Biological Digital Twin?

## 3. What Advantages Do the Digital Twins Provide?

## 4. What Is a Digital Twin System?

## 5. Experimental Design

## 6. Applications

## 7. Ethical Considerations

## 8. Discussion

- A digital twin simulates data.
- A digital twin cohort is a collection of digital twins.
- There are four types of data sources, and digital twin data are one of these.
- Digital twin data are time-dependent.
- A digital twin cohort is calibrated to a target patient at time ${t}_{i}$.
- A Digital Twin System consists of two main parts (S-DTS and I-DTS), which are collections of analysis methods.
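
The points on cohorts and calibration can be made concrete with a toy sketch. Everything in it is an illustrative assumption rather than the method described here: the `DigitalTwin` class, its linear biomarker model, the tolerance-based `calibrate` function, and the intervention names and effect sizes are all hypothetical. It only shows the general pattern: keep the twins whose simulated state matches the target patient at time ${t}_{i}$, then simulate different interventions forward from that point.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class DigitalTwin:
    """Toy twin (assumption): a biomarker that changes linearly over time."""
    baseline: float
    rate: float

    def simulate(self, t: float, effect: float = 0.0, t_start: float = 0.0) -> float:
        # Before t_start the biomarker follows the untreated rate;
        # from t_start on, the intervention lowers the rate by `effect`.
        untreated = self.baseline + self.rate * min(t, t_start)
        treated = (self.rate - effect) * max(t - t_start, 0.0)
        return untreated + treated


def calibrate(cohort: List[DigitalTwin], observed: float, t_i: float,
              tol: float) -> List[DigitalTwin]:
    """Keep twins whose untreated value at t_i is within `tol` of the patient's value."""
    return [tw for tw in cohort if abs(tw.simulate(t_i) - observed) <= tol]


def simulate_interventions(cohort: List[DigitalTwin], t_i: float, t_end: float,
                           effects: Dict[str, float]) -> Dict[str, float]:
    """Mean simulated biomarker at t_end for each intervention started at t_i."""
    return {
        name: sum(tw.simulate(t_end, effect, t_i) for tw in cohort) / len(cohort)
        for name, effect in effects.items()
    }


# A small, deterministic cohort of candidate twins (hypothetical parameter grid).
cohort = [DigitalTwin(b, r) for b in (1.0, 2.0) for r in (0.5, 1.0, 1.5, 2.0)]

# Calibrate to a target patient observed at t_i = 2 with biomarker value 4.0.
calibrated = calibrate(cohort, observed=4.0, t_i=2.0, tol=0.5)

# Compare two hypothetical interventions starting at t_i, evaluated at t = 5.
outcomes = simulate_interventions(
    calibrated, t_i=2.0, t_end=5.0, effects={"none": 0.0, "drug": 0.5}
)
```

In this sketch, each entry of `outcomes` corresponds to one simulated patient trajectory averaged over the calibrated cohort; a real digital twin would replace the linear model with a mechanistic or learned simulator.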

## 9. Conclusions

- A Digital Twin System is a complex entity with interconnected substructures.
- Each substructure needs to be optimized for a given problem setting, e.g., in medicine or health.
- A digital twin is just one method for simulating intervention-dependent data.
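
As a rough illustration of the two-part structure, the sketch below treats the S-DTS as a set of analysis methods applied independently to each data source and the I-DTS as a step that integrates those results. The function names `run_s_dts` and `run_i_dts`, the placeholder source names, the example values, and the choice of averaging as the integration rule are all assumptions made for this example, not details taken from the text.

```python
from statistics import mean
from typing import Callable, Dict, List

# An analysis method maps the values of one data source to a single summary number.
Analysis = Callable[[List[float]], float]


def run_s_dts(sources: Dict[str, List[float]],
              analyses: Dict[str, Analysis]) -> Dict[str, Dict[str, float]]:
    """S-DTS (sketch): apply every analysis method to every data source separately."""
    return {
        src: {name: fn(values) for name, fn in analyses.items()}
        for src, values in sources.items()
    }


def run_i_dts(results: Dict[str, Dict[str, float]], analysis_name: str) -> float:
    """I-DTS (sketch): integrate one analysis across sources by plain averaging."""
    return mean(per_source[analysis_name] for per_source in results.values())


# Four placeholder data sources; only "digital_twin" is named in the text.
sources = {
    "source_i": [1.0, 2.0, 3.0],
    "source_ii": [2.0, 4.0],
    "source_iii": [3.0],
    "digital_twin": [2.0, 2.0, 2.0, 6.0],
}
analyses = {"mean": mean, "max": max}

single_results = run_s_dts(sources, analyses)
integrated_mean = run_i_dts(single_results, "mean")
integrated_max = run_i_dts(single_results, "max")
```

Keeping the two stages separate means each analysis method and the integration rule can be swapped or tuned independently for a given problem setting.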

**Figure 1.** Visualizing the idea of a digital twin by comparing experimental settings in biology and medicine.

**Figure 2.** Complexity of the data (**A**–**C**) and the analysis system (**D**, **E**). (**A**): A simplified view of an analysis system that has access to four different data sources. If all four data sources (i) to (iv) are available, we call the analysis system a Digital Twin System. (**B**): Availability of data to the Digital Twin System over time. The time dependency of the different data sources is important. (**C**): Starting from a calibrated digital twin cohort at time ${t}_{i}$, different outcomes of various interventions are shown corresponding to different patient trajectories. (**D**): Part of the Digital Twin System for single analyses (S-DTS). (**E**): Part of the Digital Twin System for integration of analysis results (I-DTS).

**Figure 3.** Main structure of a Digital Twin System consisting of S-DTS and I-DTS, which themselves have a complex substructure.

**Table 1.** An overview of different intervention types that can be simulated by different digital twins.

| Intervention Type: External Condition | Intervention Type: Internal Condition |
|---|---|
| environmental changes | knockdown effects |
| diet changes | gene therapy |
| surgery | pharmaceutical interventions |

