Learning Processes in Hierarchical Pairs Regulate Entire Gene Expression in Cells

doi:10.21203/rs.3.rs-1136384/v1

Research Article

This work is licensed under a CC BY 4.0 License

The expression of numerous genes is precisely controlled in a cell in various contexts. While genetic and epigenetic mechanisms contribute to this regulation, how these mechanisms cooperate to ensure proper expression patterns across all genes remains unclear. Here, I theoretically show that the repetition of simple biological processes produces appropriate whole-gene expression, provided only that the appropriateness of the current pattern is roughly detectable. A learning pair model is developed, in which two factors autonomously approach the target ratio by repeating two stochastic processes: competitive amplification with a small addition term, and decay depending on the difference between the current and target ratios. Furthermore, thousands of factors are self-regulated in a hierarchical-pair architecture, in which the activation degrees competitively amplify while transducing the activation signal and decay at four different probabilities. Changes in whole-gene expression during human early embryogenesis and hematopoiesis are reproduced in simulation using this epigenetic learning process in a single genetically determined hierarchical-pair architecture of gene regulatory cascades. Against the background of this learning process, I propose the law of biological inertia, which states that a living cell basically maintains its expression pattern while renewing its contents.

Molecular Genetics

Systems Biology

gene expression

genetic and epigenetic mechanisms

biological process

A living cell is a complex adaptive system. The expression of a gene is controlled by many mechanisms, including transcription factors, chromatin modifications, and non-coding RNAs. Fine regulation of multiple genes is required for a cell to function appropriately, depending on the cell type and environment. Large omics datasets, including whole-gene expression data from single cells, are being accumulated using single-cell RNA sequencing (scRNA-seq) and other technologies. Based on these data, systems biology proposes gene regulatory networks (GRNs) that generate outputs from inputs ^1–3. However, GRNs covering many genes are complicated and require tuning of numerous parameters. It remains unclear how the expression levels of more than 10,000 genes are properly controlled in a cell ¹.

While molecular biology investigates causal relationships in cells as if they were well-designed machines, superior machines have acquired learning ability. Deep reinforcement learning and the AlphaGo algorithm in computer science have made great advances in playing board games ^4,5. Prior to these technologies, conventional software could not defeat professional board game players, even after collecting large volumes of data and tuning many parameters ⁴. Deep reinforcement learning includes a deep neural network and a Monte Carlo tree search. The neural network is a multilayer architecture, through which input data with high dimensionality are processed using a weight matrix ⁶. Error is calculated as the difference between the output and the correct answer to alter the weight matrix through backpropagation. Monte Carlo tree search stochastically selects a series of actions ⁵. By repeating these trial-and-error processes, the algorithm determines an optimal weight matrix for selecting correct actions in any situation. Such learning processes may be required for cells, which face too many situations to prepare for in advance ⁷.

The importance of stochastic and feedback processes has been proposed for complex adaptive systems ^7–9. Waddington's epigenetic landscape schematically visualizes the processes through which a cell autonomously, not deterministically, reaches an appropriate gene-expression pattern ^10,11. A cell may alter its gene expression by assessing the appropriateness of the current pattern. This is a kind of learning process. In this study, I theoretically show that biological processes in gene expression can regulate the expression of all genes at appropriate levels by acting as a learning process.

Amplification and error-dependent decay for learning

I attempt to clarify the processes through which factors autonomously reach their target ratio without individual commands by using a simple simulation model with two factors (Table 1). Here, the non-negative integer values of two factors, x_A and x_B, start from an initial value of 1 and change through 10⁴–10⁵ repeats of stochastic increase and decrease processes. The target ratio of T_A: T_B is set to 1: 2. In the increase process, which proceeds at a probability of α_inc = 0.1, either A or B is selected, and the value of the selected factor increases by one. x_A and x_B stochastically decay at a probability α_dec ε, where α_dec = 0.1. Thus, after a decrease process, x_A decays to a value selected from a binomial distribution with the number of trials x_A and the probability (1 – 0.1 ε). In this text, assumptions and settings are written in the present tense, whereas simulation results are written in the past tense.

Table 1

Variables and parameters in the models
Indicator	Meaning	Values	Comments
A, B	Identifier of two factors		A: B indicates the ratio of selection in the increase process
x_A, x_B	Value of each factor	Non-negative integer variables	Changing by increase and decrease
T_A: T_B	Target ratio of each factor	1: 2 (Figure 1, except for Figure 1g after 10⁵ repeats)	Calculated from RNAseq data (Figures 2k–5)
α_inc	Probability to enter increase process at each repetition	Constant value 0.1 in Figure 1–2. Variable (0.01–0.101) depending on the coverage of the pair in the whole in Figure 3c–f.	α_inc is replaced by the Monte Carlo tree search in the model with an mRNA in Figure 4–5
α_dec	Constant coefficient of decay probability	0.1	Applied in Figures 1–2
α_dec	Probability to enter decrease process at each repetition	0.1	Applied in Figures 3–5
β_A, β_B	Bias. In amplification, select either at a (x_A + β_A): (x_B + β_B) ratio	1 in Figures 1e–h, 2–3, 4a–f. 10⁻⁷ in Figures 1k–m, 4g–i.	Constant value to increase additively and to avoid extinction in amplification
MSE	Mean squared error between current and target ratios	\((x_A/(x_A + x_B) - T_A/(T_A + T_B))^2\)	\(= (x_B/(x_A + x_B) - T_B/(T_A + T_B))^2\). Same value for A and B
ε	Error; a parameter of the decay probability	Constant (Figure 1a–c, e–f)	The x_A value after a decrease process is randomly selected from a binomial distribution with the number of trials x_A and the probability (1 – α_dec ε) in Figures 1–2 or (1 – ε_(x)) in Figures 3–5. 4-step error is applied in Figure 5.
ε_(x)		MSE (Figure 1d, g–m, 2b–c, i)
ε_(x)		MSE in Figures 2–5 is rounded to 10⁻¹, 10⁻², 10⁻³, …, 10⁻ⁱ in stepwise error, to 10⁻¹, 10⁻², 10⁻³ in 3-step error, and to 10⁻¹, 10⁻², 10⁻³, 10⁻⁴ in 4-step error
γ	Probability to choose additive increase among increase processes	0 or an indicated constant in range from 0 to 1	This γ is used only in Figure 1l–m. 0 in other Figures.
Initial ratio	Ratio of each factor in the total at the initial setting	Even distribution in Figures 1–2. Expression ratio of genes from RNA-seq data in Figures 3–5.
Initial value of a branch in a pair		1 in Figures 1–2. Initial ratios are summed for genes in the branch of the pair, multiplied by the number of genes (11,281), and rounded to make an integer value, in Figure 5.

In the learning pair model, the values of two factors, x_A and x_B, repeat stochastic processes of increase and decrease. In the increase process, either A or B is selected, and the value of the selected factor increases by one. In the increase process, competitive amplification or additive increase is chosen at (1 - γ): γ ratio. In competitive amplification, A or B is selected at the (x_A + β_A): (x_B + β_B) ratio. In the additive increase, A or B is selected at a 1: 1 ratio. In the decrease process, x decreases by decay at a probability depending on the error value ε.

In the first model, x_A and x_B are assumed to change at a fixed probability (Figure 1a–c). Either x_A or x_B is selected at a 1: 1 ratio for the increase, and the decay probability is fixed at ε = 0.1 or 0.01. Simulation results showed similar values for x_A and x_B (Figure 1a). If the probability of increase in x_B is twice that in x_A (Figure 1b) or if the decay probability in x_A is twice that in x_B (Figure 1c), the x_A/x_B ratio approached the target ratio, 0.5. However, these conventional models with individualized probabilities require something that determines the appropriate parameter settings.

In the second model (Figure 1d), the decay probability is the same for x_A and x_B but changes over time, taking a value that is the mean squared error (MSE) between the current and target ratios: \(\epsilon_{(x)} = \mathrm{MSE} = \left(\frac{x_A}{x_A + x_B} - \frac{T_A}{T_A + T_B}\right)^2\). Regarding the increase, either x_A or x_B is selected at a 1: 1 ratio. The dynamics of x_A and x_B exhibited a pattern similar to predator-prey dynamics in ecology, in which the fluctuation in the number of prey x_B slightly preceded that of predator x_A.

In the third model (Figure 1e–f), x_A and x_B are assumed to increase by competitive amplification, in which either x_A or x_B is selected at a ratio of (x_A + β_A): (x_B + β_B) to increase by one, where bias β_A = β_B = 1. When the decay probability ε = 0.1 or 0.01, x_A and x_B fluctuated with a switching pattern in which either A or B dominated transiently (Figure 1e). When ε is as low as 0.001, the x_A/x_B ratio persisted at a certain value that was stochastically determined at early time points (Figure 1f).

In the fourth model (Figure 1g–h), x_A and x_B are assumed to increase by competitive amplification as in the third model, and to decrease by decay with a probability given by the MSE between the current and target ratios as in the second model. The simulation results showed that the x_A/x_B ratio approached the target ratio of 0.5. Some deviations observed at 10⁴ repeats were reduced after 10⁵ repeats of stochastic processes (Figure 1h). This model was applicable to other target ratios (Figure 1g) without tuning parameters. The repetition of competitive amplification and MSE-dependent decay constitutes a system that autonomously learns the target ratio through trial and error (Figure 1i). The epigenomic regulation of chromatin modification can be interpreted as a competitive amplification process (Table 2) ¹².
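The fourth model can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code in the supplementary files: the function names, the order of the increase and decay steps within one repeat, and the handling of an empty pair (treated as an even split) are choices made here for illustration only.

```python
import random


def fraction_error(x_a, x_b, t_a, t_b):
    """Squared error between the current and target fractions of factor A."""
    total = x_a + x_b
    # Assumption: an empty pair (0/0) is treated as an even split.
    current = x_a / total if total > 0 else 0.5
    return (current - t_a / (t_a + t_b)) ** 2


def binomial(n, p, rng):
    """Number of successes in n Bernoulli(p) trials (standard library only)."""
    return sum(1 for _ in range(n) if rng.random() < p)


def simulate_pair(t_a=1, t_b=2, repeats=10_000, alpha_inc=0.1,
                  alpha_dec=0.1, beta=1.0, seed=None):
    """Repeat competitive amplification and MSE-dependent decay for one pair."""
    rng = random.Random(seed)
    x_a, x_b = 1, 1  # initial values, as in Table 1
    for _ in range(repeats):
        # Increase process: entered with probability alpha_inc.
        if rng.random() < alpha_inc:
            w_a, w_b = x_a + beta, x_b + beta
            # Competitive amplification: select A or B at (x_A + b):(x_B + b).
            if rng.random() < w_a / (w_a + w_b):
                x_a += 1
            else:
                x_b += 1
        # Decrease process: each unit survives with probability 1 - alpha_dec * MSE.
        survive = 1.0 - alpha_dec * fraction_error(x_a, x_b, t_a, t_b)
        x_a = binomial(x_a, survive, rng)
        x_b = binomial(x_b, survive, rng)
    return x_a, x_b


if __name__ == "__main__":
    x_a, x_b = simulate_pair(seed=0)
    print(x_a, x_b, x_a / max(x_b, 1))
```

Because the decay probability is shared by A and B and depends only on the current error, no per-factor parameter has to be tuned; only `t_a` and `t_b` change when a different target ratio is tested.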

Table 2

Assumptions in the learning hierarchical-pair model are supported by biological knowledge.
Model assumptions	Biological findings	Regulation
Competition	A transcription factor chooses a binding locus among candidates, depending on the openness ratio of the chromatin.	Epigenomic
Amplification	Transcriptional coactivators with histone acetyltransferase activity relax the chromatin structure. Transcription opens the chromatin, and the open chromatin structure induces transcription.	Epigenomic
Bias (no extinction) Additive increase	Whole genome in every somatic cell. Conventional genetic regulation of transcription.	Genetic
Error (approximated)-dependent decay	Rough evaluation of the current state. Histone deacetylases and DNA methyltransferases close the chromatin structure. Non-coding RNA-dependent cleavage.	Dependent on cell and environment. Feedback from the current fitness.
Hierarchical-pair architecture	Signal transduction cascades for gene expression	Genetic
Competitive amplification	Active and expressed cascades are preferentially selected and activated. Kinase is activated by phosphorylation at multiple sites.	Cell-type dependent Post-translational
Error-dependent decay	Dephosphorylation. Polyubiquitin dependent degradation.	Dependent on cell and environment

Epigenetic regulation, which is highly variable depending on cell type, can be interpreted as a process of competitive amplification. Conventional genetic regulation and transcription at a fixed rate are included in the bias term. The decay rate is roughly regulated at several levels by the fitness of the current expression pattern. Hierarchical pairs are genetically determined and consistent in all cell types. Note that the target pattern is used to passively quantify the fitness of the generated current pattern.

Amplification may induce a large difference in the value of each factor through exponential growth, making a factor all or nothing. In non-competitive amplification, in which either A or B is selected at a 1: 1 ratio and the selected term increases by x_A + 1 or x_B + 1, x_B reached a much higher value than x_A (Figure 1j). When the bias β in competitive amplification is not 1 but rather 10⁻⁷ (which is almost equivalent to 0 and avoids the 0/0 error in processing), x_A decreased to 0 in six of the ten tests (Figure 1k and 1l left). Interestingly, the x_A/x_B ratio approached the target ratio in the other four tests. Competition and the addition term of bias are required to avoid extinction in amplification.

In our previously reported immune response model, three processes were assumed to occur during changes in the interaction intensity or cell number: competitive amplification (proliferation), regulated reduction (dissociation), and additive increase (migration) ¹³. Based on that model, a process of additive increase, in which either x_A or x_B is selected at a 1: 1 ratio to increase by one, is chosen at a probability γ in an increase process (Figure 1l). The condition γ = 0 is equivalent to that in Figure 1k, whereas γ = 1 is equivalent to that in Figure 1d. As γ is set to a lower value, the x_A/x_B ratio after 10⁵ repeats shifted from 1 toward 0.5 (target ratio). When γ is negligibly low, x_A sometimes disappeared. When the additive increase is chosen at low probabilities (γ = 0.01, 0.1), the x_A/x_B ratio approached the target ratio (Figure 1l–m).

The learning process can be explained as follows. While the MSE and decay probability are large, the x_A/x_B ratio fluctuates over the full range, with extinction avoided by the bias or additive increase (Figure 1e). The x_A/x_B ratio is improved on average by the error-dependent decay, which acts as a random walk whose step size becomes smaller as the ratio gets closer to the target. When the x_A/x_B ratio approaches the target ratio, the ratio persists because A or B increases at an almost ideal ratio, as set in Figure 1b. In the main simulation hereafter, competitive amplification implies selecting A or B at a (x_A + 1): (x_B + 1) ratio to increase by one with β = 1, γ = 0. This is designated as a learning pair process.

Hierarchical pairs and approximated MSE

To regulate gene expression, more than two factors must be controlled. When control of the ratios of four or eight factors by competitive amplification and MSE-dependent decay is examined, the value ratios of eight factors in a single list failed to approach the target ratios (Figure 2a–b). The eight factors can be divided into seven pairs in three layers (Figure 2c–d). The fraction of each factor in the total is calculated as the product of the ratios in all pairs that include the factor. When the values in each pair independently change by the stochastic learning pair process, eight factors successfully approached the target ratio after 10⁵ repeats (Figure 2c).
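The calculation of each factor's fraction from the hierarchical pairs can be illustrated as follows. This is a sketch under assumptions: the nested-tuple representation `(x_a, child_a, x_b, child_b)`, the function name, and the factor names `f1` to `f4` are hypothetical, and an empty pair is treated as an even split.

```python
def leaf_fractions(node, share=1.0, out=None):
    """Return {factor: fraction of total} for a tree of pairs.

    A node is either a factor name (str) or a tuple
    (x_a, child_a, x_b, child_b) holding the values of the two branches.
    The fraction of a factor is the product of the branch ratios in every
    pair on the path from the top pair down to that factor.
    """
    if out is None:
        out = {}
    if isinstance(node, str):
        out[node] = share
        return out
    x_a, child_a, x_b, child_b = node
    total = x_a + x_b
    # Assumption: an empty pair splits its share evenly.
    ratio_a = x_a / total if total > 0 else 0.5
    leaf_fractions(child_a, share * ratio_a, out)
    leaf_fractions(child_b, share * (1.0 - ratio_a), out)
    return out


# Four factors arranged as three pairs in two layers (hypothetical values).
tree = (1, (1, "f1", 3, "f2"),
        3, (1, "f3", 1, "f4"))
fractions = leaf_fractions(tree)
print(fractions)
```

With n factors the tree contains n - 1 pairs, so eight factors give the seven pairs mentioned above, and the fractions always sum to one because each pair only splits the share it receives.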

Next, the required accuracy of the MSE is tested because accurate detection of errors is difficult in vivo. When the MSE between the current and target ratios is accurately calculated, 64 factors approached the target ratio, which is set as a linear distribution in the range of 1–64 (correlation coefficient between the target and result ratios after 10⁵ repeats, r, was 0.99, Figure 2e–f). As an approximation of the MSE, the calculated MSE is rounded to 10⁻¹, 10⁻², 10⁻³, …, 10⁻ⁱ, where i is a positive integer, in stepwise error. When this approximated error is used, the correlation between the result and target ratios decreased but remained high (r = 0.95, Figure 2e–f). By setting a maximum value for i, which determines the lower limit of the stepwise error, five additional types of approximated MSE are compared (6-, 5-, 4-, 3-, and 2-step error in Figure 2e, g). The results indicated that at least 3-step error was required for learning (median r = 0.89) and that the stepwise error was almost equivalent to 5-step error (median r = 0.95). The approximation of the MSE decreased the learning accuracy, but multiple factors in the model using stepwise detection of error approached the target ratio to an acceptable level.

Next, the relationship between indexes for making pairs and targets is randomly shuffled. The ratios of each factor after 10⁵ repeats approached the target ratios to an almost equivalent level to that without shuffling (Figure 2e, h). Furthermore, 4,096 = 2¹² factors approached the target that is set to shuffled values ranging from 1 to 4,096 (r = 0.97–0.98 with accurate MSE and r = 0.84–0.91 with stepwise error in five tests, Figure 2i–j).

When the gene expression data from bacteria without antibiotics (GSM2538622 RNA-seq dataset) ¹⁴ are used as the target ratio, the ratio of 4,096 factors changed from the initial even-distribution to the expression pattern after 10⁵ repeats of stochastic processes with stepwise error (r = 0.98, Figure 2k). Subsequently, when the target ratios are reset to the gene expression pattern observed in the presence of antibiotics (Figure 2l) ¹⁴, the ratio of 4,096 factors changed from the pattern without antibiotics to the new target pattern (Figure 2m). Thus, bacteria may autonomously produce proper gene expression patterns by reducing error caused by antibiotics.

Hierarchical clustering of human genes

I next apply the learning hierarchical-pair process to human gene expression. In advance, it is necessary to set the genes that are paired. Six hierarchical clustering analysis methods (Figure 3a–c), namely Ward, WCO, Single, and three newly developed methods (AreaSum, CvSum, and Cvarea), are applied to a total of 16,921 genes in 20 differently labeled cells from preimplantation human embryos, human embryonic stem cells, and downstream early mesoderm and endoderm progenitors (scRNA-seq datasets E-MTAB-3929, GSM2257302, and GSE75748) ^15–17. The number of layers in the hierarchical pairs generated by the AreaSum method was the smallest (27), whereas that generated by the Single method was the largest (10,796) (Figure 3c). In the AreaSum method, the area formed by two vectors from the origin is calculated as the distance between two genes, and the total gene expression level is used as the representative value of the cluster (Figure 3a–b).
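One possible reading of the AreaSum distance is sketched below. It rests on assumptions not confirmed by the text: the area is taken as that of the parallelogram spanned by the two expression vectors (the triangle area would differ only by a factor of one half), the cluster representative is taken as the element-wise sum of its members, and the function names are hypothetical.

```python
import math


def area_distance(u, v):
    """Area of the parallelogram spanned by vectors u and v from the origin.

    Uses the identity area^2 = |u|^2 |v|^2 - (u . v)^2, which holds in any
    number of dimensions (here, one dimension per cell).
    """
    uu = sum(a * a for a in u)
    vv = sum(b * b for b in v)
    uv = sum(a * b for a, b in zip(u, v))
    # Guard against tiny negative values caused by rounding.
    return math.sqrt(max(uu * vv - uv * uv, 0.0))


def merge_clusters(u, v):
    """Assumed 'Sum' representative: total expression of the merged cluster."""
    return [a + b for a, b in zip(u, v)]


# Two genes with proportional expression across cells span no area.
print(area_distance([1.0, 2.0], [2.0, 4.0]))
print(area_distance([3.0, 0.0], [0.0, 4.0]))
print(merge_clusters([1.0, 2.0], [2.0, 4.0]))
```

Under this reading, genes whose expression profiles are proportional across cells have zero distance and are paired first, regardless of their absolute expression levels.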

As another modification, the probability of entering a process of competitive amplification, α_inc, is set as a variable in the range of 0.001–0.101 depending on the coverage of the pair. This assumption is further modified in another model with an mRNA pool. As test data, another scRNA-seq dataset from human preimplantation embryos (GSE36552) is used ¹⁸. The initial and target ratios in each pair are set with the data of a zygote and a cell at the 4-cell stage, respectively. The correlation coefficient between the initial and target ratios has a median of r = 0.78 (range 0.67–0.84) in 12 tests (Figure 3d). For each pair, the stochastic processes of competitive amplification and decay using the stepwise-approximated MSE are repeated 10⁵ times.

The learning efficiency was compared among the six different hierarchical-pair architectures. The expression ratio most closely approached the target ratio when hierarchical pairs generated by the AreaSum method are used (r = 0.98, Figure 3c–d), with an even closer correlation than that of another 4-cell dataset in scRNA-seq (Figure 3e) ¹⁸. In pairs generated by the Single method, the expression ratio did not approach the target ratio (Figure 3c). These results indicate that the architecture of hierarchical pairs affects the ability to approach the target ratio. In contrast, even when the initial and target patterns are independently shuffled to test non-correlated artificial patterns, the expression ratio approached the target ratio (median r = 0.98, range 0.94–0.99 in six tests) in the hierarchical pairs generated by the AreaSum method (Figure 3f). Owing to the high adaptability of this learning process, it was difficult to validate the accuracy of gene pairing.

A model with a signal transduction cascade and an mRNA pool

I assume that the hierarchical-pair architecture is a signal transduction cascade that selects a gene for transcription in a model with an mRNA pool. Rather than using the parameter α_inc, a pair is stochastically selected at each repetition among pairs in the top seven layers, depending on the coverage of the pair. In the selected pair, competitive amplification is performed: branch A or B is selected at a ratio of (x_A + β): (x_B + β), where β = 1, and the value of the selected branch, x_A or x_B, increases by one. Additionally, the downstream pair of the selected branch enters the process of competitive amplification, and this is repeated until the selected branch is a leaf indicating a single gene. In the mRNA pool, the mRNA of the selected gene increases by one, randomly replacing one existing mRNA. Initially, 360,000 mRNAs in the mRNA pool are set based on the initial ratio (zygote). In addition to the mRNA ratio, the expression probability is calculated as the product of the ratios in the pairs that include the gene, which is equivalent to the expression ratio in the previous model without an mRNA pool. The ratios of mRNA and expression probability approached the target ratio (4-cell) after 5 × 10⁵ repeats (r = 0.95–0.97 and r = 0.97–0.99, respectively, in six tests), although genes with 0 to several mRNAs were plotted discretely in mRNA ratios (Figure 4a). Furthermore, even when the decay probability or MSE is approximated to three different values, 0.1, 0.01, and 0.001 (3-step error as in Figure 2e), similar changes approaching the target ratio were observed when setting the 4-cell data as the target (r = 0.94–0.98 in 12 tests, Figure 4b) and when shuffling the targets (r = 0.92–0.94 in six tests, Figure 4c).
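The cascade and the mRNA pool update can be sketched as below. This is a simplified illustration, not the code in Code File 4: the walk always starts at the top pair rather than at a pair chosen among the top seven layers by coverage, the decay step is omitted, and the `Pair` class, the function name, and the gene names are hypothetical.

```python
import random


class Pair:
    """One pair in the cascade: two branch values and two downstream nodes."""

    def __init__(self, child_a, child_b, x_a=1, x_b=1):
        self.children = [child_a, child_b]  # each is a Pair or a gene name
        self.x = [x_a, x_b]                 # activation values of the branches


def transcribe_once(root, pool, rng, beta=1.0):
    """Walk down the cascade by competitive amplification and update the pool."""
    node = root
    while isinstance(node, Pair):
        w_a = node.x[0] + beta
        w_b = node.x[1] + beta
        # Select branch A or B at the ratio (x_A + beta):(x_B + beta).
        branch = 0 if rng.random() < w_a / (w_a + w_b) else 1
        node.x[branch] += 1              # the selected branch is amplified
        node = node.children[branch]     # continue in the downstream pair
    # node is now a leaf (a single gene): add one mRNA of this gene by
    # replacing a randomly chosen mRNA, so the pool size stays constant.
    pool[rng.randrange(len(pool))] = node
    return node


rng = random.Random(0)
root = Pair(Pair("gene1", "gene2"), Pair("gene3", "gene4"))
pool = ["gene1", "gene2", "gene3", "gene4"] * 5
for _ in range(100):
    transcribe_once(root, pool, rng)
print({g: pool.count(g) for g in sorted(set(pool))})
```

Keeping the pool at a fixed size means that every transcription event also removes one existing mRNA, which is why the mRNA ratio lags behind the expression probability held in the pairs.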

To analyze the dynamics in the simulation, the initial and target ratios are set with zygote and blastocyst data. The mRNA ratio gradually approached the target blastocyst pattern over 10⁶ repeats, but not via the patterns of the 2-cell, 8-cell, or morula stages (Figure 4d). The expression probability more quickly and directly reached near the target ratio within 10⁴ repeats (Figure 4e), and similar correlation levels then persisted during 10⁴–10⁶ repeats. The mRNA levels of each gene approached the target level with fluctuations (Figure 4f). In a simulation, the dynamics of GATA3 were more similar to those of GATA2 than to those of DAB2, although the initial and target values of GATA3 and DAB2 are closer than those of GATA2. The higher correlation during stochastic fluctuations is explained by the hierarchical-pair architecture, where GATA2 and GATA3 are paired in the 7th layer from the top, whereas they are separated from DAB2 in the 2nd layer.

For gene regulation in a homeostatic state, the bias term β may not be required because the MSE or decay rate can be kept low. When both the initial and target ratios are set with the same 4-cell data, the mRNA ratio deviated from the pattern during 5 × 10⁵ repeats in the model with β = 10⁻⁷ and 3-step error (r = 0.66–0.88 in six tests, Figure 4g). In contrast, the mRNA ratio maintained the set pattern in the model with 4-step error for at least 5 × 10⁵ repeats (r = 0.98–0.99 in six tests), while the correlation between expression probability and the target ratio gradually decreased (r = 0.76–0.87, Figure 4h). When the initial state is set with a zygote and the target ratio is set with scRNA-seq data of the 2-cell stage, which has an expression pattern highly similar to that of a zygote (r = 0.94–0.96) ¹⁸, the mRNA ratios approached the target ratio, except for one case in six tests (median r = 0.98, range 0.51–0.98, Figure 4i). However, the change from a zygote to the 4-cell stage was poorly reproducible in the model with β = 10⁻⁷ (median r = 0.88, range 0.85–0.94 in six tests). In the absence of a bias term or additive increase, a homeostatic state with a similar expression pattern was maintained while allowing some limited changes in differentiation.

A common model for human gene expression

Based on these findings, I propose that a single model can control whole-gene expression during any differentiation process in human cells and evaluate this in early embryogenesis and hematopoiesis. To generate hierarchical pairs, I collect 13 scRNA-seq datasets from human tissues ^15–17,19–28, in which 11,281 gene names were commonly labeled in 11,803 cells. Using the relative expression ratio of these 11,281 genes in each cell, a hierarchical-pair architecture was generated using the AreaSum clustering method. This architecture contained 11,280 pairs in 22 layers (Supplementary Table 1).

The model with an mRNA pool, 4-step approximated error, and this hierarchical-pair architecture is applied to the regulation of 11,281 genes, setting bias β = 10⁻⁷ or 1 depending on the situation. When the initial state of the pairs and mRNA pool is set with zygote scRNA-seq data and the target ratio is changed in the order of zygote, 2-cell, 4-cell, 8-cell, morula, and blastocyst stages every 5 × 10⁵ repeats, the ratio in the mRNA pool dynamically approached the target ratios until the 4-cell stage in the model with β = 10⁻⁷ (Figure 5a–b). When the model with β = 1 is applied after 1.5 × 10⁶ repeats, the gene expression patterns sequentially approached the 4-cell, 8-cell, morula, and blastocyst patterns with a correlation coefficient of more than 0.95 at the peaks (Figure 5c–d).

In hematopoiesis, multi-lymphoid progenitors (MLPs) differentiate into B cells or T cells in peripheral blood mononuclear cells (PBMCs), whereas granulocyte-macrophage progenitors (GMPs) differentiate into myeloid cells ²⁹. When the initial state and target ratio are set with a progenitor, the mRNA ratios were maintained during 5 × 10⁵ repeats in the model with β = 10⁻⁷ (r = 0.97–1.0 in six tests, Figure 5e–g). The expression patterns in progenitors are largely different from those in PBMCs ³⁰ (median r = 0.29, range 0.15–0.57 in 9 tests, Figure 5h). When the target ratio is changed to a PBMC pattern and β is set to 1, the mRNA ratio approached the target ratio during the next 5 × 10⁵ repeats (r = 0.86–0.97), with more rapid adaptation in the expression probability (Figure 5e–f, i). The mRNA ratio, but not the expression probability, further approached the target ratio during the following 5 × 10⁵ repeats in the model with β = 10⁻⁷ (r = 0.97–0.99, Figure 5e–f, j). These results demonstrate that the learning hierarchical-pair model using one common architecture can reproduce both various differentiation processes and non-permanent homeostasis, with bias terms added for the former.

I propose a principle underlying whole gene regulation within cells, which includes learning ability and a common architecture of gene regulation. The learning ability is implemented as a repeat of two stochastic processes: competitive amplification in a pair and decay depending on MSE between the current and target ratios. The hierarchical structure of the pairs enables multiple factors to reach any target ratio.

In this model, the expression of each gene is regulated by itself, in contrast with conventional GRNs in which each gene is regulated by other genes (Figure 6). The simplicity of self-regulation is critical for increasing the number of regulated genes in modeling, actual evolution, and organizing complex systems. Importantly, the simple self-regulation system is not uncontrollable but rather efficient at generating proper diversity if the decay rates are appropriately regulated. When n genes each change their expression among L levels, an astronomically large number of patterns, Lⁿ, may exist. Conventional models set m master regulators (m << n) that control cell types and generate L^m states. The number L^m is extremely large, larger than the number of cell types we can understand, but it covers a negligibly small part of the Lⁿ space. In my model, by using the four decay rates in n - 1 pairs, only 4(n - 1) regulations are sufficient to generate any appropriate pattern. The increase probability of each factor in competitive amplification is autonomously tuned to the correct ratio, x_i / Σx_j. Thus, amplification and stochasticity, which are misunderstood as interfering with strict control at a specific level, are essential for complex systems. I propose that homeostasis, in which a cell keeps its expression pattern while its contents are metabolized, is not a result of complicated GRNs but a basic operating system shared by living things. This homeostatic system, which I refer to as the law of biological inertia, contains the learning process (Figure 6c).

Biological knowledge of gene regulation is consistent with the assumptions in the learning hierarchical-pair model (Table 2). The first assumption, competition, is supported by the epigenomic regulation of transcription ¹². An RNA polymerase or transcription factor chooses a binding locus among candidates, depending on the local openness ratio of the chromatin. Note that the binding candidates are genetically determined, as discussed in the next paragraph. The second assumption, amplification, is supported by positive feedback in the epigenomic regulation. The binding of transcription factors opens the chromatin at the locus, using cofactors with histone acetyltransferase activity. The third assumption, additive increase using a bias term, is supported by the fact that all somatic cells have the whole genome. The fourth assumption, decay rates dependent on the error between the current and target ratios, is not clearly documented as gene regulation, but histone deacetylases and DNA methyltransferases close the chromatin structure. Non-coding RNAs degrade a group of mRNAs with a particular sequence. Biological stress may increase the decay rates. Although many studies are required to demonstrate the molecular basis of the learning process, the assumptions are applicable to cells.

The hierarchical-pair architecture, the fifth assumption, is also supported by findings on signal transduction cascades for gene expression (Table 2). In the conventional view of signal transduction cascades, multimerization of specific receptors is assumed to deterministically trigger activation of a signal cascade to express a set of genes. However, the cascades and the induced genes vary depending on the cell type, which reflects the current expression and activation state. Active branches in the cascades may be preferentially used, just like the stochastic competitive amplification in the assumption. Further, many signal-transducing proteins are kinases that are activated by phosphorylation at multiple sites. Decay of activation is regulated by phosphatases and polyubiquitin ligases. The architecture of possible signal transduction, which is genetically determined by 3D molecular structure and promoter sequence, should be discriminated from the branch activity, which is regulated epigenetically or post-translationally (Table 2). In my model, the former genetic regulation is set as a common hierarchical-pair architecture conserved in all cells, whereas the latter epigenetic regulation is dynamically controlled following the basic law.

The learning hierarchical-pair model differs from gene regulation in vivo in several aspects. First, the bias β or additive increase should be controlled. Bias β_A and β_B may differ for each branch in each pair. Finely regulated bias is equivalent to an additive increase in gene or module activity, in which conventional deterministic regulations can be included. I propose that this additive increase contributes little to gene expression in homeostatic states, but is transiently and roughly required during differentiation. Second, the calculation of the approximated error values from the target and current ratios is highly simplified. Pairs of crucial genes may be controlled more strictly, whereas many other pairs are controlled less strictly. Using scRNA-seq data as the target ratio, I show that gene expression reaches acceptable patterns for the cell. Third, the hierarchical pairs of genes generated by the AreaSum clustering method (Supplementary Table 1) should be revised to a true architecture. There is no evidence that my pairing is correct, because the shuffling of gene pairs did not significantly affect learning efficiency in the model with β = 1. Formally, forming a pair is equivalent to reducing the number of dimensions by one. The molecular biology of gene regulation, big data obtained by RNA-seq, and simulations and clustering using supercomputers may reveal the single correct gene-regulation architecture, which might be as useful as the periodic table of the elements in chemistry.

If the complexity of living organisms requires a template in order to increase, the increase of the template can be formulated as competitive amplification, and death as error-dependent decay. A tissue composed of numerous cell types regulates the cell ratio through proliferation (competitive amplification) and apoptosis (decay). In the immune system, we previously proposed that regulatory T cells, which are crucial for immune suppression, reduce the decay probability and can be redefined as an indicator of low error ¹³. The law of biological inertia will provide insights for understanding various complex systems.

Computation

The simulation is performed using Python 3.7. Four code files are available in the supplementary files: the learning hierarchical-pair model (Code File 1), clustering of genes (Code File 2), the model of human gene expression (Code File 3), and the model with an mRNA pool (Code File 4). I perform Monte Carlo simulations in which the stochastic processes of increase and decrease are repeated 10⁴–10⁶ times for each pair, as explained in Table 1. In the learning pair model, competitive amplification selects either factor A or B in each pair at a ratio of A:B = (x_A + β_A):(x_B + β_B) and increases the value of the selected factor, x_A or x_B, by one; the bias β is 1 unless indicated to be 10⁻⁷. The values then decay at an MSE-dependent probability. In “shuffle”, the factor indexes for each target value and for the location in the hierarchical pairs are randomly shuffled to set randomized target values in the hierarchical pairs. The total expression ratio of each factor is calculated as the product of the ratios in all pairs containing the factor. In the text, assumptions and settings are written in the present tense, whereas simulation results are written in the past tense.
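
The two stochastic processes for a single pair can be sketched as follows. This is a minimal sketch and not the code in Code File 1: the function names are hypothetical, the error is taken as the squared difference between the current and target ratios of factor A, and the decay is assumed to remove each unit of x_A and x_B independently with a probability equal to that error (the exact rule is the one given in Table 1).

```python
import random


def amplify(x_a, x_b, beta_a=1.0, beta_b=1.0, rng=random):
    """Competitive amplification: pick A or B at ratio (x_a+beta_a):(x_b+beta_b), add one."""
    w_a = x_a + beta_a
    w_b = x_b + beta_b
    if rng.random() * (w_a + w_b) < w_a:
        return x_a + 1, x_b
    return x_a, x_b + 1


def squared_error(x_a, x_b, target_a):
    """Squared difference between the current and target ratios of A (assumed error)."""
    total = x_a + x_b
    if total == 0:
        return 0.0  # nothing present, so there is nothing to decay
    return (x_a / total - target_a) ** 2


def decay(x, p, rng=random):
    """Assumed decay rule: each of the x units is lost independently with probability p."""
    return x - sum(1 for _ in range(x) if rng.random() < p)


def simulate_pair(target_a, steps=1000, beta=1.0, seed=0):
    """Repeat amplification and error-dependent decay for one pair."""
    rng = random.Random(seed)
    x_a = x_b = 0
    for _ in range(steps):
        x_a, x_b = amplify(x_a, x_b, beta, beta, rng)
        p = squared_error(x_a, x_b, target_a)
        x_a = decay(x_a, p, rng)
        x_b = decay(x_b, p, rng)
    return x_a, x_b
```

Calling `simulate_pair(0.8, steps=1000)` returns the final pair of values; under these assumptions the values accumulate only while the current ratio stays close to the target, because a large error removes most units and lets the ratio drift again.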

Approximation of error

The MSE is calculated for each pair from the difference between the current and target ratios. In stepwise error, the value is expressed in exponential notation with a base of 10, the mantissa is rounded to 1, and only the exponent is used as the level of error and as the decay probability. Accordingly, the stepwise error takes a value \(\in \left\{{10}^{-1}, {10}^{-2}, {10}^{-3}, \dots , {10}^{-i},\dots \right\}\), where i is a positive integer. In 6-, 5-, 4-, 3-, and 2-step errors, the lower limit of the error value is set to 10⁻⁶, 10⁻⁵, 10⁻⁴, 10⁻³, and 10⁻², respectively. Whereas the stepwise error can take arbitrarily small values approaching zero, the 6-step error takes one of six values, from 10⁻¹ to 10⁻⁶, and the 2-step error takes one of two values, 10⁻¹ or 10⁻². The code is available in Code File 1.
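
The approximation can be illustrated with a short function. This is a sketch under stated assumptions, not the implementation in Code File 1: the name `stepwise_error` and its parameter `n_steps` are hypothetical, "rounding the mantissa to 1" is read as keeping only the floor of the base-10 exponent, values of 10⁻¹ or larger are capped at 10⁻¹, and an error of exactly zero is returned as zero when no lower limit is set.

```python
import math


def stepwise_error(err, n_steps=None):
    """Approximate an error value by a power of ten.

    err     -- raw error, for example the squared difference of two ratios
    n_steps -- None for the unlimited stepwise error, or n for the n-step
               error whose lower limit is 10**-n
    """
    if err <= 0.0:
        # Assumption: a zero error has no exponent, so use the lower limit if any.
        return 0.0 if n_steps is None else 10.0 ** -n_steps
    exponent = math.floor(math.log10(err))  # keep the exponent, drop the mantissa
    exponent = min(exponent, -1)            # assumed cap at 10**-1
    if n_steps is not None:
        exponent = max(exponent, -n_steps)  # lower limit of the n-step error
    return 10.0 ** exponent
```

For example, an error of 3.2 × 10⁻³ is approximated as 10⁻³, and an error of 2 × 10⁻⁹ is raised to 10⁻⁶ in the 6-step error.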

Hierarchical clustering analysis

The hierarchical pairs in the learning hierarchical-pair model indicate groups of genes with similar expression patterns that might be controlled by a particular regulation module. To generate optimal hierarchical pairs for the model, six hierarchical clustering analysis methods are compared.

Hierarchical clustering analysis repeats the following two calculations until a pair containing all genes is created: 1) pairing the two genes or clusters with the smallest distance, and 2) calculating the distances to the new cluster of genes. The Ward method uses the Euclidean distance. The WCO method uses the cosine distance, which takes a high value when the correlation is low, together with the “weighted” method, that is, the Weighted Pair Group Method with Arithmetic Mean (WPGMA). The Single method uses the Euclidean distance together with the “single” method, which selects the nearest points in the clusters. These three clustering methods are available in scipy.cluster.hierarchy.linkage in the SciPy library for Python. The three new clustering methods, AreaSum, CvSum, and Cvarea, are available in Code File 2. In these three methods, the total expression of the genes in a cluster is used as the representative value of the cluster. This is appropriate because pairing in the learning hierarchical-pair model is equivalent to separation into two subgroups. In the AreaSum method, the area between the two vectors from the origin to the values of the clusters is used as the distance. A small angle indicates a constant expression ratio among different cells. A large vector size allows genes with high expression to skip many layers in the hierarchical pairs. The two clusters with the smallest distance are paired. In the CvSum method, the total expression level of the genes in the pair, including both branches, is summed for every cell, and the coefficient of variation (cv) of the summed value among cells is used as the distance between the two clusters. Family genes with functional substitutability can thus be paired. In the Cvarea method, the product of the area and the cv is used as the distance between the two clusters.
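
The three new distances and the pairing loop can be sketched in plain Python. This is an illustrative sketch and not the code in Code File 2: the function names are hypothetical, the "area between two vectors" is assumed to be the area of the triangle spanned by the two vectors from the origin, the cv is computed with the population standard deviation, and the naive search over all cluster pairs is only practical for small examples.

```python
import math
from statistics import mean, pstdev


def area_distance(u, v):
    """Assumed AreaSum distance: area of the triangle spanned by u and v from the origin."""
    uu = sum(a * a for a in u)
    vv = sum(b * b for b in v)
    uv = sum(a * b for a, b in zip(u, v))
    return 0.5 * math.sqrt(max(uu * vv - uv * uv, 0.0))


def cv_sum_distance(u, v):
    """CvSum distance: coefficient of variation of the summed expression among cells."""
    summed = [a + b for a, b in zip(u, v)]
    m = mean(summed)
    return pstdev(summed) / m if m > 0 else 0.0  # assumption: no expression, no variation


def cv_area_distance(u, v):
    """Cvarea distance: product of the area and the cv."""
    return area_distance(u, v) * cv_sum_distance(u, v)


def pair_hierarchically(profiles, distance=area_distance):
    """Pair genes or clusters until one pair contains all genes.

    profiles -- dict mapping a gene name to its expression values, one per cell
    Returns a nested tuple of gene names describing the hierarchical pairs.
    """
    if not profiles:
        raise ValueError("at least one gene is required")
    clusters = {name: list(values) for name, values in profiles.items()}
    while len(clusters) > 1:
        keys = list(clusters)
        _, i, j = min(
            (distance(clusters[keys[i]], clusters[keys[j]]), i, j)
            for i in range(len(keys))
            for j in range(i + 1, len(keys))
        )
        a, b = keys[i], keys[j]
        # The representative value of the new cluster is the total expression.
        merged = [x + y for x, y in zip(clusters.pop(a), clusters.pop(b))]
        clusters[(a, b)] = merged
    return next(iter(clusters))
```

With `area_distance`, two genes whose expression is proportional across cells have a distance of zero and are paired first, which matches the statement that a small angle indicates a constant expression ratio.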

In Figure 3, hierarchical clustering analyses are applied to the expression of 16,921 genes in 20 cells. To generate the hierarchical-pair architecture used in Figures 3d, 3f, and 4, the AreaSum method is applied to this dataset. For the hierarchical-pair architecture used in Figure 5, the AreaSum method is applied to the expression of 11,281 genes in 11,803 human cells. The gene list and hierarchical clustering are available in Supplementary Table 1.

Learning hierarchical-pair model with an mRNA pool

In the model in Figures 4–5, the hierarchical pairs are assumed to be signal transduction cascades that select a gene in an mRNA pool, similar to a Monte Carlo tree search. At each repetition, a pair is stochastically chosen among the pairs in the top seven layers depending on the coverage of the pair. In the selected pair, competitive amplification is performed: branch A or B is selected at a ratio of (x_A + β):(x_B + β), where the bias β is 1 unless indicated to be 10⁻⁷, and the value of the selected branch, x_A or x_B, increases by one. The downstream pair of the selected branch then enters the same process of competitive amplification, and this continues until the selected branch indicates a single gene. In the mRNA pool, the mRNA of the selected gene increases by one, randomly replacing one existing mRNA. Initially, 360,000 mRNAs in the mRNA pool are set based on the initial ratio, in addition to the ratios in each pair. The values in each pair decrease by MSE-dependent decay every 10 repetitions on average. The expression probability is calculated as the product of the ratios in all pairs that contain the gene. The code is available in Code File 4.
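
The descent through the hierarchical pairs and the update of the mRNA pool can be sketched as follows. This is a simplified sketch and not the code in Code File 4: the class `Pair` and the function names are hypothetical, the descent always starts at the top pair instead of choosing a start pair among the top seven layers, the MSE-dependent decay is omitted, and the probability is computed from the β-adjusted ratios used for selection.

```python
import random


class Pair:
    """One pair in the hierarchy; each child is another Pair or a gene name."""

    def __init__(self, child_a, child_b):
        self.children = [child_a, child_b]
        self.x = [0, 0]  # activation values x_A and x_B


def select_gene(node, beta=1.0, rng=random):
    """Repeat competitive amplification downstream until a single gene is reached."""
    while isinstance(node, Pair):
        w_a = node.x[0] + beta
        w_b = node.x[1] + beta
        branch = 0 if rng.random() * (w_a + w_b) < w_a else 1
        node.x[branch] += 1  # the selected branch increases by one
        node = node.children[branch]
    return node


def update_pool(pool, gene, rng=random):
    """Add one mRNA of the selected gene by replacing a randomly chosen mRNA."""
    pool[rng.randrange(len(pool))] = gene


def selection_probability(node, gene, beta=1.0):
    """Product of the branch ratios in all pairs that contain the gene."""
    if not isinstance(node, Pair):
        return 1.0 if node == gene else 0.0
    weights = [node.x[0] + beta, node.x[1] + beta]
    total = weights[0] + weights[1]
    # Only the branch that contains the gene contributes a non-zero term.
    return sum(
        weights[i] / total * selection_probability(node.children[i], gene, beta)
        for i in (0, 1)
    )
```

A repetition then consists of `gene = select_gene(root)` followed by `update_pool(pool, gene)`, where `pool` is a list of gene names with one entry per mRNA.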

Resource datasets

For bacterial genes (Figure 2), RNA-seq data from Escherichia coli (BWk3) (GSE96706) are used ¹⁴. Among the 4296 genes, the 4096 genes with expression values greater than 2^1.9 under at least one culture condition in the dataset are selected. The genes are set to form a hierarchical-pair architecture using their order within the genome. GSM2538622 (1A), GSM2538631 (10A), and GSM2538649 (27A) are used as data without antibiotics, with kanamycin, and with ciprofloxacin, respectively.
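
The selection and the order-based pairing can be illustrated as follows. This is a sketch under assumptions that the text does not state: the function names are hypothetical, and the architecture is assumed to be built by repeatedly pairing neighbours in genome order, which for 4096 = 2¹² genes would give 12 layers of pairs.

```python
def select_expressed(expression, threshold=2 ** 1.9):
    """Keep genes whose expression exceeds the threshold under at least one condition.

    expression -- dict in genome order mapping a gene name to its expression
                  values, one per culture condition
    """
    return [gene for gene, values in expression.items() if max(values) > threshold]


def pair_by_order(genes):
    """Pair neighbouring genes, then neighbouring pairs, until one pair remains."""
    level = list(genes)
    if not level:
        raise ValueError("at least one gene is required")
    while len(level) > 1:
        paired = [(level[i], level[i + 1]) for i in range(0, len(level) - 1, 2)]
        if len(level) % 2:
            paired.append(level[-1])  # an unpaired last item moves up one layer
        level = paired
    return level[0]
```

For four genes in the order a, b, c, d, `pair_by_order` returns `(("a", "b"), ("c", "d"))`.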

For human early embryogenesis (Figures 3–4), 20 differently labeled cells are selected from three scRNA-seq datasets (E-MTAB-3929, GSM2257302, and GSE75748) for gene expression in preimplantation embryos, in vitro-cultured embryonic stem cells, and the downstream early mesoderm and endoderm progenitors ^15–17. 'E3.1.443', 'E4.1.1', 'E5.1.26', 'E6.1.72', and 'E7.2.138' are selected from E-MTAB-3929. 'APS.p1c1r2', 'D2_25somitomere.p9c1r1', 'DLL1PXM.p8c1r1', 'Earlysomite.p10c2r8', 'H7hESC.p7c1r4', 'LatM.p3c1r1', 'MPS3.p5c1r1', 'Sclerotome.p2c1r1', and 'cDM.p4c1r1' are selected from GSM2257302. 'H9.00hb4s_001', 'H9.12h_001', 'H9.24h_013', 'H9.36h_001', 'H9.72h_001', and 'H9.96h_001' are selected from GSE75748. Hierarchical clustering analyses are applied to the 16,921 genes expressed at more than 10 TPM in at least one cell. For the test data, another scRNA-seq dataset of human preimplantation embryos (GSE36552) is used after matching the 16,921 genes by gene name and assigning 0 as the expression level of non-annotated genes ¹⁸. GSM896806, GSM896809, GSM922146, GSM922158, GSM922178, and GSM922194 are used for scRNA-seq data of the zygote, 2-cell, 4-cell, 8-cell, morula, and blastocyst stages, respectively. In Figure 3c, 3 zygote and 12 4-cell datasets are used.

To generate a common hierarchical-pair architecture in Figure 5 and Supplementary Table 1, 13 scRNA-seq datasets of human tissues are used: 515 peripheral blood cells (GSE97531) ¹⁹, 836 hematopoietic stem and progenitor cells in the bone marrow, spleen, and peripheral blood (GSE143567) ²⁰, 1567 trophoblast and stromal cells from the placenta (GSE89497) ²⁷, 559 cardiomyocytes (GSE95140_human) ²³, 2148 endometrium cells from the uterus (GSE111976) ²⁵, 766 renal cells from kidney biopsy (GSE160048_human) ²¹, 91 fallopian tube epithelial cells (GSE132149_sc16) ²⁶, 2036 retina cells from the eyes (GSE133707_P1) ²², 134 primordial germ cells from a female embryo at 10 weeks of gestation (GSM2295850) and from a male embryo at 25 weeks of gestation (GSM2306040) ²⁸, 372 in vitro-cultured primary myoblasts (GSE52529) ²⁴, 498 in vitro-cultured embryonic stem cells and early mesoderm progenitors (GSM2257302) ¹⁶, 758 in vitro-cultured embryonic stem cells and endoderm progenitors (GSE75748_sc_time_course_ec) ¹⁷, and 1529 cells from early preimplantation embryos (E-MTAB-3929) ¹⁵. Gene names are used, if available, to integrate the multiple datasets. If not available, the gene name is determined as the ‘symbol’ using the MyGene.py Python package. The code is included as comments in Code File 2. For the 11,281 genes successfully annotated in all 13 datasets (11,803 cells), gene expression ratios are recalculated by normalizing the total expression of the 11,281 genes to 1,000,000. The gene list and hierarchical clustering are available in Supplementary Table 1.
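
The integration and normalization steps can be illustrated with a small sketch. The function names and the data layout (one dictionary of gene name to expression value per cell) are assumptions for illustration and are not taken from Code File 2; the MyGene.py lookup is not reproduced here.

```python
def shared_genes(gene_lists):
    """Return the genes that are annotated in every dataset, in sorted order."""
    if not gene_lists:
        return []
    shared = set(gene_lists[0])
    for genes in gene_lists[1:]:
        shared &= set(genes)
    return sorted(shared)


def normalize_cell(cell, genes, total=1_000_000):
    """Rescale one cell so that the expression of the listed genes sums to `total`.

    cell  -- dict mapping a gene name to its expression value in this cell
    genes -- the shared gene list; genes missing from the cell count as 0
    """
    values = [cell.get(gene, 0.0) for gene in genes]
    summed = sum(values)
    if summed == 0:
        return {gene: 0.0 for gene in genes}
    return {gene: value * total / summed for gene, value in zip(genes, values)}
```

Genes outside the shared list do not enter the sum, so each normalized cell totals 1,000,000 over the shared genes only.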

For the test data in Figure 5, three sets are collected: 39 single-cell datasets of the zygote, 2-cell, 4-cell, 8-cell, morula, and blastocyst stages of the preimplantation human embryo from GSE36552 ¹⁸; 9 datasets of hematopoietic progenitors, including GMP, MLP, and lymphoid-primed multi-potential progenitors, in the human cord blood of normal donors from GSE100618 ²⁹; and 13 PBMCs from normal donors from GSE161901 ³⁰. Gene expression ratios are recalculated by normalizing the total expression of the 11,281 genes to 1,000,000. GSM896806, GSM896809, GSM922146, GSM922158, GSM922178, and GSM922194 are used for scRNA-seq data of the zygote, 2-cell, 4-cell, 8-cell, morula, and blastocyst stages, respectively ¹⁸. For PBMCs, cell types are determined based on the high expression of CD19 and IGHM (immunoglobulin heavy constant mu) for B lymphocytes, TRBC2 (T cell receptor beta constant 2) for T lymphocytes, and CD33 for myeloid cells. Among the test data, GSM2689351 (P5_E5_MLP) is used for MLP, GSM2689085 (P5_G11_GMP) for GMP, GSM4916527 (NormalDonor1_untreated_PBMC_027) for a B cell, GSM4916502 (NormalDonor1_untreated_PBMC_002) for a T cell, and GSM4916594 (NormalDonor1_untreated_PBMC_094) for a myeloid cell ^29,30. In the text, GSM2689298 (P3_F5_MLP) and GSM2689390 (P6_G6_MLP) are used for MLPs, GSM2689057 (P4_G9_GMP) and GSM2689102 (P6_C9_GMP) for GMPs, GSM4916525 (NormalDonor1_untreated_PBMC_025) and GSM4916536 (NormalDonor1_untreated_PBMC_036) for B cells, GSM4916573 (NormalDonor1_untreated_PBMC_073) and GSM4916783 (NormalDonor4_untreated_PBMC_001) for T cells, and GSM4916557 (NormalDonor1_untreated_PBMC_057) and GSM4916559 (NormalDonor1_untreated_PBMC_059) for myeloid cells ^29,30.

Statistical analysis

A paired t-test was applied to compare the results for the 12 target ratios in Figure 3c.
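
For reference, the standard paired t statistic can be written as below. The symbols \(d_i\), \(\bar{d}\), and \(s_d\) are notation introduced here for the per-target differences between the two compared results, their mean, and their sample standard deviation; with n = 12 target ratios, the statistic has n − 1 = 11 degrees of freedom.

\[
t = \frac{\bar{d}}{s_d / \sqrt{n}}, \qquad
\bar{d} = \frac{1}{n} \sum_{i=1}^{n} d_i, \qquad
s_d = \sqrt{\frac{1}{n-1} \sum_{i=1}^{n} \left( d_i - \bar{d} \right)^2}
\]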

Acknowledgments

I wish to thank T. Shinagawa, H. Nakamoto, and Y. Yamaguchi for critically reading the manuscript, and Editage (www.editage.com) for English language editing. This work was financially supported by the Tokushukai Medical Group.

Author Contributions

Conceptualization, T.Y.; Investigation, T.Y.; Writing, T.Y.

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Tomoyuki YAMAGUCHI ([email protected]).

Declaration of interests

The author declares no competing interests.

Resource availability

Data availability

All relevant data supporting the key findings of this study are available within the article and its Supplementary information files.

Code availability

The code generated in this study is provided as supplementary files and is also available on GitHub (https://github.com/tyamaguc-tky/Learning_pair).

Supplemental Information

Four code files are provided as supplemental information: the learning hierarchical-pair model (Code File 1), clustering of genes (Code File 2), the model of human gene expression (Code File 3), and the model with an mRNA pool (Code File 4). Supplemental Table 1 (the 11,281 genes and pairing indexes in the hierarchical-pair architecture) is provided as a zip-compressed tab-separated values file that is readable in Excel after decompression.

References

1. Meyer, P. & Saez-Rodriguez, J. Advances in systems biology modeling: 10 years of crowdsourcing DREAM challenges. Cell Syst 12, 636-653, doi:10.1016/j.cels.2021.05.015 (2021).
2. Karlebach, G. & Shamir, R. Modelling and analysis of gene regulatory networks. Nat Rev Mol Cell Biol 9, 770-780, doi:10.1038/nrm2503 (2008).
3. Djebali, S. et al. Landscape of transcription in human cells. Nature 489, 101-108, doi:10.1038/nature11233 (2012).
4. Mnih, V. et al. Human-level control through deep reinforcement learning. Nature 518, 529-533, doi:10.1038/nature14236 (2015).
5. Silver, D. et al. Mastering the game of Go with deep neural networks and tree search. Nature 529, 484-489, doi:10.1038/nature16961 (2016).
6. LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436-444, doi:10.1038/nature14539 (2015).
7. Pezzulo, G. & Levin, M. Top-down models in biology: explanation and control of complex living systems above the molecular level. J R Soc Interface 13, doi:10.1098/rsif.2016.0555 (2016).
8. Friston, K. The free-energy principle: a unified brain theory? Nat Rev Neurosci 11, 127-138, doi:10.1038/nrn2787 (2010).
9. Himeoka, Y. & Kaneko, K. Epigenetic Ratchet: Spontaneous Adaptation via Stochastic Gene Expression. Sci Rep 10, 459, doi:10.1038/s41598-019-57372-0 (2020).
10. Waddington, C. The Strategy of the Genes. (Allen & Unwin, 1957).
11. Wang, J., Zhang, K., Xu, L. & Wang, E. Quantifying the Waddington landscape and biological paths for development and differentiation. Proc Natl Acad Sci U S A 108, 8257-8262, doi:10.1073/pnas.1017017108 (2011).
12. Jaenisch, R. & Bird, A. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet 33 Suppl, 245-254, doi:10.1038/ng1089 (2003).
13. Yamaguchi, T. et al. Theoretical modeling reveals that regulatory T cells increase T-cell interaction with antigen-presenting cells for stable immune tolerance. Int Immunol 31, 743-753, doi:10.1093/intimm/dxz043 (2019).
14. Lazar, V. et al. Antibiotic-resistant bacteria show widespread collateral sensitivity to antimicrobial peptides. Nat Microbiol 3, 718-731, doi:10.1038/s41564-018-0164-0 (2018).
15. Petropoulos, S. et al. Single-Cell RNA-Seq Reveals Lineage and X Chromosome Dynamics in Human Preimplantation Embryos. Cell 165, 1012-1026, doi:10.1016/j.cell.2016.03.023 (2016).
16. Loh, K. M. et al. Mapping the Pairwise Choices Leading from Pluripotency to Human Bone, Heart, and Other Mesoderm Cell Types. Cell 166, 451-467, doi:10.1016/j.cell.2016.06.011 (2016).
17. Chu, L. F. et al. Single-cell RNA-seq reveals novel regulators of human embryonic stem cell differentiation to definitive endoderm. Genome Biol 17, 173, doi:10.1186/s13059-016-1033-x (2016).
18. Yan, L. et al. Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells. Nat Struct Mol Biol 20, 1131-1139, doi:10.1038/nsmb.2660 (2013).
19. Parker, M. M. et al. RNA sequencing identifies novel non-coding RNA and exon-specific effects associated with cigarette smoking. BMC Med Genomics 10, 58, doi:10.1186/s12920-017-0295-9 (2017).
20. Mende, N. et al. Quantitative and molecular differences distinguish adult human medullary and extramedullary haematopoietic stem and progenitor cell landscapes. bioRxiv 2020, 919753, doi:10.1101/2020.01.26.919753 (2020).
21. He, B. et al. Single-cell RNA sequencing reveals the mesangial identity and species diversity of glomerular cell transcriptomes. Nat Commun 12, 2141, doi:10.1038/s41467-021-22331-9 (2021).
22. Liang, Q. et al. Single-nuclei RNA-seq on human retinal tissue provides improved transcriptome profiling. Nat Commun 10, 5743, doi:10.1038/s41467-019-12917-9 (2019).
23. Nomura, S. et al. Cardiomyocyte gene programs encoding morphological and functional signatures in cardiac hypertrophy and failure. Nat Commun 9, 4435, doi:10.1038/s41467-018-06639-7 (2018).
24. Trapnell, C. et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol 32, 381-386, doi:10.1038/nbt.2859 (2014).
25. Wang, W. et al. Single-cell transcriptomic atlas of the human endometrium during the menstrual cycle. Nat Med 26, 1644-1653, doi:10.1038/s41591-020-1040-z (2020).
26. Hu, Z. et al. The Repertoire of Serous Ovarian Cancer Non-genetic Heterogeneity Revealed by Single-Cell Sequencing of Normal Fallopian Tube Epithelial Cells. Cancer Cell 37, 226-242 e227, doi:10.1016/j.ccell.2020.01.003 (2020).
27. Liu, Y. et al. Single-cell RNA-seq reveals the diversity of trophoblast subtypes and patterns of differentiation in the human placenta. Cell Res 28, 819-832, doi:10.1038/s41422-018-0066-y (2018).
28. Li, L. et al. Single-Cell RNA-Seq Analysis Maps Development of Human Germline Cells and Gonadal Niche Interactions. Cell Stem Cell 20, 858-873 e854, doi:10.1016/j.stem.2017.03.007 (2017).
29. Karamitros, D. et al. Single-cell analysis reveals the continuum of human lympho-myeloid progenitor cells. Nat Immunol 19, 85-97, doi:10.1038/s41590-017-0001-2 (2018).
30. Anand, P. et al. Single cell RNA-seq reveals developmental plasticity with coexisting oncogenic and immune evasion programs in ETP-ALL. Blood 137, 2463-2480, doi:10.1182/blood.2019004547 (2021).

Editorial decision: Major revision (10 Feb, 2022)
Reviews received at journal (01 Jan, 2022)
Reviewers agreed at journal (11 Dec, 2021)
Reviewers invited by journal (10 Dec, 2021)
Editor assigned by journal (10 Dec, 2021)
Editor invited by journal (10 Dec, 2021)
Submission checks completed at journal (10 Dec, 2021)
First submitted to journal (02 Dec, 2021)
