Medusa-structure of gene regulatory network
10.05.11
Category: Scientific News
Dominance of transcription factors in cancer subtypes: Practical consequences beyond theoretical biology
Over the past decades, researchers seeking to understand molecular mechanisms underlying various diseases, notably cancer, have taken advantage of DNA microarrays to interrogate tissues specimen of patients for the expression status of thousands of genes at once. Jointly, such gene expression status of each gene in the genome, measured as the level of their transcripts, constitutes the gene expression profile. Since each of the tens of thousands of genes can be switched on or off, a gene expression profile contains complex information. While microarray data was initially used by gene hunters to identify novel genes, such as those which are only active in samples of particular cancer tissues, researchers have later learned to employ sophisticated computational tools to find subgroup specific signatures.
In cancer research such statistical analysis of gene expression patterns can serve to identify new cancer subtypes and help classify patients more accurately, but gene expression profiles are used for clustering tumors often without considering the underlying reason that the distinct patterns of expression arise because of constraints in the realization of gene expression profiles imposed by the gene regulatory network. Although not often asked by computational biologists using statistical analysis to extract information, this is a central and basic biological question.
A team of researchers at Harvard Medical School's Children's Hospital led by Sui Huang (who is now at the University of Calgary) have analyzed gene expression profiles with precisely this question in mind. In the work published in the May 2011 issue of Experimental Biology and Medicine, Dr. Huang and his students, Guo, Feng and Trivedi, offer a first step towards understanding the source of the stable pattern of gene expression profiles by testing whether gene expression profiles are indeed established by a gene regulatory network that has the structure of a "medusa", with a command and control "head" and an enslaved periphery, as proposed by theoreticians.
The expression of a gene is regulated by particular types of proteins, the transcription factors (TFs), of which there are 2000 or so in the human genome. Thus, obviously, the entire gene expression profile is determined by the collective activity of these TFs. Since they also control the expression of each other, this subset of TF genes forms a "core network" of mutual regulation. In addition, they must also control the "non-transcription factor" proteins of the cell, such as cytoskeletal proteins or metabolic enzymes which are also regulated by TFs (as are all genes) but do not regulate the expression of other genes. In this elementary picture, the pattern of the gene expression profile would be determined essentially by the core network which represents the medusa head and controls the peripheral, regulated but not- regulating genes, the medusa arms (tentacles).
If the entire gene expression profile that characterizes the phenotype of cell types, controlled by the core of a just few thousands genes rather than the entire genome of tens of thousands of genes then, as Dr. Huang explains "this would have practical consequences beyond theoretical biology, for it would facilitate gene expression pattern based disease characterisation and diagnosis by allowing efficient computation focused on the regulatory core."
Huang's team has now used a set of gene expression profiles of lung cancer tissues from a group of patients to show that the expression patterns are consistent with a medusa network. They found using various statistical tests that instead of the entire set of almost 10,000 genes available on the DNA microarrays less than a thousand transcription factor genes were sufficient to classify the patient lung cancer samples according to the diagnosed cancer types. The subset of a few hundred TFs performed as well or better than the set of nearly 10,000 genes that represent much of the genome. The effect persisted after correction for gene number and expression levels. Conversely metabolic genes which would correspond to the subordinate arms of the medusa, and hence should have minimal influence on the gene expression profile, performed most poorly in the same comparison.
Interestingly, Huang and his group also found that microRNAs were even more powerful than transcription factors. Since miRNAs are part of the regulatory core, this was not entirely surprising. But why did they perform so much better than transcription factors? As Huang explains, continuing to draw the analogy of molecular regulation to computing the gene expression patterns, microRNAs act as "canalizing Boolean functions" of networks – an old idea first proposed by Stuart Kauffman in the 1970s. MicroRNAs, through their action at the post-transcriptional level, override the input of other regulators. Their occurrence suppresses what in network theory is called chaotic dynamics and allows the network to produce multiple stable gene expression patterns capable of self-organization, so called attractor states. The coexistence of many such stable attractor states is the very basis of multi-cellularity and of cancer, since these cell phenotypes possess distinct stable gene expression profiles. Perhaps it is no coincidence that microRNAs appeared in evolution when gene regulatory networks became highly complex and just before the emergence of multi-cellular organisms.
Dr. Steven Goodman, Editor-in-Chief of Experimental Biology and Medicine, said "The work by Huang and colleagues supporting the organizing role of the Medusa network suggests that gene expression profile interpretation can be performed in the context of our increased understanding of the relationship between the expression profile and the underlying network".
Experimental Biology and Medicine is a journal dedicated to the publication of multidisciplinary and interdisciplinary research in the biomedical sciences. It is the journal of the Society of Experimental Biology and Medicine.
In cancer research such statistical analysis of gene expression patterns can serve to identify new cancer subtypes and help classify patients more accurately, but gene expression profiles are used for clustering tumors often without considering the underlying reason that the distinct patterns of expression arise because of constraints in the realization of gene expression profiles imposed by the gene regulatory network. Although not often asked by computational biologists using statistical analysis to extract information, this is a central and basic biological question.
A team of researchers at Harvard Medical School's Children's Hospital led by Sui Huang (who is now at the University of Calgary) have analyzed gene expression profiles with precisely this question in mind. In the work published in the May 2011 issue of Experimental Biology and Medicine, Dr. Huang and his students, Guo, Feng and Trivedi, offer a first step towards understanding the source of the stable pattern of gene expression profiles by testing whether gene expression profiles are indeed established by a gene regulatory network that has the structure of a "medusa", with a command and control "head" and an enslaved periphery, as proposed by theoreticians.
The expression of a gene is regulated by particular types of proteins, the transcription factors (TFs), of which there are 2000 or so in the human genome. Thus, obviously, the entire gene expression profile is determined by the collective activity of these TFs. Since they also control the expression of each other, this subset of TF genes forms a "core network" of mutual regulation. In addition, they must also control the "non-transcription factor" proteins of the cell, such as cytoskeletal proteins or metabolic enzymes which are also regulated by TFs (as are all genes) but do not regulate the expression of other genes. In this elementary picture, the pattern of the gene expression profile would be determined essentially by the core network which represents the medusa head and controls the peripheral, regulated but not- regulating genes, the medusa arms (tentacles).
If the entire gene expression profile that characterizes the phenotype of cell types, controlled by the core of a just few thousands genes rather than the entire genome of tens of thousands of genes then, as Dr. Huang explains "this would have practical consequences beyond theoretical biology, for it would facilitate gene expression pattern based disease characterisation and diagnosis by allowing efficient computation focused on the regulatory core."
Huang's team has now used a set of gene expression profiles of lung cancer tissues from a group of patients to show that the expression patterns are consistent with a medusa network. They found using various statistical tests that instead of the entire set of almost 10,000 genes available on the DNA microarrays less than a thousand transcription factor genes were sufficient to classify the patient lung cancer samples according to the diagnosed cancer types. The subset of a few hundred TFs performed as well or better than the set of nearly 10,000 genes that represent much of the genome. The effect persisted after correction for gene number and expression levels. Conversely metabolic genes which would correspond to the subordinate arms of the medusa, and hence should have minimal influence on the gene expression profile, performed most poorly in the same comparison.
Interestingly, Huang and his group also found that microRNAs were even more powerful than transcription factors. Since miRNAs are part of the regulatory core, this was not entirely surprising. But why did they perform so much better than transcription factors? As Huang explains, continuing to draw the analogy of molecular regulation to computing the gene expression patterns, microRNAs act as "canalizing Boolean functions" of networks – an old idea first proposed by Stuart Kauffman in the 1970s. MicroRNAs, through their action at the post-transcriptional level, override the input of other regulators. Their occurrence suppresses what in network theory is called chaotic dynamics and allows the network to produce multiple stable gene expression patterns capable of self-organization, so called attractor states. The coexistence of many such stable attractor states is the very basis of multi-cellularity and of cancer, since these cell phenotypes possess distinct stable gene expression profiles. Perhaps it is no coincidence that microRNAs appeared in evolution when gene regulatory networks became highly complex and just before the emergence of multi-cellular organisms.
Dr. Steven Goodman, Editor-in-Chief of Experimental Biology and Medicine, said "The work by Huang and colleagues supporting the organizing role of the Medusa network suggests that gene expression profile interpretation can be performed in the context of our increased understanding of the relationship between the expression profile and the underlying network".
Experimental Biology and Medicine is a journal dedicated to the publication of multidisciplinary and interdisciplinary research in the biomedical sciences. It is the journal of the Society of Experimental Biology and Medicine.
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου