Concept: Cis-regulatory element
Decoding post-transcriptional regulatory programs in RNA is a critical step towards the larger goal of developing predictive dynamical models of cellular behaviour. Despite recent efforts, the vast landscape of RNA regulatory elements remains largely uncharacterized. A long-standing obstacle is the contribution of local RNA secondary structure to the definition of interaction partners in a variety of regulatory contexts, including–but not limited to–transcript stability, alternative splicing and localization. There are many documented instances where the presence of a structural regulatory element dictates alternative splicing patterns (for example, human cardiac troponin T) or affects other aspects of RNA biology. Thus, a full characterization of post-transcriptional regulatory programs requires capturing information provided by both local secondary structures and the underlying sequence. Here we present a computational framework based on context-free grammars and mutual information that systematically explores the immense space of small structural elements and reveals motifs that are significantly informative of genome-wide measurements of RNA behaviour. By applying this framework to genome-wide human mRNA stability data, we reveal eight highly significant elements with substantial structural information, for the strongest of which we show a major role in global mRNA regulation. Through biochemistry, mass spectrometry and in vivo binding studies, we identified human HNRPA2B1 (heterogeneous nuclear ribonucleoprotein A2/B1, also known as HNRNPA2B1) as the key regulator that binds this element and stabilizes a large number of its target genes. We created a global post-transcriptional regulatory map based on the identity of the discovered linear and structural cis-regulatory elements, their regulatory interactions and their target pathways. This approach could also be used to reveal the structural elements that modulate other aspects of RNA behaviour.
The amniote phallus and limbs differ dramatically in their morphologies but share patterns of signaling and gene expression in early development. Thus far, the extent to which genital and limb transcriptional networks also share cis-regulatory elements has remained unexplored. We show that many limb enhancers are retained in snake genomes, suggesting that these elements may function in non-limb tissues. Consistent with this, our analysis of cis-regulatory activity in mice and Anolis lizards reveals that patterns of enhancer activity in embryonic limbs and genitalia overlap heavily. In mice, deletion of HLEB, an enhancer of Tbx4, produces defects in hindlimbs and genitalia, establishing the importance of this limb-genital enhancer for development of these different appendages. Further analyses demonstrate that the HLEB of snakes has lost hindlimb enhancer function while retaining genital activity. Our findings identify roles for Tbx4 in genital development and highlight deep similarities in cis-regulatory activity between limbs and genitalia.
The accurate discovery and annotation of regulatory elements remains a challenging problem. The growing number of sequenced genomes creates new opportunities for comparative approaches to motif discovery. Putative binding sites are then considered to be functional if they are conserved in orthologous promoter sequences of multiple related species. Existing methods for comparative motif discovery usually rely on pregenerated multiple sequence alignments, which are difficult to obtain for more diverged species such as plants. As a consequence, misaligned regulatory elements often remain undetected.
Gene regulation relies on the specificity of transcription factor (TF)-DNA interactions. Limited specificity may lead to crosstalk: a regulatory state in which a gene is either incorrectly activated due to noncognate TF-DNA interactions or remains erroneously inactive. As each TF can have numerous interactions with noncognate cis-regulatory elements, crosstalk is inherently a global problem, yet has previously not been studied as such. We construct a theoretical framework to analyse the effects of global crosstalk on gene regulation. We find that crosstalk presents a significant challenge for organisms with low-specificity TFs, such as metazoans. Crosstalk is not easily mitigated by known regulatory schemes acting at equilibrium, including variants of cooperativity and combinatorial regulation. Our results suggest that crosstalk imposes a previously unexplored global constraint on the functioning and evolution of regulatory networks, which is qualitatively distinct from the known constraints that act at the level of individual gene regulatory elements.
Myostatin is a negative regulator of myogenesis and has been suggested to be an important factor in the development of muscle wasting during viral infection. The objective of this study was to characterize the main regulatory element of the grouper myostatin promoter and to study changes in promoter activity due to viral stimulation. In vitro and in vivo experiments indicated that the E-box E6 is a positive cis-and trans-regulation motif, and an essential binding site for MyoD. In contrast, the E-box E5 is a dominant negative cis-regulatory. The characteristics of grouper myostatin promoter are similar in regulation of muscle growth to that of other species, but mainly through specific regulatory elements. According to these results, we conducted a study to investigate the effect of viral infection on myostatin promoter activity and its regulation. The nervous necrosis virus (NNV) treatment significantly induced myostatin promoter activity. The present study is the first report describing that specific myostatin motifs regulate promoter activity and response to viral infection.
Cell type-specific and housekeeping enhancers and promoters collectively control the transcriptional output of mammalian cells. Recent data clarify how DNA sequence features on the one hand control functional coupling of promoters with selected enhancers, and on the other impart high level of activity to a broad range of regulatory elements.
The sine oculis homeobox protein Six3 plays pivotal roles in the development of the brain and craniofacial structures. In humans, SIX3 haploinsufficiency results in holoprosencephaly, a defect in anterior midline formation. Although much is known about the evolutionarily conserved functions of Six3, the regulatory mechanism responsible for the expression pattern of Six3 remains relatively unexplored. To understand how the transcription of Six3 is controlled during embryogenesis, we screened ∼300 kb of genomic DNA encompassing the Six3 locus for cis-acting regulatory elements capable of directing reporter gene expression to sites of Six3 transcription in transgenic mouse embryos. We identified a novel enhancer element, whose activity recapitulates endogenous Six3 expression in the ventral midbrain, pretectum, and thalamus. Cross-species comparisons revealed that this Six3 brain enhancer is functionally conserved in other vertebrates. We also showed that normal Six3 transcription in the ventral midbrain and pretectum is dependent on Ascl1, a basic helix-loop-helix proneural factor. Moreover, loss of Ascl1 resulted in downregulation of the Six3 brain enhancer activity, emphasizing its unique role in regulating Six3 expression in the developing brain. This article is protected by copyright. All rights reserved.
RNAs from other cell types have minimal impact on male fecundity-associated sperm RNA elements.
Pollen-specific expression. Promoters comprise of various cis-regulatory elements which control development and physiology of plants by regulating gene expression. To understand the promoter specificity and also identification of functional cis-acting elements, progressive 5' deletion analysis of the promoter fragments is widely used. We have evaluated the activity of regulatory elements of 5' promoter deletion sequences of anther-specific gene OSIPP3, viz. OSIPP3-∆1 (1504 bp), OSIPP3-∆2 (968 bp), OSIPP3-∆3 (388 bp) and OSIPP3-∆4 (286 bp) through the expression of transgene GUS in rice. In silico analysis of 1504-bp sequence harboring different copy number of cis-acting regulatory elements such as POLLENLELAT52, GTGANTG10, enhancer element of LAT52 and LAT56 indicated that they were essential for high level of expression in pollen. Histochemical GUS analysis of the transgenic plants revealed that 1504- and 968-bp fragments directed GUS expression in roots and anthers, while the 388- and 286-bp fragments restricted the GUS expression to only pollen, of which 388 bp conferred strong GUS expression. Further, GUS staining analysis of different panicle development stages (P1-P6) confirmed that the GUS gene was preferentially expressed only at P6 stage (late pollen stage). The qRT-PCR analysis of GUS transcript revealed 23-fold higher expression of GUS transcript in OSIPP3-Δ1 followed by OSIPP3-Δ2 (eightfold) and OSIPP3-Δ3 (threefold) when compared to OSIPP3-Δ4. Based on our results, we proposed that among the two smaller fragments, the 388-bp upstream regulatory region could be considered as a promising candidate for pollen-specific expression of agronomically important transgenes in rice.
Gene expression regulatory elements are scattered in gene promoters and pre-mRNAs. In particular, RNA elements lying in untranslated regions (5' and 3'UTRs) are poorly studied because of their peculiar features (i.e., a combination of primary and secondary structure elements) which also pose remarkable computational challenges. Several years ago, we began collecting experimentally characterized UTR regulatory elements, developing the specialized database UTRsite. This paper describes the detailed guidelines to annotate cis-regulatory elements in 5' and 3' UnTranslated Regions (UTRs) by computational analyses, retracing all main steps used by UTRsite curators.