Transcriptional and Post-transcriptional Gene Regulation by Long Non-coding RNA

Iain M. Dykes 1,*,a, Costanza Emanueli1,2,b   2017-07-05 23:26:23

1 School of Clinical Sciences, University of Bristol, Bristol BS2 8HW, United Kingdom

2 National Heart and Lung Institute, Imperial College London, London SW7 2AZ, United Kingdom

Received 31 October 2016; revised 8 December 2016; accepted 25 December 2016 Available online 19 May 2017

Handled by Yangming Wang

KEYWORDS

Long non-coding RNA;

MicroRNA;

Transcriptional regulation;

Epigenetics;

Post-transcriptional

regulation

Abstract Advances in genomics technology over recent years have led to the surprising discovery that the genome is far more pervasively transcribed than was previously appreciated. Much of the newly-discovered transcriptome appears to represent long non-coding RNA (lncRNA), a heterogeneous group of largely uncharacterised transcripts. Understanding the biological function of these molecules represents a major challenge and in this review we discuss some of the progress made to date. One major theme of lncRNA biology seems to be the existence of a network of interactions with microRNA (miRNA) pathways. lncRNA has been shown to act as both a source and an inhibitory regulator of miRNA. At the transcriptional level, a model is emerging whereby lncRNA bridges DNA and protein by binding to chromatin and serving as a scaffold for modifying protein complexes. Such a mechanism can bridge promoters to enhancers or enhancer-like non-coding genes by regulating chromatin looping, as well as conferring specificity on histone modifying complexes by directing them to specific loci.

The emerging field of long non-coding RNA

Recent advances in sequencing technologies have demonstrated that far more of the genome is transcribed than was previously appreciated. There has been an explosion in the number of described non-coding genes and with it a corresponding surge in interest in this emerging field. The term‘‘dark matter” was coined to describe the large number of previously-overlooked transcripts of uncertain function revealed by such work [1,2] indicating the existence of a large non-coding transcriptome far exceeding that of the more familiar coding genes. The rate of discovery has far outpaced our ability to functionally characterise these transcripts and the vast majority have no known function. Despite this, the importance of the dark matter is demonstrated by genome-wide association studies (GWAS) which have indicated that long non-coding RNA (lncRNA) genes are enriched for trait or disease-linked polymorphisms [3,4] and, indeed, over 90% of all GWAS hits lie outside of known coding genes [5]. This review will be focussed on efforts to understand the biological function of lncRNA. Of necessity, we will be focussing on a small number of the best-known genes, but throughout we will discuss to what extent these may represent widespread mechanisms.

The diversity of lncRNA

A large proportion of the genome is transcribed into RNA that lacks protein coding information and is never translated [6–8]. These non-coding RNAs (ncRNAs) are generally divided into long and short groupings using an arbitrary threshold of 200 nucleotides (nt). Short ncRNAs include the relatively wellknown microRNAs (miRNAs) [9,10], as well as smallinterfering RNAs (siRNAs) and P-element-induced wimpy testis (PIWI)-interacting RNA (piRNAs) [11], which generally act to negatively regulate gene expression. In contrast, the long ncRNA (lncRNAs) are a large, heterogeneous group of ncRNAs of largely-unknown function. lncRNAs share many features with coding transcripts, such as the presence of epigenetic marks indicating differential expression [12], the presence of introns, and the existence of splice variants. Many but not all lncRNAs are polyadenylated, and there is evidence indicating that many lncRNAs exist in both polyadenylated and nonpolyadenylated forms (termed bimorphic) [3]. The broad term lncRNA includes many different types of RNA, exhibiting a range of genomic structures and relationships to the coding transcriptome. Some are pseudogenes, copies of coding genes harbouring mutations rendering them non-coding [13]. Many lncRNAs overlap coding genes, and indeed, one estimate suggests that 20% of human transcripts exist as sense-antisense pairs [14]. These transcripts may overlap the entire gene or only a part of it, and non-coding transcripts may originate from either the sense or antisense strand [15,16]. Many lncRNAs are described as intergenic (meaning that they do not lie within or overlap coding genes) and are sometimes known as long intergenic ncRNAs (lincRNAs) [3]. We now know that transcripts do not have to be linear, with the discovery that circular RNA (circRNA) is a common form of transcribed RNA [17,18]. Although many circRNAs are transcribed from coding regions, these transcripts are believed to be non-coding.

With the discovery of all of these forms of RNA, the genome can no longer be thought of as a linear array of distinct transcriptional units, but rather is‘‘an amazingly complex landscape of interlacing and overlapping transcripts, not only on opposite strands, but also on the same strand, so that there is often no clear distinction between splice variants and overlapping and neighbouring genes”[19].

How much of the genome is transcribed?

Tiling microarrays, in which arrays are made of probes at short intervals designed to span the entire genome, were thefirst to suggest nearly pervasive transcription of the entire genome [1]. One study by Affymetrix on our smallest chromosomes, Chr.21 and Chr.22, has indicated that 94% of probes detect transcripts outside of known exons [6], while another study suggests that 49% of transcribed nucleotides are outside of any annotated gene [20]. The development of massively parallel sequencing technologies, commonly referred to as RNA sequencing (RNA-seq), has offered many improvements over microarray-based technology in terms of reproducibility, sensi-tivity, coverage, and accuracy in mapping homologous sequences [21]. The Encyclopaedia of DNA Elements (ENCODE) project has utilised RNA-seq of 15 human cell lines to demonstrate that 74.7% of the genome is transcribed [7], while a similar experiment utilising RNA-seq of samples derived from 23 different human tissue types indicated that 85% of the genome is transcribed [3]. The Functional Annotation of the Mammalian Genome (FANTOM) project has revealed a similar level of pervasive transcription with 63% of the genome shown to be transcribed by using the cap analysis of gene expression (CAGE) to profile transcription initiation sites within the genome [8]. A related method is to predict the location of transcribed regions of the genome based on the pattern of histone modification. Actively-transcribed genes are marked by trimethylation of lysine 4 on histone H3 (H3K4me3) at their promoters and H3K36me3 along the length of the transcribed regions. These so-called K4-K36 regions can be used in genome-wide chromatin precipitation assays to predict transcripts [12].

Thus it seems that the majority of our DNA is transcribed into RNA at some point during development. However, much of the previously-unappreciated ncRNA exists at a very low level. Most of the RNA in a cell is rRNA and tRNA, whereas mRNA makes up 3%–7% of the total by mass. lncRNA makes up 0.03%–0.20% [22], thus being 15–230 times less abundant in the cell than coding transcripts.

Functions of lncRNA

While the vast majority of lncRNAs do not yet have any known function, we are beginning to understand the functions of a small number of characterised lncRNAs. lncRNAs can act at many different levels of gene expression and their functions are highly diverse. This diversity reflects the versatility of RNA itself: through folding into a variety of secondary structures, RNA can bind to a large number of substrates in a highlyspecific manner [23]. In addition, without the need for translation, ncRNA expression is highly dynamic and can be rapidly up- or down-regulated to modulate gene expression [23]. Here, we will review some of the major described functions and highlight what appear to be general principles of lncRNA biology. For a more detailed discussion on this subject, the reader is directed to several of the many excellent reviews published [23–26].

Post-transcriptional regulation by lncRNA

lncRNA as a source of miRNA miRNA is produced from primary transcript (pri-miRNA) by two processing stages. Drosha and DiGeorge syndrome chromosomal region 8 (DGCR8) cut the pri-miRNA while it is in the nucleus into a precursor (pre-miRNA) of~60 nt. The premiRNA is then exported to the cytoplasm where it is processed by a second enzyme complex, Dicer/TAR RNA protein (TRBP), to produce the mature miRNA of 20–23 nt [10,27]. Most pri-miRNAs are generally greater than 1 kb in length [27], and therefore may be regarded as a form of lncRNA. There are two major sources of pri-miRNAs in the genome: those that are embedded within another gene and whose expression is thus normally, but not always, linked to the expression of the parent transcript; and those that are transcribed independently from what had previously been considered to be intergenic regions. The latter group is transcribed from miRNA genes, which contain promoters that regulate their transcription by RNA polymerase II (RNAPII) in a manner similar to mRNA [28].

Approximately 50% of miRNAs are produced from non-coding transcripts [29] (Figure 1). Interestingly, in common with those embedded in coding genes, many miRNAs within non-coding genes are also located within introns [28]. Such a genomic organisation suggests that the host lncRNA does not simply act as a pri-miRNA but may have other additional roles encoded by the exons. Examples of lncRNAs harbouring intronic miRNAs include DLEU2, which is the host gene of the tumour suppressor miR-15a/16.1 cluster located within its third intron [30,31]. These miRNAs are frequently down-regulated in leukaemia. Interestingly, in adult chronic lymphocytic leukaemia, the expression of miR-15a/16.1 seems to be regulated by the host gene promoter, which is bound by the transcription factors (TFs) MYC [31] and paired box 5 (PAX5; previously also known as B-cell-specific activator protein, BSAP) [32], while in childhood acute myeloid leukaemia, data from methylation assays indicate that this miRNA cluster is regulated independently of its host gene [33]. A second example is the tumour suppressor miR-31 that is downregulated in breast cancer. MIR-31 gene is embedded within an intron of the lncRNA LOC554202 and its transcription is regulated by the methylation state of the host gene promoter [34].

A minority of lncRNA-embedded miRNAs reside not within introns but within an exon of the spliced lncRNA [28,35]. Many such lncRNAs are named for the miRNA which they encode. For example, the lncRNA MIR155 host gene (MIR155HG; formerly known as B-cell integration cluster, BIC) harbours an exonic miRNA, miR-155, and this region of the lncRNA shows the strongest cross-species conservation [36,37]. Similarly, MIR22HG encodes miR-22 within its second exon [35], while MIR17HG harbours a cluster of six miRNAs within its second exon [38].

One of thefirst lncRNAs to be discovered and perhaps the most studied, H19, contains miR-675 embedded within itsfirst intron [39]. Although the H19 transcript is widely expressed in the mouse embryo, miR-675 expression is limited to the placenta [40]. This indicates that processing of the H19 transcript to release miR-675 is inhibited, which would seem to be mediated by binding of a RBP, human antigen R (HuR), to a site upstream of miR-675, thus blocking Drosha processing of the primary transcript [40]. Furthermore, the disparity between H19 and miR-675 expression suggests that H19 may not simply function as a pri-miRNA but may have additional functions. This hypothesis is supported by the discovery of additional H19 functions (discussed below).

lncRNA as a negative regulator of miRNA

miRNAs are negative regulators of gene expression. Transcripts are targeted through binding of a short 7-nt seed sequence within the miRNA to an miRNA response element (MRE). MREs are short and binding does not have to be perfectly complementary [41], which makes predicting miRNA targets difficult. Computational predictions suggest that, potentially, a single miRNA may target hundreds of transcripts [41]. However, the number of target genes that are physiologically relevant targets of a given miRNA is often much lower [42]. There seems to be a disconnection between the number of predicted targets and the number of actual targets. Given the promiscuity of miRNA seed sequences, it is perhaps unsurprising that many lncRNAs contain predicted miRNA binding sites. This raises an interesting possibility that the function of many lncRNAs may be to regulate gene expression by sequestering miRNAs, thus limiting their concentration within the cell and thereby reducing the pool of available miRNA in the cell. In this way, the lncRNA acts as a negative regulator of miRNA function and, by extension, a positive regulator of gene expression. This is known as the‘‘competing endogenous RNA (ceRNA)” hypothesis [43] (Figure 2).

Examples of this type of interaction include the intergenic lincRNA-ROR, which inhibits miR-145 in pluripotent embryonic stem cells (ESCs) [44]. lincRNA-ROR expression is activated by pluripotent TFs such as NANOG, SOX2, and OCT4 and genes encoding these TFs are targeted by miR145. Therefore, this lncRNA creates a feedback loop within the pluripotent gene network [44]. OCT4 expression is upregulated in many cancers including hepatocellular carcinoma [45], and thus in these cells miR-145 acts as a tumour suppressor. Interestingly, a non-coding pseudogene of OCT4 called OCT4-pg4 is co-expressed with OCT4 and appears to serve as an endogenous competitor of OCT4, protecting OCT4 from miR-145-mediated degradation [46]. Pseudogenes are copies of coding genes that arise through DNA duplication followed by the accumulation of mutations in one copy, rendering the gene non-coding. Despite this, many pseudogenes are expressed as lncRNAs. Clearly, a non-coding transcript that shares a high degree of homology with a coding gene is likely to share many of its MREs and therefore pseudogenes are good candidates to act as ceRNAs. Indeed, this seems to be the case [13,47,48]. Examples of such lncRNAs include a pseudogene homologous to the gene encoding tumour suppressor phosphatase and tensin homologue (PTEN), which contains multiple MREs within the 30UTR shared with the coding gene [48], as well as pseudogenes homologous to the 30 UTRs of the genes encoding tumour suppressor candidate 2 (TUSC2) [49] and forkhead box protein O1 (FOXO1) [50].Figure 1 lncRNA as a source of miRNAMany lncRNA genes contain embedded miRNA sequences (red boxes), which may be located within either an exon (blue box) or an intron (line) of the gene. Furthermore, miRNAs are encoded by independent transcriptional units and often occur in clusters within the genome. The three sources result in very different types of primary transcript but the pathways converge at the level of pre-miRNA structure. lncRNA, long non-coding RNA; miRNA, microRNA; pri-miRNA, primary miRNA; pre-miRNA, precursor miRNA.Some researchers remain sceptical about how widespread this mechanism may be, arguing that the low levels of expression of most lncRNAs relative to mRNAs means that alterations in lncRNA levels will not have more than a minor effect on miRNA availability, and so will be ineffective as a competitor [51]. Indeed, many studies on ceRNAs rely on ectopic overexpression of the ceRNA at artificially-elevated levels [49,50]. Thomson and Dinger argue that while the ceRNA hypothesis is attractive because it could provide an explanation of the functions for so many uncharacterised transcripts, its biological relevance may be limited [51]. mRNA would be a more effective ceRNA for this reason and, interestingly, the ceRNA hypothesis is not limited to lncRNA. Salmena et al.[43] suggest that mRNAs could also function, in part, to regulate the expression of other mRNAs through a similar mechanism. If true, the mRNA of some coding genes may have a protein-independent non-coding function [43], in effect acting as a lncRNA. This is supported by the observation that the 30UTRs of over 1500 human mRNAs are expressed independently of the coding part of the same transcript [52]. Furthermore, in about half of such genes examined in mice, the expression pattern of the independent 30 UTRs is distinct from that of the parent mRNAs [52].

One lncRNA that is expressed at a high level, and therefore may be effective as a ceRNA, is the previously-mentioned H19. This lncRNA is highly expressed in undifferentiated muscle cells, while its expression decreases in differentiated cells at about the same time as the expression of miRNA let-7 increases [53]. H19 contains let-7 binding sites and siRNAinduced depletion of H19 in mouse C2C12 muscle cells leads to reduced expression of let-7 target genes and increased expression of markers of muscle differentiation [53]. H19 also binds to members of the miR-17-5p seed family. Expression levels of H19 target mRNAs during myoblast differentiation suggest H19 is competing for miR-17-5p binding [54]. Thus, H19 acts as a primary transcript of one miRNA and a ceRNA for a number of others.

CircRNAs are potentially very stable because, unlike linear lncRNAs, they are resistant to exonuclease digestion. This feature raises the possibility that these RNA species may act as ceRNAs. The natural antisense transcript of the gene encoding cerebellar degeneration-related protein 1 (CDR1-AS, also known as ciRS-7) is one of the best-characterised circRNAs [55–57]. It is highly expressed in mouse hippocampus and neocortex, where expression of CDR1-AS overlaps with miR-7 expression domains [55,57]. CDR1-AS contains a large number of miR-7 binding sites, which have been shown to deplete miR-7 and therefore to regulate expression of miR-7 target genes in vitro [55,57]. Interestingly, there are mismatches in the central region of the miR-7 binding sites within CDR1-AS [55]. These render CDR1-AS resistant to miRNA-mediated degradation, thus the pool of ceRNA is not depleted by the process of competition, making it a very effective competitor [55]. In contrast, the binding site within CDR1-AS for miR-671 is nearly perfectly complementary, and so binding of this miRNA negatively regulates the level of CDR1-AS expression [55].Figure 2 The ceRNA hypothesismRNA contains MREs (ovals), which are normally located within the 3 0 UTR. miRNA binding to the identical MREs may be present in a number of ncRNA species, including pseudogenes, circRNAs, other forms of lncRNA, and independently-transcribed mRNA 30 UTRs. All of these RNAs could potentially compete for a limited pool of miRNA, thus positively regulating gene expression. lncRNA and circRNA may carry MREs for multiple miRNAs (indicated by differently coloured ovals). MRE, miRNA response element; UTR, untranslated region; miRNA, microRNA; lncRNA, long non-coding RNA; circRNA, circular RNA; CDS, coding sequence; ceRNA, competing endogenous RNA.miRNA-independent mRNA degradation

In addition to regulating gene expression through an interaction with miRNA, there is evidence that some lncRNAs can directly target mRNA for degradation. Staufen 1 (STAU1) is a protein that recognises a specific motif in the 30 UTR of mRNAs and mediates their degradation by nonsensemediated mRNA decay (NMD) [58]. For example, STAU1 binds to a double-stranded RNA motif within the 30 UTR of the mRNA encoding ADP-ribosylation factor 1 (ARF1) [59], where it is formed by a stem loop structure within the mRNA itself. However, some mRNAs targeted by Staufen-mediated decay, such as the one encoding serpin peptidase inhibitor, clade E member 1 (SERPINE1), contain only a singlestranded binding site within the 30 UTR, lacking the stem loop structure. Interestingly, it appears that such mRNAs may be targeted by a lncRNA carrying a complementary singlestranded binding site, imperfect binding of the lncRNA to the mRNA thus creating a double-stranded RNA binding motif for STAU1 [60]. This class of lncRNAs has been named half STAU1 binding site RNA (1/2-SBS1RNA) [60]. It is of note that the lncRNA terminal differentiation-induced ncRNA (TINCR) also recruits STAU1 to mRNAs such as the one encoding peptidoglycan recognition protein 3 (PGLYRP3) in epidermis [61]. Nonetheless, the interaction between TINCR and STAU1 does not trigger NMD and instead such binding increases stability of interacting mRNAs containing the TINCR box motif [61]. These findings suggest that a number of outcomes are possible, perhaps dependent on recruitment of additional factors.

Transcriptional regulation by lncRNA

Transient lncRNA is transcribed from active enhancers Transcription of most genes involves an interaction of a proximal promoter with more distant enhancer elements. Enhancers are located often at a large distance away from the transcriptional start site (TSS) and bind tissue-specific TFs that function to regulate differential gene expression [62]. The expression of a given gene is often regulated by the combinatorial effects of one or more enhancers, each active at a specific developmental time point or in a specific tissue. For example, expression of the gene NODAL during embryonic development is regulated by the interaction between at leastfive enhancers [63].

Active enhancers are bound by RNAPII, which may reflect their interaction with the promoter. By studying enhancers activated by calcium signalling in mouse neurons, Kim et al. [64] made the surprising discovery that an ncRNA of about 2 kb is bidirectionally transcribed from active enhancers. Expression of this enhancer RNA (eRNA) seems to be correlated with the activity of the enhancer [64,65]. eRNA may either be polyadenylated [66] or lack polyadenylation [64], the latter suggesting transience and instability. However, a number of studies suggest that it may nevertheless be functional. For example, the nuclear receptors nardilysin 1 (NRD1) and NRD2 (also known as Rev-Erbs) regulate transcription of the target gene encoding matrix metallopeptidase 9 (MMP9) by inhibiting the expression of the eRNA transcribed from an MMP9 enhancer [67]. Global run-on sequencing was used to characterise these transcripts, demonstrating that the MMP9 enhancer is 983 bp in length and consists of a central core of 388 bp containing TF binding sites flanked by sequence encoding sense and antisense eRNAs [67].

It has been proposed that eRNAs may play a role in chromatin remodelling, acting to promote chromatin accessibility [66] and stabilise the DNA loop necessary to bring a distal enhancer into apposition with its promoter [68]. For instance, chromatin at the FOXC1 locus is stabilised by a complex including the oestrogen receptor alpha (ERa) and its ligand, as well as an eRNA transcribed from a FOXC1 enhancer [68].

Enhancer-like activity of lncRNA genes

A related class of lncRNAs is the activating ncRNAs (ncRNAas). These are a species of lncRNAs transcribed from independent loci, but not from enhancers. They also have a transcriptional activation function [69–71]. ncRNA-as specifically activate the transcription of neighbouring coding genes in an RNA-dependent fashion, requiring the activity of the coding gene promoter [69]. Thus functionally they are highly similar to eRNAs. However, in contrast to eRNAs, ncRNA-as are spliced, polyadenylated stable transcripts. In common with both DNA enhancers and eRNAs, gene activation mediated by the ncRNA-a requires a change in chromosomal conformation to bring the ncRNA-a locus close to the promoter of its target gene [72]. Mediator is a protein complex which, along with cohesin, is involved in bridging together enhancers and promoters [73]. A number of ncRNA-as have been shown to be associated with the mediator complex, and depletion of the complex inhibits looping between the ncRNA-a locus and its target gene. Thus eRNA and ncRNA-a may function by interacting with the same set of molecules, forming a scaffold for a protein complex that bridges the enhancer-like element and the promoter of a coding gene (Figure 3).

While most circRNAs are cytoplasmic and thus may function in post-transcriptional regulation, a specific subclass of circRNAs demonstrates nuclear localisation and appears to function in transcriptional regulation. These circRNAs, in contrast to those described above, retain both exons and introns and therefore have been named exon–intron circRNAs (EIcircRNAs) [74]. EIcircRNAs are able to enhance transcription of their parental (coding) genes by interacting with RNAPII in an interaction that requires the U1 small nuclear RNA (snRNA) [74].

Transcriptional regulation by recruitment of chromatin modifiers A number of the best-characterised lncRNAs, such as X-chromosome inactivation (XIST) and H19, are involved in processes such as X-chromosome inactivation (XIST) and imprinting (H19), in which a large region of a chromosome is inactivated such that only one copy is expressed [75]. In both cases, the lncRNA is expressed from the inactivated chromosome and has a binary relationship with its coding gene expression from the other chromosome, which is controlled by a cis-acting master control region [75]. H19 is expressed exclusively from the maternally-inherited chromosome, while the adjacent gene encoding insulin-like growth factor 2 (IGF2) is expressed exclusively from the paternal chromosome [76,77]. Deletion of H19 in mice leads to maternal expression of IGF2, resulting in an overgrowth phenotype [78] which can be rescued by transgenic expression of H19 [79]. Mechanistically, it has been shown that many of these lncRNAs are associated with chromatin modifying complexes and that this interaction directs inhibitory epigenetic modification of histones at adjacent loci. For example, the master control region lying between H19 and IGF2 consists of a region of differentially-methylated chromatin. It seems that H19 is able to modify this control region by interacting with methyl CpG binding domain protein 1 (MBD1), which in turn interacts with histone lysine methyltransferases resulting in H3K9Me3 [80]. Another imprinted lncRNA, antisense Igf2r RNA (AIR), has been shown to be localised to the promoter of a target gene encoding solute carrier family 22 member 3 (SLC22A3) and to direct the histone methyltransferase G9a to this locus [81]. The imprinted KCNQ1 overlapping transcript 1 (KCNQ1OT1) operates via a similar mechanism [15].

While many imprinted lncRNAs appear to act in cis, regulating nearby loci, there is also evidence that lncRNAs can act in trans. An elegant model for this is regulation of the homeobox (HOX) gene clusters, a group of homeotic genes that function to convey positional information in the embryo to establish the antero-posterior body axis. Each HOX gene is expressed in a specific domain along this axis, demonstrating colinearity, in which the order of genes on the chromosome reflects their expression domains in the body [82]. Remarkably, human fibroblasts retain this positional information in primary culture, permitting in vitro studies [83]. Differential gene expression is maintained by the creation of broad regions of open or closed chromatin within the HOX gene clusters by the action of the histone lysine methyltransferases polycomb repressive complex 2 (PRC2) and trithorax, which have opposing actions [84]. Over 200 lncRNAs are transcribed from the HOX clusters and seem to play a central role in the regulation of histone methylation states [84]. HOX transcript antisense RNA (HOTAIR), one of these lncRNAs expressed from the HOX-C cluster, has been shown to recruit PRC2 and lysine (K)-specific demethylase 1A (LSD1) to the HOX-D cluster by recognising a specific GA-rich DNA motif [85], maintaining repression over a 40-kb region of the chromosome [84,86,87]. Similarly, XIST also recruits PRC2 [88]. Indeed a genomewide RNA immunoprecipitation experiment suggests that as many as 20% of lncRNAs across various human cell types may be associated with PRC2, suggesting that this is a general mechanism [89]. Another HOX cluster lncRNA, HOX-A transcript at the distal tip (HOTTIP), is transcribed from the 50 end of the HOX-A cluster and targets a methylation complex to the locus required for expression of a number of HOX-A genes [90]. There are many more examples of this kind of interaction [23].

Details on the precise mechanisms underlying these activities are unclear. It seems that the lncRNA must serve two functions: (1) it must bind to a protein or protein complex, or at least facilitate the formation of a complex, perhaps acting as a scaffold [87]; and (2) the lncRNA must be able to target this complex to a specific DNA sequence. Thus, the function of the lncRNA is to provide specificity to the chromatin modifying enzymes, acting as a tethered scaffold (Figure 3).

Histone modifications have been demonstrated to be able to influence alternative splicing of mRNA, acting through an adaptor complex consisting of a chromatin-reading protein linked to RNAPII and the splicing machinery [91]. For example, a different splice form of the gene encodingfibroblast growth factor receptor 2 (FGFR2) is expressed in epithelial cells compared to that expressed in mesenchymal cells. It has been shown that differential splicing is dependent on the methylation state of histone H3K36 and H3K27 at the FGFR2 locus [91]. An antisense lncRNA to FGFR2, asFGFR2, is expressed in epithelial cells and acts to repress mesenchymaltype splicing by recruiting PRC2 and the histone demethylase lysine-specific demethylase 2a (KDM2a) to the locus [92].Figure 3 Models of transcriptional regulationIn the bridging scaffold model (A), activating RNAs (red line) are transcribed from enhancer-like non-coding genes and are required to recruit the mediator complex and to mediate chromatin conformational changes bridging the enhancer-like non-coding gene and the promoter of a coding gene. In the tethered scaffold model (B), lncRNA (red line) recognises specific DNA motifs and recruits histone modifying enzymes such as PRC2 to the locus. lncRNA, long non-coding RNA; PRC2, polycomb repressive complex 2; RNAPII, RNA polymerase II.Concluding remarks

In this review we have described the highly diverse biological functions of lncRNAs, reflecting the versatility of the RNA molecule itself. We have proposed a general model in which these functions may broadly be divided into those representing an interaction with miRNA networks in order to regulate gene expression at the post-transcriptional level and those representing an interaction of lncRNA with enhancers, promoters, and chromatin-modifying complexes to regulate gene expression at the transcriptional level. Some lncRNAs such as H19 appear to act at multiple levels of gene regulation. It remains to be seen whether this is a general phenomenon. Indeed, it should be noted that much of our current understanding of lncRNA comes from studies of a small number of lncRNAs and it is presently unclear whether these are representative of the group as a whole. One thing that seems likely is that as we begin to understand these molecules better, the rather crude classification into long and short ncRNAs will need to be refined to better reflect their diverse functions.

Any review on this subject will be always incomplete for the simple reason that the field is rapidly expanding. In addition to the mechanisms described here, lncRNA has also been implicated in the regulation of mRNA splicing by modulating the levels of serine/arginine splicing factors within nuclear speckles [93]. An antisense lncRNA overlapping the start codon of ubiquitin carboxy-terminal hydrolase L1 (UCHL1) has been shown to positively regulate translation [94]. These lncRNAs consist of two domains: one domain overlapping the coding gene, which confers target specificity, and the other domain containing a SINEB2 repeat element that seems to recruit polysomes [94]. In contrast, lincRNA-p21 inhibits translation by recruiting the translational repressor Rck [95]. Finally, the lncRNA growth arrest-specific 5 (GAS5) has been shown to act as a DNA mimic, sequestering the glucocorticoid receptor, and acting as a decoy to prevent its binding to a DNA motif in target genes [96] in an interesting parallel to the sequestration of miRNAs described above.

While we have made great strides in our understanding of lncRNA, we are still only at the start of this road. Many challenges must be overcome. One difficulty facing thefield is the low level of cross-species conservation of the many lncRNAs, which makes the use of model organisms such as mice orfish difficult. This creates challenges to the understanding of lncRNA functions in whole body processes such as embryonic development and complex diseases. This lack of primary sequence conservation could be interpreted to mean a lack of evolutionary constraints on lncRNA. However, some have argued that we should not apply the same criteria for sequence conservation to lncRNA as we do to coding genes because lack of sequence conservation does not in itself indicate lack of functional conservation [97] and evolution may act on lncRNA at another level [98]. For example, the lncRNA HOTAIR shows poor primary sequence conservation between humans and mice, yet both its function and its genomic location within the HOX-C cluster are conserved [86].

In fact, there is some evidence that selection may act on lncRNA at the structural level rather than at the level of the primary sequence [99,100]. This raises the possibility that functional orthologues may be studied in model organisms. For this reason, methods to determine the structure of lncRNA will become increasingly important. Traditionally RNA secondary structures have been determined using either nuclease digestion, in which enzymes specifically cleave either single or double stranded RNA, or with the use of chemicals to modify exposed nucleotides [101]. Such methods have been successful in determining the structure of individual lncRNAs such as HOTAIR [102]. However we need tofind ways to adapt these methods for high-throughput genome-wide applications. Early attempts to do just this include parallel analysis of RNA structure (PARS) [103] and FraqSeq [104], in which RNA sequencing is used to analyse the resulting products of nuclease digestion.

Another important line for future research will be to determine the binding partners of lncRNA. Many studies take a protein-centric approach to this problem and use an antibody to purify all lncRNAs associated with a particular protein. This is useful in identifying that many lncRNAs associate with the PRC2 complex [89]. However, in some cases it is more helpful to look at this problem from an RNA-centric viewpoint. A useful tool for such studies is the S1 tag, an RNA motif that mimics the structure of biotin and may thus be puri-fied from cell extracts using streptavidin [105,106]. The S1 tag is short and so may be used to tag lncRNA molecules in a similar manner to protein tags such as HA and V5. This method has been successfully employed to identify binding partners of the lncRNA H19 [53].

One of the exciting prospects arising from our increased understanding of the lncRNAfield is the possibility of understanding the roles of lnRNA in human disease. GWAS analyses have in the past tended to focus only on the coding part of the genome, although it has been known for some time that certain diseases show a strong association with regions of the genome lacking coding genes. For example, multiple SNPs linked to a number of cancers map to a gene desert surrounding the oncogene MYC at the 8q24 locus [107]. As it becomes clear that lncRNA is both abundant within the genome and that it has a distinct biological role, it is logical to suggest that mutations within these sequences will be found to have clinical implications. Indeed this seems to be the case and a number of lncRNAs are now known to be transcribed from 8q24 and to regulate the expression of MYC [108]. Similarly in heart disease, SNPs mapped to a number of lncRNAs including myocardial infarction associated transcript (MIAT) [109], antisense non-coding RNA in the INK4 locus (ANRIL) [110], and the aforementioned H19 [111] have been shown to have a disease association. The challenge facing such research will be to predict which of the many variants within the population are functionally significant, a question that can only be answered by functional studies such as those described in this review.

Competing interests

The authors declare that there are no conflicts of interests.

Acknowledgments

Funding was provided by the British Heart Foundation, UK (Grant No. CH/15/1/31199).

References

[1] Johnson JM, Edwards S, Shoemaker D, Schadt EE. Dark matter in the genome: evidence of widespread transcription detected by microarray tiling experiments. Trends Genet 2005;21:93–102.

[2] Kapranov P, St Laurent G. Dark matter RNA: existence,function, and controversy. Front Genet 2012;3:60.

[3] Hangauer MJ, Vaughn IW, McManus MT. Pervasive transcrip-tion of the human genome produces thousands of previously unidentified long intergenic noncoding RNAs. PLoS Genet 2013;9:e1003569.

[4] Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP,Collins FS, et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A 2009;106:9362–7.

[5] St Laurent G, Vyatkin Y, Kapranov P. Dark matter RNA illuminates the puzzle of genome-wide association studies. BMC Med 2014;12:97.

[6] Kapranov P, Cawley SE, Drenkow J, Bekiranov S, Strausberg RL, Fodor SP, et al. Large-scale transcriptional activity in chromosomes 21 and 22. Science 2002;296:916–9.

[7] Djebali S, Davis CA, Merkel A, Dobin A, Lassmann T,Mortazavi A, et al. Landscape of transcription in human cells. Nature 2012;489:101–8.

[8] de Hoon M, Shin JW, Carninci P. Paradigm shifts in genomics through the FANTOM projects. Mamm Genome 2015;26:391–402.

[9] Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell 2004;116:281–97.

[10] Winter J, Jung S, Keller S, Gregory RI, Diederichs S. Many roads to maturity: microRNA biogenesis pathways and their regulation. Nat Cell Biol 2009;11:228–34.

[11] Siomi MC, Sato K, Pezic D, Aravin AA. PIWI-interacting small RNAs: the vanguard of genome defence. Nat Rev Mol Cell Biol 2011;12:246–58.

[12] Guttman M, Amit I, Garber M, French C, Lin MF, Feldser D,et al. Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature 2009;458:223–7.

[13] Milligan MJ, Lipovich L. Pseudogene-derived lncRNAs: emerg-ing regulators of gene expression. Front Genet 2014;5:476.

[14] Chen J, Sun M, Kent WJ, Huang X, Xie H, Wang W, et al. Over 20% of human transcripts might form sense-antisense pairs. Nucleic Acids Res 2004;32:4812–20.

[15] Pandey RR, Mondal T, Mohammad F, Enroth S, Redrup L,Komorowski J, et al. Kcnq1ot1 antisense noncoding RNA mediates lineage-specific transcriptional silencing through chromatin-level regulation. Mol Cell 2008;32:232–46.

[16] Pastori C, Peschansky VJ, Barbouth D, Mehta A, Silva JP,Wahlestedt C. Comprehensive analysis of the transcriptional landscape of the human FMR1 gene reveals two new long noncoding RNAs differentially expressed in Fragile X syndrome and Fragile X-associated tremor/ataxia syndrome. Hum Genet 2014;133:59–67.

[17] Salzman J. Circular RNA expression: its potential regulation and function. Trends Genet 2016;32:309–16.

[18] Salzman J, Gawad C, Wang PL, Lacayo N, Brown PO. Circular RNAs are the predominant transcript isoform from hundreds of human genes in diverse cell types. PLoS One 2012;7:e30733.

[19] Mattick JS, Makunin IV. Non-coding RNA. Hum Mol Genet 2006;15:R17–29.

[20] Kampa D, Cheng J, Kapranov P, Yamanaka M, Brubaker S,Cawley S, et al. Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22. Genome Res 2004;14:331–42.

[21] Cloonan N, Grimmond SM. Transcriptome content and dynam-ics at single-nucleotide resolution. Genome Biol 2008;9:234.

[22] Palazzo AF, Lee ES. Non-coding RNA: what is functional and what is junk? Front Genet 2015;6:2.

[23] Geisler S, Coller J. RNA in unexpected places: long non-coding RNA functions in diverse cellular contexts. Nat Rev Mol Cell Biol 2013;14:699–712.

[24] Beermann J, Piccoli MT, Viereck J, Thum T. Non-coding RNAs in development and disease: background, mechanisms, and therapeutic approaches. Physiol Rev 2016;96:1297–325.

[25] Rinn JL, Chang HY. Genome regulation by long noncoding RNAs. Annu Rev Biochem 2012;81:145–66.

[26] Nagano T, Fraser P. No-nonsense functions for long noncoding RNAs. Cell 2011;145:178–81.

[27] Zeng Y. Principles of micro-RNA production and maturation.Oncogene 2006;25:6156–62.

[28] Kim VN, Han J, Siomi MC. Biogenesis of small RNAs in animals. Nat Rev Mol Cell Biol 2009;10:126–39.

[29] Saini HK, Griffiths-Jones S, Enright AJ. Genomic analysis of human microRNA transcripts. Proc Natl Acad Sci U S A 2007;104:17719–24.

[30] Calin GA, Dumitru CD, Shimizu M, Bichi R, Zupo S, Noch E,et al. Frequent deletions and down-regulation of micro-RNA genes miR15 and miR16 at 13q14 in chronic lymphocytic leukemia. Proc Natl Acad Sci U S A 2002;99:15524–9.

[31] Lerner M, Harada M, Loven J, Castro J, Davis Z, Oscier D,et al. DLEU2, frequently deleted in malignancy, functions as a critical host gene of the cell cycle inhibitory microRNAs miR15a and miR-16-1. Exp Cell Res 2009;315:2941–52.

[32] Kasar S, Underbayev C, Yuan Y, Hanlon M, Aly S, Khan H,et al. Therapeutic implications of activation of the host gene (Dleu2) promoter for miR-15a/16-1 in chronic lymphocytic leukemia. Oncogene 2014;33:3307–15.

[33] Morenos L, Chatterton Z, Ng JL, Halemba MS, Parkinson-Bates M, Mechinaud F, et al. Hypermethylation and downregulation of DLEU2 in paediatric acute myeloid leukaemia independent of embedded tumour suppressor miR-15a/16-1. Mol Cancer 2014;13:123.

[34] Augoff K, McCue B, Plow EF, Sossey-Alaoui K. MiR-31 and its host gene lncRNA LOC554202 are regulated by promoter hypermethylation in triple-negative breast cancer. Mol Cancer 2012;11:5.

[35] Slezak-Prochazka I, Kluiver J, de Jong D, Kortman G, Halsema N, Poppema S, et al. Cellular localization and processing of primary transcripts of exonic microRNAs. PLoS One 2013;8: e76647.

[36] Tam W. Identification and characterization of human BIC, a gene on chromosome 21 that encodes a noncoding RNA. Gene 2001;274:157–67.

[37] Elton TS, Selemon H, Elton SM, Parinandi NL. Regulation of the MIR155 host gene in physiological and pathological processes. Gene 2013;532:1–12.

[38] de Pontual L, Yao E, Callier P, Faivre L, Drouin V, Cariou S,et al. Germline deletion of the miR-17 approximately 92 cluster causes skeletal and growth defects in humans. Nat Genet 2011;43:1026–30.

[39] Cai X, Cullen BR. The imprinted H19 noncoding RNA is a primary microRNA precursor. RNA 2007;13:313–6.

[40] Keniry A, Oxley D, Monnier P, Kyba M, Dandolo L, Smits G,et al. The H19 lincRNA is a developmental reservoir of miR-675 that suppresses growth and Igf1r. Nat Cell Biol 2012;14:659–65.

[41] Thomas M, Lieberman J, Lal A. Desperately seeking microRNA targets. Nat Struct Mol Biol 2010;17:1169–74.

[42] Seitz H. Redefining microRNA targets. Curr Biol 2009;19:870–3.

[43] Salmena L, Poliseno L, Tay Y, Kats L, PandolfiPP. A ceRNA hypothesis: the Rosetta Stone of a hidden RNA language, Cell 2011;146:353–8.

[44] Wang Y, Xu Z, Jiang J, Xu C, Kang J, Xiao L, et al.Endogenous miRNA sponge lincRNA-RoR regulates Oct4, Nanog, and Sox2 in human embryonic stem cell self-renewal. Dev Cell 2013;25:69–80.

[45] Kumar SM, Liu S, Lu H, Zhang H, Zhang PJ, Gimotty PA, et al.Acquired cancer stem cell phenotypes through Oct4-mediated dedifferentiation. Oncogene 2012;31:4898–911.

[46] Wang L, Guo ZY, Zhang R, Xin B, Chen R, Zhao J, et al.Pseudogene OCT4-pg4 functions as a natural micro RNA sponge to regulate OCT4 expression by competing for miR-145 in hepatocellular carcinoma. Carcinogenesis 2013;34:1773–81.

[47] An Y, Furber KL, Ji S. Pseudogenes regulate parental gene expression via ceRNA network. J Cell Mol Med 2017;21:185–92.

[48] Poliseno L, Salmena L, Zhang J, Carver B, Haveman WJ,Pandolfi PP. A coding-independent function of gene and pseudogene mRNAs regulates tumour biology. Nature 2010;465:1033–8.

[49] Rutnam ZJ, Du WW, Yang W, Yang X, Yang BB. The pseudogene TUSC2P promotes TUSC2 function by binding multiple microRNAs. Nat Commun 2014;5:2914.

[50] Yang J, Li T, Gao C, Lv X, Liu K, Song H, et al. FOXO1 30 UTR functions as a ceRNA in repressing the metastases of breast cancer cells via regulating miRNA activity. FEBS Lett 2014;588:3218–24.

[51] Thomson DW, Dinger ME. Endogenous microRNA sponges:evidence and controversy. Nat Rev Genet 2016;17:272–83.

[52] Mercer TR, Wilhelm D, Dinger ME, Solda G, Korbie DJ,Glazov EA, et al. Expression of distinct RNAs from 3 untranslated regions. Nucleic Acids Res 2011;39:2393–403.

[53] Kallen AN, Zhou XB, Xu J, Qiao C, Ma J, Yan L, et al. The imprinted H19 lncRNA antagonizes let-7 microRNAs. Mol Cell 2013;52:101–12.

[54] Imig J, Brunschweiger A, Brummer A, Guennewig B, Mittal N,Kishore S, et al. MiR-CLIP capture of a miRNA targetome uncovers a lincRNA H19-miR-106a interaction. Nat Chem Biol 2015;11:107–14.

[55] Hansen TB, Jensen TI, Clausen BH, Bramsen JB, Finsen B,Damgaard CK, et al. Natural RNA circles function as efficient microRNA sponges. Nature 2013;495:384–8.

[56] Hansen TB, Wiklund ED, Bramsen JB, Villadsen SB, Statham AL, Clark SJ, et al. MiRNA-dependent gene silencing involving Ago2-mediated cleavage of a circular antisense RNA. EMBO J 2011;30:4414–22.

[57] Memczak S, Jens M, Elefsinioti A, Torti F, Krueger J, Rybak A,et al. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature 2013;495:333–8.

[58] Park E, Maquat LE. Staufen-mediated mRNA decay. Wiley Interdiscip Rev RNA 2013;4:423–35.

[59] Kim YK, Furic L, Desgroseillers L, Maquat LE. Mammalian Staufen1 recruits Upf1 to specific mRNA 30 UTRs so as to elicit mRNA decay. Cell 2005;120:195–208.

[60] Gong C, Maquat LE. LncRNAs transactivate STAU1-mediated mRNA decay by duplexing with 30 UTRs via Alu elements.Nature 2011;470:284–8.

[61] Kretz M, Siprashvili Z, Chu C, Webster DE, Zehnder A, Qu K,et al. Control of somatic tissue differentiation by the long noncoding RNA TINCR. Nature 2013;493:231–5.

[62] Ong CT, Corces VG. Enhancer function: new insights into the regulation of tissue-specific gene expression. Nat Rev Genet 2011;12:283–93.

[63] Papanayotou C, Benhaddou A, Camus A, Perea-Gomez A,Jouneau A, Mezger V, et al. A novel Nodal enhancer dependent on pluripotency factors and smad2/3 signaling conditions a regulatory switch during epiblast maturation. PLoS Biol 2014;12: e1001890.

[64] Kim TK, Hemberg M, Gray JM, Costa AM, Bear DM, Wu J,et al. Widespread transcription at neuronal activity-regulated enhancers. Nature 2010;465:182–7.

[65] Andersson R, Gebhard C, Miguel-Escalada I, Hoof I, Bornholdt J, Boyd M, et al. An atlas of active enhancers across human cell types and tissues. Nature 2014;507:455–61.

[66] Mousavi K, Zare H, Dell’orso S, Grontved L, Gutierrez-Cruz G,Derfoul A, et al. ERNAs promote transcription by establishing chromatin accessibility at defined genomic loci. Mol Cell 2013;51:606–17.

[67] Lam MT, Cho H, Lesch HP, Gosselin D, Heinz S, Tanaka-Oishi Y, et al. Rev-Erbs repress macrophage gene expression by inhibiting enhancer-directed transcription. Nature 2013;498:511–5.

[68] Li W, Notani D, Ma Q, Tanasa B, Nunez E, Chen AY, et al.Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation. Nature 2013;498:516–20.

[69] Orom UA, Derrien T, Beringer M, Gumireddy K, Gardini A,Bussotti G, et al. Long noncoding RNAs with enhancer-like function in human cells. Cell 2010;143:46–58.

[70] Orom UA, Derrien T, Guigo R, Shiekhattar R. Long noncoding RNAs as enhancers of gene expression. Cold Spring Harb Symp Quant Biol 2010;75:325–31.

[71] Li W, Lam MT, Notani D. Enhancer RNAs. Cell Cycle 2014;13:3151–2.

[72] Lai F, Orom UA, Cesaroni M, Beringer M, Taatjes DJ, Blobel GA, et al. Activating RNAs associate with Mediator to enhance chromatin architecture and transcription. Nature 2013;494:497–501.

[73] Malik S, Roeder RG. The metazoan Mediator co-activator complex as an integrative hub for transcriptional regulation. Nat Rev Genet 2010;11:761–72.

[74] Li Z, Huang C, Bao C, Chen L, Lin M, Wang X, et al. Exon-intron circular RNAs regulate transcription in the nucleus. Nat Struct Mol Biol 2015;22:256–64.

[75] Lee JT, Bartolomei MS. X-inactivation, imprinting, and long noncoding RNAs in health and disease. Cell 2013;152:1308–23.

[76] Zemel S, Bartolomei MS, Tilghman SM. Physical linkage of two mammalian imprinted genes, H19 and insulin-like growth factor 2. Nat Genet 1992;2:61–5.

[77] Gabory A, Jammes H, Dandolo L. The H19 locus: role of an imprinted non-coding RNA in growth and development. BioEssays 2010;32:473–80.

[78] Ripoche MA, Kress C, Poirier F, Dandolo L. Deletion of the H19 transcription unit reveals the existence of a putative imprinting control element. Genes Dev 1997;11:1596–604.

[79] Gabory A, Ripoche MA, Le Digarcher A, Watrin F, Ziyyat A,Forne T, et al. H19 acts as a trans regulator of the imprinted gene network controlling growth in mice. Development 2009;136:3413–21.

[80] Monnier P, Martinet C, Pontis J, Stancheva I, Ait-Si-Ali S,Dandolo L. H19 lncRNA controls gene expression of the Imprinted Gene Network by recruiting MBD1. Proc Natl Acad Sci U S A 2013;110:20693–8.

[81] Nagano T, Mitchell JA, Sanz LA, Pauler FM, Ferguson-Smith AC, Feil R, et al. The Air noncoding RNA epigenetically silences transcription by targeting G9a to chromatin. Science 2008;322:1717–20.

[82] Krumlauf R. Hox genes in vertebrate development. Cell 1994;78:191–201.

[83] Rinn JL, Bondre C, Gladstone HB, Brown PO, Chang HY.Anatomic demarcation by positional variation infibroblast gene expression programs. PLoS Genet 2006;2:e119.

[84] Rinn JL, Kertesz M, Wang JK, Squazzo SL, Xu X, Brugmann SA, et al. Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs. Cell 2007;129:1311–23.

[85] Chu C, Qu K, Zhong FL, Artandi SE, Chang HY. Genomic maps of long noncoding RNA occupancy reveal principles of RNA-chromatin interactions. Mol Cell 2011;44:667–78.

[86] Li L, Liu B, Wapinski OL, Tsai MC, Qu K, Zhang J, et al.Targeted disruption of Hotair leads to homeotic transformation and gene derepression. Cell Rep 2013;5:3–12.

[87] Tsai MC, Manor O, Wan Y, Mosammaparast N, Wang JK, Lan F, et al. Long noncoding RNA as modular scaffold of histone modification complexes. Science 2010;329:689–93.

[88] Sarma K, Cifuentes-Rojas C, Ergun A, Del Rosario A, Jeon Y,White F, et al. ATRX directs binding of PRC2 to Xist RNA and Polycomb targets. Cell 2014;159:869–83.

[89] Khalil AM, Guttman M, Huarte M, Garber M, Raj A, Rivea Morales D, et al. Many human large intergenic noncoding RNAs associate with chromatin-modifying complexes and affect gene expression. Proc Natl Acad Sci U S A 2009;106:11667–72.

[90] Wang KC, Yang YW, Liu B, Sanyal A, Corces-Zimmerman R,Chen Y, et al. A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature 2011;472:120–4.

[91] Luco RF, Pan Q, Tominaga K, Blencowe BJ, Pereira-Smith OM,Misteli T. Regulation of alternative splicing by histone modifications. Science 2010;327:996–1000.

[92] Gonzalez I, Munita R, Agirre E, Dittmer TA, Gysling K, Misteli T, et al. A lncRNA regulates alternative splicing via establishment of a splicing-specific chromatin signature. Nat Struct Mol Biol 2015;22:370–6.

[93] Tripathi V, Ellis JD, Shen Z, Song DY, Pan Q, Watt AT, et al.The nuclear-retained noncoding RNA MALAT1 regulates alternative splicing by modulating SR splicing factor phosphorylation. Mol Cell 2010;39:925–38.

[94] Carrieri C, Cimatti L, Biagioli M, Beugnet A, Zucchelli S, Fedele S, et al. Long non-coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat. Nature 2012;491:454–7.

[95] Yoon JH, Abdelmohsen K, Srikantan S, Yang X, Martindale JL,De S, et al. LincRNA-p21 suppresses target mRNA translation. Mol Cell 2012;47:648–55.

[96] Kino T, Hurt DE, Ichijo T, Nader N, Chrousos GP. Noncoding RNA GAS5 is a growth arrest- and starvation-associated repressor of the glucocorticoid receptor. Sci Signal 2010;3:ra8.

[97] Johnsson P, Lipovich L, Grander D, Morris KV. Evolutionary conservation of long non-coding RNAs; sequence, structure, function. Biochim Biophys Acta 2014;1840:1063–71.

[98] Diederichs S. The four dimensions of noncoding RNA conser-vation. Trends Genet 2014;30:121–3.

[99] Smith MA, Gesell T, Stadler PF, Mattick JS. Widespread purifying selection on RNA structure in mammals. Nucleic Acids Res 2013;41:8220–36.

[100] Pegueroles C, Gabaldon T. Secondary structure impacts patterns of selection in human lncRNAs. BMC Biol 2016;14:60.

[101] Novikova IV, Hennelly SP, Sanbonmatsu KY. Tackling struc-tures of long noncoding RNAs. Int J Mol Sci 2013;14:23672–84.

[102] Somarowthu S, Legiewicz M, Chillon I, Marcia M, Liu F, Pyle AM. HOTAIR forms an intricate and modular secondary structure. Mol Cell 2015;58:353–61.

[103] Wan Y, Qu K, Ouyang Z, Chang HY. Genome-wide mapping of RNA structure using nuclease digestion and high-throughput sequencing. Nat Protoc 2013;8:849–69.

[104] Underwood JG, Uzilov AV, Katzman S, Onodera CS, Mainzer JE, Mathews DH, et al. FragSeq: transcriptome-wide RNA structure probing using high-throughput sequencing. Nat Methods 2010;7:995–1001.

[105] Leppek K, Stoecklin G. An optimized streptavidin-binding RNA aptamer for purification of ribonucleoprotein complexes identi-fies novel ARE-binding proteins. Nucleic Acids Res 2014;42:e13.

[106] Srisawat C, Engelke DR. RNA affinity tags for purification of RNAs and ribonucleoprotein complexes. Methods 2002;26:156–61.

[107] Sotelo J, Esposito D, Duhagon MA, Banfield K, Mehalko J,Liao H, et al. Long-range enhancers on 8q24 regulate c-Myc. Proc Natl Acad Sci U S A 2010;107:3001–5.

[108] Xiang JF, Yang L, Chen LL. The long noncoding RNA regulation at the MYC locus. Curr Opin Genet Dev 2015;33:41–8.

[109] Ishii N, Ozaki K, Sato H, Mizuno H, Saito S, Takahashi A, et al.Identification of a novel non-coding RNA, MIAT, that confers risk of myocardial infarction. J Hum Genet 2006;51:1087–99.

[110] Broadbent HM, Peden JF, Lorkowski S, Goel A, Ongen H,Green F, et al. Susceptibility to coronary artery disease and diabetes is encoded by distinct, tightly linked SNPs in the ANRIL locus on chromosome 9p. Hum Mol Genet 2008;17:806–14.

[111] Gao W, Zhu M, Wang H, Zhao S, Zhao D, Yang Y, et al.Association of polymorphisms in long non-coding RNA H19 with coronary artery disease risk in a Chinese population. Mutat Res 2015;772:15–22.

上一篇回2017年6月第3期目录 下一篇 (方向键翻页,回车键返回目录)加入书签

© 2016 毕业论文网 > Transcriptional and Post-transcriptional Gene Regulation by Long Non-coding RNA