The functional role of long non-coding RNA in human carcinomas
© Gibb et al; licensee BioMed Central Ltd. 2011
Received: 13 January 2011
Accepted: 13 April 2011
Published: 13 April 2011
Long non-coding RNAs (lncRNAs) are emerging as new players in the cancer paradigm demonstrating potential roles in both oncogenic and tumor suppressive pathways. These novel genes are frequently aberrantly expressed in a variety of human cancers, however the biological functions of the vast majority remain unknown. Recently, evidence has begun to accumulate describing the molecular mechanisms by which these RNA species function, providing insight into the functional roles they may play in tumorigenesis. In this review, we highlight the emerging functional role of lncRNAs in human cancer.
- ANRIL :
Antisense Non-coding RNA in the INK4 Locus
cAMP response element binding protein
- DD3 :
Differential Display Code 3
hepatitis B virus
hepatocellular carcinoma cell lines
- HOTAIR :
HOX Antisense Intergenic RNA
- HULC :
Highly Upregulated in Liver Cancer
Insulin-Like Growth Factor 2
long or large intergenic non-coding RNA
long non-coding RNA
long terminal repeat
- MALAT1 :
Metastatis-Associated Lung Adenocarcinoma Transcript 1
- MEG3 :
Maternally Expressed Gene 3
- PCNCR1 :
Prostate Cancer Non-Coding RNA 1
polycomb repressive complex
- PTEN :
phosphatase and tensin homolog
- PTENP1 :
phosphatase and tensin homolog pseudogene 1
- RNA pol II:
RNA polymerase II
- RNA pol III:
RNA polymerase III
serine-arginine rich splicing proteins
transcriptional gene silencing
transcription initiation RNAs
transcribed ultraconserved region
- XIST :
One of modern biology's great surprises was the discovery that the human genome encodes only ~20,000 protein-coding genes, representing <2% of the total genome sequence [1, 2]. However, with the advent of tiling resolution genomic microarrays and whole genome and transcriptome sequencing technologies it was determined that at least 90% of the genome is actively transcribed [3, 4]. The human transcriptome was found to be more complex than a collection of protein-coding genes and their splice variants; showing extensive antisense, overlapping and non-coding RNA (ncRNA) expression [5–10]. Although initially argued to be spurious transcriptional noise, recent evidence suggests that the proverbial "dark matter" of the genome may play a major biological role in cellular development and metabolism [11–17]. One such player, the newly discovered long non-coding RNA (lncRNA) genes, demonstrate developmental and tissue specific expression patterns, and aberrant regulation in a variety of diseases, including cancer [18–27].
Types of human non-coding RNAs
Ribosomal 5S and 5.8S RNAs
Piwi interacting RNAs
Tiny transcription initiation RNAs
Small interfering RNAs
Promoter-associated short RNAs
Termini-associated short RNAs
Antisense termini associated short RNAs
Small ncRNA (18 to 200 nt in size)
Small nucleolar RNAs
Transcription start site antisense RNAs
Small nuclear RNAs
Human Y RNA
Unusually small RNAs
Small NF90-associated RNAs
Ribosomal 18S and 28S RNAs
Long or large intergenic ncRNAs
Transcribed ultraconserved regions
GAA-repeat containing RNAs
Long ncRNA (lncRNAs, 200 nt to >100 kb in size)
Long intronic ncRNAs
Promoter-associated long RNAs
Promoter upstream transcripts
Stable excised intron RNAs
Long stress-induced non-coding transcripts
In contrast to miRNAs, lncRNAs, the focus of this article, are mRNA-like transcripts ranging in length from 200 nt to ~100 kilobases (kb) lacking significant open reading frames. Many identified lncRNAs are transcribed by RNA polymerase II (RNA pol II) and are polyadenylated, but this is not a fast rule [47, 48]. There are examples of lncRNAs, such as the antisense asOct4-pg5 or the brain-associated BC200, which are functional, but not polyadenylated [49–51]. Generally, lncRNA expression levels appear to be lower than protein-coding genes [52–55], and some lncRNAs are preferentially expressed in specific tissues . However, recent findings have suggested novel lncRNAs may contribute a significant portion of the aforementioned 'dark matter' of the human transcriptome [56, 57]. In an exciting report by Kapranov et.al., it was revealed the bulk of the relative mass of RNA in a human cell, exclusive of the ribosomal and mitochondrial RNA, is represented by non-coding transcripts with no known function .
Publically available long non-coding RNA online databases
Background on Long Non-Coding RNA
Nomenclature and Classification of LncRNA
The definition 'non-coding RNA' is typically used to describe transcripts where sequence analysis has failed to identify an open reading frame. However, one should exercise caution when exploring putative non-coding transcripts, as there are cases where 'non-coding' transcripts were found to encode short, functional peptides . Currently, a universal classification scheme to define lncRNAs does not exist and therefore there are numerous synonyms describing the same type of transcripts. Terms such as large non-coding RNA, mRNA-like long RNA, and intergenic RNA all define cellular RNAs, exclusive of rRNAs, greater than 200 nt in length and having no obvious protein-coding capacity . Consequently, this has led to confusion in the literature as to exactly which transcripts should constitute a lncRNA. For example, one subclass of lncRNAs is called large or long intergenic ncRNAs (lincRNAs). These lncRNAs are exclusively intergenic and are marked by a chromatin signature indicative of transcription [8, 58]. To further compound classification confusion, RNA species that are bifunctional preclude categorization into either group of protein-coding or ncRNAs as their transcripts function both at the RNA and protein levels . In these rare cases, they are classified on a case-by-case basis, reserving the term 'lncRNA' to describe transcripts with no protein-coding capacity. In the meantime, and for the purposes of this review, we will consider lncRNAs as a blanket term to encompass mRNA-like ncRNAs, lincRNAs, as well as antisense and intron-encoded transcripts, T-UCRs and transcribed pseudogenes.
Discovery of LncRNAs
Emerging Role of Long Non-Coding RNA in Tumorigenesis
Human cancer-associated lncRNAs
Breast, lung, uterus, pancreas, colon, prostate, liver, cervix1, neuroblastoma1, osteosarcoma
Liver, hepatic colorectal metastasis
Breast, cervix, esophagus, lung, ovary, parotid, tongue
Bladder, lung, liver, breast, endometrial, cervix, esophagus, ovary, prostate, choricarcinoma, colorectal
1.4 kb, 2.2 kb, 2.7 kb
Bladder, colon, cervix, lung, thyroid, liver, breast, esophagus, stomach
0.6 kb, 2 kb, 4 kb
TUC338 (encodes uc.338)
34.8 kb & splice variants
1.6 kb & splicing isoforms
Breast, uterus, ovary (hormone responsive tissue)
2186 nt, 2087 nt
Imprinted lncRNA genes
Imprinting is a process whereby the copy of a gene inherited from one parent is epigenetically silenced [87, 88]. Intriguingly, imprinted regions often include multiple maternal and paternally expressed genes with a high frequency of ncRNA genes. The imprinted ncRNA genes are implicated in the imprinting of the region by a variety of mechanisms including enhancer competition and chromatin remodeling . A key feature of cancer is the loss of this imprinting resulting in altered gene expression [90, 91]. Two of the best known imprinted genes are in fact lncRNAs.
The H19 gene encodes a 2.3 kb lncRNA that is expressed exclusively from the maternal allele. H19 and its reciprocally imprinted protein-coding neighbor the Insulin-Like Growth Factor 2 or IGF2 gene at 11p15.5 were among the first genes, non-coding or otherwise, found to demonstrate genomic imprinting [74, 92]. The expression of H19 is high during vertebrate embryo development, but is downregulated in most tissues shortly after birth with the exception of skeletal tissue and cartilage [20, 93, 94]. Loss of imprinting and subsequent strong gene expression has been well-documented in human cancers. Likewise, loss of imprinting at the H19 locus resulted in high H19 expression in cancers of the esophagus, colon, liver, bladder and with hepatic metastases [95–97].
H19 has been implicated as having both oncogenic and tumor suppression properties in cancer. H19 is upregulated in a number of human cancers, including hepatocellular, bladder and breast carcinomas, suggesting an oncogenic function for this lncRNA [97–99]. In colon cancer H19 was shown to be directly activated by the oncogenic transcription factor c-Myc, suggesting H19 may be an intermediate functionary between c-Myc and downstream gene expression . Conversely, the tumor suppressor gene and transcriptional activator p53 has been shown to down-regulate H19 expression [100, 101]. H19 transcripts also serve as a precursor for miR-675, a miRNA involved in the regulation of developmental genes . miR-675 is processed from the first exon of H19 and functionally downregulates the tumor suppressor gene retinoblastoma (RB1) in human colorectal cancer, further implying an oncogenic role for H19 .
There is evidence suggesting H19 may also play a role in tumor suppression [104, 105]. Using a mouse model for colorectal cancer, it was shown that mice lacking H19 manifested an increased polyp count compared to wild-type . Secondly, a mouse teratocarcinoma model demonstrated larger tumor growth when the embryo lacked H19, and finally in a hepatocarcinoma model, mice developed cancer much earlier when H19 was absent . The discrepancy as to whether H19 has oncogenic or tumor suppressive potential may be due in part to the bifunctional nature of the lncRNA or may be context dependent. In either case, the precise functional and biological role of H19 remains to be determined.
XIST - X-inactive-specific transcript
The 17 kb lncRNA XIST is arguably an archetype for the study of functional lncRNAs in mammalian cells, having been studied for nearly two decades. In female cells, the XIST transcript plays a critical role in X-chromosome inactivation by physically coating one of the two X-chromosomes, and is necessary for the cis-inactivation of the over one thousand X-linked genes [75, 108–110]. Like the lncRNAs HOTAIR and ANRIL, XIST associates with polycomb-repressor proteins, suggesting a common pathway of inducing silencing utilized by diverse lncRNAs. In mice, X inactivation in the extraembryonic tissues is non-random, and the initial expression of Xist is always paternal in origin, followed later by random X inactivation in the epiblast associated with random mono-allelic expression . Intriguingly, regulation of Xist in mouse has been shown to be controlled by interactions amongst additional ncRNAs, including an antisense transcript, Tsix, enhancer-associated ncRNAs (Xite) and the upstream Jpx and Ftx lncRNAs [109, 112–115]. Intriguingly, murine Xist and Tsix duplexes are processed into small RNAs by an apparently Dicer-dependent manner, suggesting a significant overlap between the regulatory networks of both lncRNAs and sRNAs . It is unclear, however, how much of this regulation is conserved in humans, who do not show imprinted X inactivation . While XIST expression levels are correlated with outcome in some cancers, such as the therapeutic response in ovarian cancer , the actual role that XIST may play in human carcinomas, if any, is not entirely clear.
There is generally believed to be only a limited developmental window in which X inactivation can occur, and loss of XIST from an inactive X chromosome does not result in reactivation of the X chromosome [119, 120]. Thus, tumors with additional X chromosomes generally keep the inactivation status of the duplicated X. For tumors in which two active X chromosomes are observed, as has been frequently observed in breast cancer, the most common mechanism involves loss of the inactive X and duplication of the active X, often resulting in heterogeneous XIST expression in these tumors [121–123]. Loss of the inactive X and abnormalities of XIST expression may be indicative of more general heterochromatin defects . Analogously, failure to properly reset XIST and X inactivation may serve as a marker of proper resetting of epigenetic marks in stem cells or induced pluripotent stem cells [125, 126]. Correlations between XIST expression and cancer state can also occur spuriously. Notably, in cytogenetically normal cells, XIST is found only in females as males do not have an inactive X. Therefore in cancers correlations can be observed with XIST expression due to differential presence of male or female samples, and many cancers do show different onsets and progressions in males and females. Furthermore, XIST expression will increase with the number of inactive X chromosomes. While it might be anticipated that there would be little advantage to a tumor acquiring inactive chromosomes, it has been shown that approximately 15% of human X-linked genes continue to be expressed from the inactive X chromosome .
Involvement in metastasis
HOTAIR - HOX antisense intergenic RNA
Several lncRNAs have been implicated in metastasis. One of the first lncRNAs described to have a fundamental role in cancer was the metastasis-associated HOX Antisense Intergenic RNA (HOTAIR), a 2.2 kb gene located in the mammalian HOXC locus on chromosome 12q13.13 . This lncRNA was found to be highly upregulated in both primary and metastatic breast tumors, demonstrating up to 2000-fold increased transcription over normal breast tissue . High levels of HOTAIR expression were found to be correlated with both metastasis and poor survival rate, linking a ncRNA with cancer invasiveness and patient prognosis . Furthermore, it was demonstrated that if cells expressing HOTAIR were grafted into mouse mammary fat pads, a modest increase in the rate of primary tumor growth was observed . Interestingly, there are reports indicating that numerous lncRNAs are transcribed from the HOX locus, suggesting that HOTAIR may be only one example of a global regulatory phenomena .
HOTAIR binding results in a genome-wide re-targeting of the PRC2 complex. The HOXD locus on chromosome 2 is a PRC2 target (Figure 3). The consequence of PRC2/HOTAIR localization is the transcriptional silencing of a 40 kb region of the HOXD locus which remodels the gene expression pattern of breast epithelial cells to more closely resemble that of embryonic fibroblasts . The HOTAIR RNA appears to act as a molecular scaffold, binding at least two distinct histone modification complexes. The 5' region of the RNA binds the PRC2 complex responsible for H3K27 methylation, while the 3' region of HOTAIR binds LSD1, a histone lysine demethylase that mediates enzymatic demethylation of H3K4Me2 [128, 134]. Although, the precise mechanism of HOTAIR activities remains to be elucidated, it is clear that HOTAIR reprograms chromatin state to promote cancer metastasis.
MALAT1 - Metastasis-associated lung adenocarcinoma transcript 1
The MALAT1 gene, or metastasis-associated lung adenocarcinoma transcript 1, was first associated with high metastatic potential and poor patient prognosis during a comparative screen of non-small cell lung cancer patients with and without metastatic tumors . This lncRNA is widely expressed in normal human tissues [135, 136] and is found to be upregulated in a variety of human cancers of the breast, prostate, colon, liver and uterus [27, 137–139]. Notably, the MALAT1 locus at 11q13.1 has been identified to harbor chromosomal translocation breakpoints associated with cancer [140–142].
Following the correlation between high levels of MALAT1 expression and metastasis, a number of studies have implicated MALAT1 in the regulation of cell mobility. For example, RNA interference (RNAi)-mediated silencing of MALAT1 impaired the in vitro migration of lung adenocarcinoma cells through concomitant regulation of motility regulated genes via transcriptional and/or post-transcriptional means . Similarly, short hairpin RNA inhibition of MALAT1 reduced cell proliferation and invasive potential of a cervical cancer cell line . Collectively, these studies suggest that MALAT1 regulates the invasive potential of metastatic tumor cells. Other examples of cancer-associated lncRNAs subject to post-transcriptional processing include the miR-155 host gene BIC and the miR-17-92 cluster in B-cell lymphoma and neuroblastoma, respectively [153, 154].
Recent efforts have focused on elucidating the molecular role of nuclear speckle localized MALAT1 transcripts, while the function for the mascRNA has yet to be determined. Nuclear speckles are thought to be involved in the assembly, modification and/or storage of pre-mRNA processing machinery [136, 155]. A recent publication revealed MALAT1 RNA strongly associates with serine-arginine rich splicing factor (SR) proteins which are involved in both constitutive and alternative splicing and the levels of MALAT1 regulated the cellular levels of phosphorylated SR proteins . These findings imply that the lncRNA MALAT1 may serve a function in the regulation of alternate splicing by modulating the activity of SR proteins, although precisely how this may contribute to tumorigenesis remains unknown.
LncRNA-mediated tumor suppression by P53 stimulation
MEG3 - Maternally expressed gene 3
The maternally expressed gene 3 (MEG3) was the first lncRNA proposed to function as a tumor suppressor. The MEG3 gene is expressed in many normal human tissues, with the highest expression in the brain and pituitary gland [156, 157]. MEG3 expression was not detectable in various brain cancers, nor in a range of human cancer cell lines implicating a potential role of this lncRNA in suppression of cell growth. Moreover, ectopic expression of MEG3 RNA was found to suppress the growth of several human cancer cell lines, further supporting the role of MEG3 as a tumor suppressor . In clinically nonfunctioning pituitary tumors, it was demonstrated that hypermethylation of the MEG3 regulatory region was associated with the loss of MEG3 expression, providing evidence for a mechanism of MEG3 inactivation .
MEG3 is a paternally imprinted, single copy gene comprised of 10 exons . To date, 12 MEG3 isoforms have been detected (MEG3; a-3k) due to alternative splicing . Each isoform contains the common exons 1-3 and 8-10, but varies in the combination of exons 4-7 in the middle of the transcript . Notably, the last intron of MEG3 encodes the evolutionarily conserved miR-770, a miRNA with a number of putative mRNA targets . This arrangement mirrors the presence of miR-675 in H19, except in that instance the miRNA was exon encoded . The originally identified isoform of MEG3 is expressed as a 1.6 kb polyadenylated transcript that is localized to the nucleus where it is associated with chromatin although some cytoplasmic MEG3 transcripts have been detected [156, 160, 161]. All 12 MEG3 isoforms demonstrate three distinct secondary folding motifs designated M1, M2 and M3 .
Functionally, MEG3 has been implicated as a top-level regulatory RNA due to its ability to stimulate both p53-dependent and p53-independent pathways [158, 162]. Critically, the MEG3-mediated functional activation of p53 is dependent on the secondary structure of the MEG3 RNA rather than on primary sequence conservation. In an elegant series of experiments, Zhang et al. demonstrated that replacing particular regions of the MEG3 with unrelated sequence had no effect on the activation of p53 provided the original secondary structure was preserved . This observation strengthens the argument that the rapid evolution and relative lack of sequence conservation does not limit the potential functionality of these unique RNA species . Conversely, the novel lincRNA-p21 has been described as a downstream repressor in the p53 transcriptional response, suggesting the complex p53 transcriptional network includes numerous regulatory lncRNAs .
Depletion of miRNAs by a lncRNA 'molecular decoy' or 'miRNA sponge'
Expression of miRNAs in cancer can be deregulated by a range of mechanisms, including copy number alterations and epigenetic silencing [165–168]. Two recent examples have demonstrated that lncRNAs can act as natural 'miRNA sponges' to reduce miRNA levels .
HULC - Highly Upregulated in Liver Cancer
The most highly upregulated transcript found in a microarray-based study of gene expression in hepatocellular carcinoma was determined to be the ncRNA HULC, or Highly Upregulated in Liver Cancer . Transcribed from chromosome 6p24.3, this lncRNA demonstrates the hallmarks of a typical mRNA molecule, including a single spliced GT-AG intron, canonical polyadenylation signals upstream of the poly(A) tail and nuclear export demonstrating strong localization to the cytoplasm. Although HULC was found to co-purify with ribosomes, no translation product for this lncRNA has been detected, supporting its classification as a non-coding transcript . In addition to liver cancer, HULC was found to be highly upregulated in hepatic colorectal cancer metastasis and in hepatocellular carcinoma cell lines (HCC) producing hepatitis B virus (HBV) .
Pseudogene pairs: PTEN and PTENP1
The discovery that HULC can act as a molecular decoy or 'miRNA sponge' mirrors a recent report describing an intricate relationship between the tumor suppressor phosphatase and tensin homolog (PTEN) and the matching lncRNA phosphatase and tensin homolog pseudogene 1 (PTENP1) . The RNA transcripts of the PTEN/PTENP1 pair share similar 3' untranslated regions (3'UTRs) which both bind the same miRNAs. By binding miRNAs, PTENP1 transcripts reduce the effects of translational repression on PTEN therefore allowing expression of this tumor suppressor. In cancer, specific mutations inactivate these miRNA binding sites in PTENP1, therefore reducing the translation of PTEN and promoting tumor growth . This is especially pertinent as subtle changes in PTEN levels can influence cancer susceptibility .
LncRNA-mediated deregulation of the tumor suppressors
ANRIL - Antisense Non-coding RNA in the INK4 Locus
Located as part of the 42kb INK4b-ARF-INK4a locus on chromosome 9p21.3, the Antisense Non-coding RNA in the INK4 Locus (ANRIL) is transcribed by RNA pol II and processed into alternatively spliced isoforms, including an unspliced transcript of 34.8 kb termed p15AS [175, 176]. The INK4b-ARF-INKa locus has an important role in cell cycle control, cell senescence, stem cell renewal and apoptosis through P14ARF-MDM2-P53 and P16Ink4a/p15Ink4b-Cdk4/6-pRb pathways [177–179]. A mouse model suggests that the well-characterized tumor suppressor genes encoded within this locus are regulated by Polycomb proteins .
Aberrant expression and single nucleotide polymorphisms (SNPs) within ANRIL have been associated with susceptibility to a range of human diseases, including cancer [181, 182]. Moreover, the INK4b-ARF-INK4a locus is subject to frequent deletion or hypermethylation in cancers, including leukemia, melanoma, lung and bladder cancers . ANRIL has been associated with epigenetic silencing of the tumor suppressor gene p15, although the molecular events leading to this silencing were unclear .
Evidence has suggested an intriguing mechanisms for ANRIL-mediated silencing of the INK4b-ARF-INK4a locus. Recently an interaction between chromatin modifying PRC complexes and ANRIL has been described, further elucidating the regulatory mechanism of this lncRNA. Like the lncRNA HOTAIR which binds both the polycomb repressor complex PRC2 and the LSD1 complex, ANRIL binds and recruits two polycomb repressor complexes modifying complexes, PRC1 and PRC2 [183, 184]. This resulted in ANRIL/PRC mediated silencing of the genes in the INK4b-ARF-INK4a locus. While the distinct regions of the HOTAIR lncRNA required for interactions with each protein complex were determined , future studies will be necessary to elucidate the structural requirements of lncRNAs such as ANRIL with chromatin regulators such as PRC1/PRC2.
The molecular details of ANRIL-mediated tumor suppression are becoming more clear. However, other mechanisms of lncRNA-mediated suppression of tumor suppressor genes have been reported. For example, a change in the expression ratio of bidirectional genes has been shown to mediate the expression of the tumor suppressor p21 . Collectively, these observations suggest that lncRNA-mediated silencing of tumor suppressor genes may be a major mechanism driving tumorigenesis.
Cancer type specific lncRNA expression
Many of the described lncRNAs are expressed in a variety of cancers, however a select few thus far have been associated with a single cancer type. HOTAIR, for example, has only been described in breast cancer, while three lncRNAs PCGEM1, DD3 and PCNCR1 have been associated solely with prostate cancer [68, 186, 187]. The most recently described of these, Prostate Cancer Non-Coding RNA 1 (PCNCR1) lncRNA, was identified in a 'gene desert' on chromosome 8q24.2 and is associated with susceptibility to prostate cancer. PCNCR1 is expressed as an intronless, ~13 kb transcript with a potential role in trans-activation of androgen receptor (AR), a key player in prostate cancer progression . Likewise, PCGEM1 (Prostate Specific Gene 1) was found to have properties supporting tumorigenesis, as ectopic overexpression of PCGEM1 RNA resulted in increased cell growth and colony formation in cell lines . The lncRNA Differential Display Code 3 (DD3) is also highly over expressed in prostate cancer, yet little is known about the role DD3 may play in prostate cancer progression [189, 190]. Finally, the liver associated lncRNA HULC is highly expressed in primary liver tumors, and in colorectal carcinomas that metastasized to the liver, but not in the primary colon tumors or in non-liver metastases .
RNA polymerase III transcription of lncRNA
The lncRNAs described thus far are products of RNA pol II transcription, yet many ncRNAs are transcribed by RNA polymerase III (RNA pol III) . Importantly, RNA pol III is frequently deregulated in cancer cells resulting in increased activity [192, 193]. The molecular mechanisms driving increased RNA pol III activity in tumor cells include overexpression of RNA pol III transcription factors, escape from RNA pol III repressors and direct oncogene-mediated activation [193–195]. Aberrant RNA pol III function may have consequences to the expression of lncRNAs transcribed by this polymerase.
For example, the lncRNA BC200 is a small cytoplasmic lncRNA in the neurons of primate nervous systems and human cancers, but not in non-neuronal organs [20, 50, 51, 196, 197]. Unlike the majority of lncRNAs described thus far, BC200 is transcribed by RNA pol III and shares unique homology with human Alu elements [196, 198]. Similarly, the lncRNA HULC also shares homology with mobile DNA, in this case with a long terminal repeat (LTR) retroelement . The BC200 RNA has been characterized as a negative regulator of eIF4A-dependent translation initiation .
Due to the fact that many whole transcriptome sequencing methods were developed to enrich for poly(A) purified transcripts, RNA pol III transcripts may have been excluded from analysis. This suggests that other, yet unidentified RNA pol III lncRNAs over-expressed in cancer may be participating in tumorigenesis.
Aberrant T-UCR expression in human carcinoma
The high degree of conservation of T-UCRs, combined with their tissue-specific expression, suggests these ncRNAs may play a critical role in cellular metabolism and development . The expression of many T-UCRs is significantly altered in cancer, notably in adult chronic lymphocytic leukemias, colorectal and hepatocellular carcinomas and neuroblastomas [85, 200]. Their aberrant transcription profiles can be used to differentiate types of human cancers and have been linked to patient outcome . Some of the T-UCRs reside in genomic regions associated with specific types of cancer . For example, in colon cancer, the T-UCR uc.73A is one of the most highly upregulated T-UCRs, and this lncRNA has been found to show oncogenic properties by proliferation assays . Similarly, T-UCR uc.338 is significantly upregulated in human hepatocarcinoma tumor and cell lines, and uc.338 was found to be part of a larger transcriptional unit coined TUC338 which is involved in cell growth .
The mechanism by which T-UCRs are differentially expressed is unclear. A study profiling T-UCRs in neuroblastomas did not find a consistent association between genomic alterations and T-UCR expression, suggesting aberrant T-UCR expression may be a consequence of epigenetic mechanisms. Like miRNAs and coding genes, T-UCR expression has been shown to be repressed by CpG island hypermethylation [85, 204]. Similarly, miRNAs have been shown to bind to, and downregulate T-UCRs, suggesting a complex regulatory mechanism may exist between ncRNAs in human cancers [85, 200]. Collectively, these reports have suggested that aberrant T-UCR expression may have a yet undetermined biological role in tumorigenesis.
Utility of LncRNAs in Cancer Diagnostics and Therapies
Like their smaller non-coding miRNA counterparts, lncRNAs represent a significant untapped resource in terms of developing diagnostics and therapies. Differential or high level expression of certain cancer type-specific lncRNAs can be exploited for the development of novel biomarkers as lncRNA expression or may potentially correlate with patient response to chemotherapy. Understanding the mechanism(s) by which lncRNAs act will continue to provide novel approaches to regulating genes including the development of mimetics to compete with binding sites for miRNAs, chromatin remodelers, or DNA. It has been suggested that mediating transcriptional gene silencing (TGS) pathways, especially those of tumor suppressors or oncogenes, could be of high therapeutic benefit .
It has been widely reported that cancer-specific miRNAs are detectable in the blood, sputum and urine of cancer patients [206–210]. Likewise, lncRNAs have demonstrated utility as fluid-based markers of specific cancers. For example, the prostate specific lncRNA DD3 has been developed into highly specific, nucleic acid amplification-based marker of prostate cancer, which demonstrated higher specificity than serum prostate-specific antigen (PSA) [211, 212]. Similarly, the highly expressed hepatocarcinoma-associated lncRNA HULC is detectable in the blood of hepatocarcinoma patients by conventional PCR methods .
The use of lncRNAs as therapeutic agents is only beginning to be explored . Although our understanding of the molecular mechanisms of lncRNA function is limited, some features of lncRNAs make them ideal candidates for therapeutic intervention. Many lncRNAs appear to have protein-binding or functional potential that is dependent on secondary structure, this may provide a means of intervention . Preventing the interactions of HOTAIR with the PRC2 or LSD1 complexes, for example, may limit the metastatic potential of breast cancer cells .
The tumor expression of certain lncRNAs provides a source of regulatory regions that can be used to reduce the risk of affecting normal tissues during transgene-mediated treatment. For example, H19 is strongly expressed in embryonic cells and in a wide-range of human cancers [20, 93, 94]. As such, a plasmid-based system has been developed to exploit the tumor-specific expression of H19, primarily tested in treating bladder cancer. A plasmid construct, harboring a diptheria toxin gene driven by H19 specific regulatory sequences, is either administered via intratumoral injection as naked DNA, or complexed to the cationic polymer polyethylenimine (PEI) to form a polyplex vector . PEI-complexed plasmid is thought to increase the efficiency of DNA uptake via clathrin-dependent and -independent (cholesterol-dependent) pathways . Upon uptake, high levels of diptheria toxin are expressed in the tumor, resulting in a reduction in tumor size in human trials [218–221]. Collectively, these advances indicate the potential in developing lncRNA mediated diagnostics and therapies.
Differential expression of lncRNAs is becoming recognized as a hallmark feature in cancer, however the functional role for the vast majority of these unique genes is still in question. In this review, we highlight characterized lncRNAs described to play a functional role in cancer-associated processes, such as metastasis and loss of imprinting. Aberrant lncRNA expression participates in carcinogenesis by disrupting major biological processes, such as redirecting chromatin remodeling complexes or inactivating major tumor suppressor genes. We also describe the potential role of dysregulated T-UCRs in cancer, and place these unique RNAs in the lncRNA category. Finally, we note the potential utility of lncRNAs in cancer as diagnostic and prognostic markers, as well as the potential of developing lncRNA mediated therapy.
EAG is a research fellow at the BC Cancer Agency Research Centre, specializing in non-coding RNA gene structure and function. CJB is a professor of Medical Genetics at the University of British Columbia. Her research focuses on human X chromosome inactivation, in particular the role of XIST, a gene she first described during postdoctoral studies in the Willard lab. WLL is a professor of Pathology and Laboratory Medicine at the University of British Columbia. His team invented tiling path array technologies for whole genome and methylome analyses.
We would like to thank Chad Malloff and Gavin Wilson for their insightful comments. This work was supported by funds from the Canadian Institutes for Health Research (MOP 86731, MOP 77903 to WLL and MOP 13690 to CJB), Canadian Cancer Society (CCS20485), NCI Early Detection Research Network (5U01 CA84971-10) and Department of Defense (CDMRP W81XWH-10-1-0634).
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.