A 3′UTR polymorphism modulates mRNA stability of the oncogene and drug target Polo-like Kinase 1
© Akdeli et al.; licensee BioMed Central Ltd. 2014
Received: 11 September 2013
Accepted: 15 April 2014
Published: 26 April 2014
The Polo-like Kinase 1 (PLK1) protein regulates cell cycle progression and is overexpressed in many malignant tissues. Overexpression is associated with poor prognosis in several cancer entities, whereby expression of PLK1 shows high inter-individual variability. Although PLK1 is extensively studied, not much is known about the genetic variability of the PLK1 gene. The function of PLK1 and the expression of the corresponding gene could be influenced by genomic variations. Hence, we investigated the gene for functional polymorphisms. Such polymorphisms could be useful to investigate whether PLK1 alters the risk for and the course of cancer and they could have an impact on the response to PLK1 inhibitors.
The coding region, the 5′ and 3′UTRs and the regulatory regions of PLK1 were systematically sequenced. We determined the allele frequencies and genotype distributions of putatively functional SNPs in 120 Caucasians and analyzed the linkage and haplotype structure using Haploview. The functional analysis included electrophoretic mobility shift assay (EMSA) for detected variants of the silencer and promoter regions and reporter assays for a 3′UTR polymorphism.
Four putatively functional polymorphisms were detected and further analyzed, one in the silencer region (rs57973275), one in the core promoter region (rs16972787), one in intron 3 (rs40076) and one polymorphism in the 3′untranslated region (3′UTR) of PLK1 (rs27770). Alleles of rs27770 display different secondary mRNA structures and showed a distinct allele-dependent difference in mRNA stability with a significantly higher reporter activity of the A allele (p < 0.01).
The present study provides evidence that at least one genomic variant of PLK1 has functional properties and influences expression of PLK1. This suggests polymorphisms of the PLK1 gene as an interesting target for further studies that might affect cancer risk, tumor progression as well as the response to PLK1 inhibitors.
Keywords: Polymorphism, PLK1, rs27770, 3′UTR
Polo-like kinases (PLKs) belong to the family of serine/threonine kinases. They are involved in the regulation of cell division and the centrosome cycle. Until now, four human PLKs have been identified. PLK1 is so far the best characterized polo-like kinase and a target for anticancer therapy [1, 2].
PLK1 promotes proliferation by supporting mitotic entry and inhibits apoptosis by interaction with p53 [3–5]. PLK1 is up-regulated in many different tumour tissues like head and neck squamous cell carcinoma, oesophagus and stomach cancer, ovarian cancer, non-small cell lung cancer, liver cancer, cervical cancer and breast cancer [6–9]. Overexpression of PLK1 has been suggested as a biomarker for numerical chromosomal aberration [10, 11]. Furthermore, overexpression is associated with poor prognosis in several cancer entities [9, 12–15]. Consistent with these findings, different PLK1 inhibitors, i.e. small molecules as well as an siRNA-based formulation, are currently under preclinical and clinical evaluation as promising anticancer drugs [16–18].
PLK1 is an important oncogene and drug target in many cancer entities. Genetic variability of such proteins can have an impact upon the risk and the outcome of different cancer types as well as the response of an individual to drug treatments [27, 28]. Until now, only very limited information about functionally relevant genetic variations of the PLK1 gene is available. The aim of this study was to systematically search for functional polymorphisms in the PLK1 gene, which could alter gene expression or protein function. We reviewed dbSNP and HapMap data and sequenced functionally relevant regions of PLK1. Retrieved polymorphisms were analyzed by in silico methods to predict functional polymorphisms. Four SNPs were selected for further evaluation and analyzed for linkage and haplotype structure. We identified rs27770 as a functional polymorphism that modulates the secondary structure and stability of PLK1 mRNA.
Sequencing results of the PLK1 gene and linkage analysis of SNPs in regulatory regions
Genotype distributions and allele frequencies in healthy Caucasians
rs57973275 (c.-1706G > A) and rs16972787 (c.-233G > A) alleles generate different putative transcription factor binding sites
rs27770 (c.*154A > G) alleles generate different secondary mRNA structures and the A allele leads to increased expression in HEK293 reporter assays
According to our database review and sequencing results, the coding region of PLK1 is conserved and polymorphisms are located in intronic and regulatory regions. This is in line with general findings on the occurrence of genetic variations, and especially of SNPs, in different gene regions. We considered database-derived polymorphisms with a minor allele frequency (MAF) of at least 1% in Caucasians. Nevertheless, the analyzed databases comprise a relevant number of rare coding-region variants with MAFs of less than 1% in Caucasians that could have a functional impact on PLK1. Some of these variants reach a MAF of more than 1% in other ethnicities (e.g. rs2230914). Given the number of chromosomes investigated by sequencing, the probability of detecting a new, undescribed polymorphism with a MAF of 1% was only 33%. An adequate detection probability of at least 90% was reached only for polymorphisms with a MAF of more than 5%. Therefore, it is possible that the PLK1 gene still harbors undetected rare variants (most likely non-SNP variations). Furthermore, other databases that were not systematically analyzed for this study might contain additional variations with MAFs above 1% in Caucasians. For example, after completion of our experiments we became aware of a missense variant (rs45569335) with an overall MAF of 0.7%, but with a MAF of 1.2% within the Caucasian subset of the 1000 Genomes browser.
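The detection probabilities quoted above follow from a simple sampling argument: a variant with minor allele frequency f is seen at least once among n independently sampled chromosomes with probability 1 − (1 − f)^n. The sketch below illustrates this. The number of chromosomes is not stated in this section, so the value of 40 is an assumption, chosen only because it reproduces the reported figure of about 33% at a MAF of 1%.

```python
def detection_probability(maf: float, n_chromosomes: int) -> float:
    """Probability that a variant with the given minor allele frequency
    appears at least once among n independently sampled chromosomes."""
    if not 0.0 <= maf <= 1.0:
        raise ValueError("maf must lie between 0 and 1")
    if n_chromosomes < 0:
        raise ValueError("n_chromosomes must not be negative")
    return 1.0 - (1.0 - maf) ** n_chromosomes


# ASSUMPTION: 40 chromosomes (20 individuals). This number is not given in
# the text; it is used here only because it matches the reported 33%.
N_CHROMOSOMES = 40

if __name__ == "__main__":
    for maf in (0.01, 0.05, 0.06):
        p = detection_probability(maf, N_CHROMOSOMES)
        print(f"MAF {maf:.0%}: detection probability {p:.1%}")
```

Under this assumption the 90% threshold is crossed between a MAF of 5% and 6%, which agrees with the statement that adequate power was reached only for MAFs above 5%.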
We selected four candidate SNPs for further investigation, which were either located within the regulatory regions of PLK1 (rs57973275, rs16972787 and rs27770) or showed an association with bladder cancer outcome in a previous study (rs40076). The results of the haplotype analysis implied that it is not necessary to genotype all four polymorphisms in future association studies and that two tagging SNPs, either rs57973275 or rs16972787 in combination with rs27770, would be sufficient to represent the haplotype and diplotype structure of these PLK1 SNPs. Furthermore, the strong linkage disequilibrium suggests rs27770 as the underlying functional SNP in the detected association of the PLK1 intron 3 SNP rs40076 with bladder cancer outcome.
Since the two SNPs 5′ of the coding region are located within previously identified important regulatory regions of PLK1, they were considered eligible candidates for bioinformatic and experimental assessment. Because different software applications access different databases, we used three in silico tools to predict putative transcription factor (TF) binding sites. The analysis revealed different allele-specific candidates for both SNPs. Computational approaches for identifying binding sites suffer from high error rates because binding motifs of TFs are typically short and degenerate; therefore, we performed EMSA experiments using three different cell lines to validate the in silico results. The EMSA results clearly indicated no functional impact of these polymorphisms on TF binding, so the SNPs were not evaluated further with regard to TF binding activity. Some authors reported that only 33% of all functional promoter variants were found in known consensus sequences or motifs; therefore, we cannot rule out an impact of the two SNPs on binding of other TFs that are not expressed by the selected cell lines. Another mechanism to regulate expression is methylation of CpG islands. Theoretically, the G allele of the promoter polymorphism rs16972787 could be a candidate for allele-specific methylation; however, changes of the methylation status in human malignant cells and tissues have been reported for PLK2 and PLK3 but not for PLK1 [12, 43, 44]. A further study suggested that the PLK1 promoter is unmethylated in G0/G1 (PLK1 not expressed) as well as in M phase (PLK1 expressed) and is regulated during the cell cycle by transcription factors.
Alleles of the 3′UTR polymorphism rs27770 were analyzed with regard to different functional RNA motifs and microRNA binding sites. The analysis revealed neither motif nor target site differences. Until now, at least six PLK1 mRNA-targeting microRNAs have been experimentally validated [46–51]. In line with our analysis, the predicted corresponding binding sites do not include the polymorphism. With regard to the secondary mRNA structure, another key factor for microRNA target recognition is the accessibility of the binding site. Different reports proposed altered microRNA binding because of allele-dependent changes of the secondary mRNA structure due to SNPs outside of the microRNA binding site [53, 54]. Furthermore, alterations of the secondary structure itself can interfere with RNA-binding proteins, which can lead to altered mRNA stability. We therefore investigated how the secondary structure of the PLK1 mRNA depends on the rs27770 alleles. Although only one nucleotide was substituted, major changes of the secondary structure were predicted. Reporter assays of the 3′UTR of PLK1 consistently showed statistically significant allele-dependent differences in mRNA stability. In comparison to the G allele, the A allele showed 25% more reporter activity, which faithfully reflects mRNA levels. Our results, as well as a previous bioinformatic comparison of HapMap and dbEST data, support a functional impact of rs27770. However, the two sets of results contradict each other, because the previous report predicted an increased expression of the G allele. This could have several reasons. First, an experimental validation of a subset of the predicted candidate SNPs confirmed only 36% of the results, and rs27770 was not part of the validation subset. Most of the SNPs (59%) showed no differential allelic expression, and alleles corresponding to 5% of the SNPs were significantly associated with gene expression in the opposite direction.
Second, in line with usual practice, we investigated the effect of the 3′UTR on mRNA stability by reporter assay; however, the complete PLK1 mRNA contributes to the secondary structure. It is therefore possible that the hybrid mRNA of the Firefly luciferase coding region and the PLK1 3′UTR leads to biased results because it forms other secondary structures. Third, both results could be genuine if the effect of the SNP is context-dependent and tissue-related. This is a well-known phenomenon and occurs often in connection with regulatory SNPs, especially if the SNP effect depends on differentially expressed transcription factors or microRNAs.
Finally, the genetic variability of the PLK1 gene, and rs27770 in particular, is an interesting candidate for additional studies. Because PLK1 plays an important role in the cell cycle and inhibits apoptosis, it should be investigated whether PLK1 polymorphisms have an impact on proliferation of malignant and non-malignant cells [3, 5]. This would lead to altered expression profiles in cancer tissues and might partly explain the detected variability of PLK1 expression in different cancer entities. In some malignancies, like acute lymphoblastic leukemia (ALL), PLK1 expression is highly variable, but expression is not associated with any clinical or biological feature, while ALL cell lines respond very well to PLK1 inhibitor treatment. For these malignancies, analysis of PLK1 polymorphisms would be an interesting approach to reanalyze genotype-dependent subsets with regard to expression patterns and clinical as well as biological features. PLK1 polymorphisms could be useful for risk as well as outcome studies in Caucasian cancer patients but also in other ethnicities because, according to dbSNP data, rs27770 occurs in other ethnicities as well. Furthermore, functional SNPs in drug target genes may have an impact on targeted therapy. It would therefore be of interest to evaluate PLK1 inhibitor studies with regard to PLK1 polymorphisms. Because of the predicted impact of the SNP on the secondary mRNA structure of PLK1 and the effects shown on mRNA stability, RNAi-based PLK1 inhibitors would be of special interest in this case. It is also well known that the target secondary structure has a major impact on siRNA and RNAi efficiency [59, 60]. We have no evidence for an interaction of PLK1 polymorphisms with currently clinically evaluated RNAi-based PLK1 inhibitors, but it would be reasonable to analyze the respective binding sites with regard to allele-dependent target accessibility.
Altogether, our results help to reveal the functional impact of genetic variants on PLK1 function. Based on such results, we can speculate about a putative clinical impact: I. these variants may play a role in carcinogenesis and modulate the risk for cancer; II. variants may contribute to altered tumor growth, which could lead to different disease courses; and III. PLK1 inhibitor response might be genotype dependent. Although our analyses were not exhaustive, the data presented here strongly indicate that a relevant proportion of the detectable inter-individual variability of PLK1 expression, with concomitant molecular changes, is determined by genomic variants of PLK1.
We retrieved and analyzed data on genetic variations of the PLK1 gene region from NCBI dbSNP and the HapMap database. To further analyze HapMap data we used Haploview 4.2, but none of the currently accessible HapMap versions contained data on all four polymorphisms of this study. Therefore, Haploview was used to analyze and visualize our own data only. Analysis of putative allele-dependent binding sites of transcription factors due to SNPs within regulatory regions of the PLK1 gene was performed with MatInspector, Consite and Alibaba2.1 using default settings. To study whether microRNA binding or regulatory RNA motifs could be altered by SNPs, we analyzed the 3′UTR using RegRNA 2.0. The mRNA sequences harboring the different rs27770 alleles were submitted to the web tool mfold to predict secondary structures. The full mRNA sequence of PLK1 [NCBI RefSeq NM_005030] was used for analysis. Sequences were folded with mfold in a locally automated manner. The structures predicted to have the lowest energy were used to identify the folding state. For computing suboptimal foldings, the percent suboptimality value that controls the free energy increment was set to 5%.
Sequencing of the PLK1 gene
Table: Primers for PCR and sequencing of PLK1. Columns: Sense (5′ – 3′), Antisense (5′ – 3′). Row: Exon8 + 9.
Genotyping of PLK1 polymorphisms
The polymorphisms rs57973275, rs40076 and rs27770 were genotyped by restriction fragment length polymorphism analyses. For all polymerase chain reactions (PCR) the Taq DNA Polymerase Master Mix RED (Ampliqon, Herlev, Denmark) was used. The PCR for rs57973275 was performed with the following primers: 5′-TCCCTGGACTTTGTCCATG-3′ and 5′-ACCACCTCCTAGTCTGATG-3′, resulting in a PCR product of 138 bp. Amplified fragments were digested with the restriction enzyme DdeI (New England Biolabs, Beverly, MA, USA) by incubating for 4 hours at 37°C. DdeI specifically cuts PCR products that carry the G allele (98 + 40 bp). For rs40076 a 110 bp fragment was amplified from genomic DNA with the following primers: 5′-TGTGGTCCATTGGGTGTATC-3′ and 5′-AAGGTCCACAGAAAAGGTC-3′. The variant G allele generates a PsyI (Fisher Scientific, Schwerte, Germany) restriction site that leads to two bands (50 + 60 bp). Genotypes of rs27770 were determined using the primers 5′-CTCCCGCGGTGCCATGTCT-3′ and 5′-CCGAACATGTACAAAAATAACGTA-3′ and the restriction enzyme RsaI (New England Biolabs, Ipswich, MA, USA), which cuts the G allele (87 + 13 bp). For rs16972787 no appropriate allele-specific restriction enzyme was available and it was therefore genotyped by Pyrosequencing. PCR was performed using forward primer 5′-GGTCTCCGCATCCACGCCGG-3′ and biotinylated reverse primer 5′-TCCAAACCCGCCCGCCGCGC-3′, resulting in a 150 bp fragment. The DNA amplification was carried out using Taq PCR Mastermix (Eppendorf, Hamburg, Germany). The biotinylated strand was captured on streptavidin-coated beads, annealed with sequencing primer 5′-CCAGGCTATCCCACGTGTT-3′ and sequenced with a PyroMark Q96 MD (Qiagen, Hilden, Germany). Results were analyzed using the PSQ96 SNP software (Qiagen, Hilden, Germany). Adequate negative and positive controls were used for genotyping of all SNPs.
Accuracy of genotyping was additionally validated by direct sequencing of 10% randomly selected samples and of the samples harboring rare haplotypes with a frequency under 1%. This revealed complete concordance with previous results.
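The band patterns described for the three RFLP assays can be turned into genotype calls with a small lookup, sketched below. Product and fragment sizes are taken from the text; the 100 bp product for rs27770 is inferred from 87 + 13 bp, the non-G allele of rs40076 is not named in this section and is labeled "non-G", and the band-matching tolerance and the choice to score only the larger cut fragment (a 13 bp band may not be visible on a gel) are assumptions of this illustration, not part of the published protocol.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass(frozen=True)
class RflpAssay:
    enzyme: str
    product_bp: int                    # uncut PCR product
    cut_fragments_bp: Tuple[int, int]  # fragments produced from the G allele
    uncut_allele: str                  # label of the allele that is not cut


ASSAYS = {
    "rs57973275": RflpAssay("DdeI", 138, (98, 40), "A"),
    # The allele that is not cut is not named in the text, hence "non-G".
    "rs40076": RflpAssay("PsyI", 110, (60, 50), "non-G"),
    # ASSUMPTION: 100 bp product, inferred from the 87 + 13 bp fragments.
    "rs27770": RflpAssay("RsaI", 100, (87, 13), "A"),
}


def call_genotype(snp: str, bands_bp: Iterable[float],
                  tolerance_bp: float = 3.0) -> str:
    """Call a genotype from observed band sizes (hypothetical helper)."""
    assay = ASSAYS[snp]
    bands = list(bands_bp)

    def seen(size: int) -> bool:
        return any(abs(band - size) <= tolerance_bp for band in bands)

    has_uncut = seen(assay.product_bp)
    # Only the larger cut fragment is scored; the smaller one may be faint.
    has_cut = seen(max(assay.cut_fragments_bp))

    if has_cut and has_uncut:
        return f"G/{assay.uncut_allele}"
    if has_cut:
        return "G/G"
    if has_uncut:
        return f"{assay.uncut_allele}/{assay.uncut_allele}"
    raise ValueError(f"no expected band found for {snp}: {bands}")
```

For example, `call_genotype("rs57973275", [138, 98, 40])` reports a heterozygous sample, because both the uncut product and the G-allele fragments are present.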
Electrophoretic mobility shift assays (EMSA)
Nuclear extracts from HEK293, HeLa and HepG2 cells were prepared using the NuCLEAR™ extraction kit (Sigma, Deisenhofen, Germany) and stored at −80°C until use. EMSAs were done with the DIG Gel Shift kit (Roche Applied Science, Mannheim, Germany) using digoxigenin (DIG)-labeled double-stranded oligonucleotides. The double-stranded oligonucleotides were made of synthesized single-stranded oligonucleotides: rs57973275 G allele 5′-ACTACAGGCTGAGTCTGTGAATCTCC-3′ and 5′-GGAGATTCACAGACTCAGCCTGTAGT-3′, A allele 5′-ACTACAGGCTGAATCTGTGAATCTCC-3′ and 5′-GGAGATTCACAGATTCAGCCTGTAGT-3′; and for rs16972787 G allele 5′-CCACGTGTTCGGGCGTCCGTGTCAAT-3′ and 5′-ATTGACACGGACGCCCGAACACGTGG-3′, A allele 5′-CCACGTGTTCAGGCGTCCGTGTCAAT-3′ and 5′-ATTGACACGGACGCCTGAACACGTGG-3′. Single-stranded oligonucleotides (200 pmol) were mixed in TEN buffer (10 mM Tris, 1 mM EDTA, 0.1 M NaCl, pH 8.0), incubated at 95°C for 10 min and chilled on ice to let the oligonucleotides anneal. Double-stranded oligonucleotides (3.85 pmol) were DIG labeled and equal labeling efficiency was verified by dot blot analysis. Probes were incubated with 10 μg nuclear extracts for 20 min at room temperature, followed by non-denaturing 6% polyacrylamide gel electrophoresis with 0.5-fold TBE running buffer (45 mM Tris, 45 mM boric acid, 1 mM EDTA, pH 8.0). Controls contained labeled probe alone, and competition experiments were performed with an additional 250-fold molar excess of unlabeled probe. EMSAs were performed in triplicate for every cell line and both polymorphisms. DNA-protein complexes were electroblotted to positively charged nylon membranes (Roche, Mannheim, Germany) and the band shifts were visualized according to the user’s manual for the DIG Gel Shift kit.
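As a quick consistency check on the probe design, the listed oligonucleotide pairs can be tested for exact reverse complementarity, and the two alleles of each SNP for a difference at a single position. The sketch below uses only the sequences given above; the function and variable names are introduced here for illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def mismatch_positions(a: str, b: str) -> list:
    """1-based positions at which two equally long sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


# (SNP, allele) -> (sense strand, antisense strand), both written 5' to 3'.
EMSA_OLIGOS = {
    ("rs57973275", "G"): ("ACTACAGGCTGAGTCTGTGAATCTCC",
                          "GGAGATTCACAGACTCAGCCTGTAGT"),
    ("rs57973275", "A"): ("ACTACAGGCTGAATCTGTGAATCTCC",
                          "GGAGATTCACAGATTCAGCCTGTAGT"),
    ("rs16972787", "G"): ("CCACGTGTTCGGGCGTCCGTGTCAAT",
                          "ATTGACACGGACGCCCGAACACGTGG"),
    ("rs16972787", "A"): ("CCACGTGTTCAGGCGTCCGTGTCAAT",
                          "ATTGACACGGACGCCTGAACACGTGG"),
}

if __name__ == "__main__":
    for (snp, allele), (sense, antisense) in EMSA_OLIGOS.items():
        ok = reverse_complement(sense) == antisense
        print(f"{snp} {allele} allele: strands complementary = {ok}")
    for snp in ("rs57973275", "rs16972787"):
        diff = mismatch_positions(EMSA_OLIGOS[(snp, "G")][0],
                                  EMSA_OLIGOS[(snp, "A")][0])
        print(f"{snp}: alleles differ at sense-strand position(s) {diff}")
```

All four pairs are exact reverse complements, and each allele pair differs only at the polymorphic base (position 13 of the rs57973275 probe and position 11 of the rs16972787 probe).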
Transient transfection of HEK293 cells and luciferase reporter assay
The 3′UTR of PLK1 was PCR-amplified from genomic DNA and cloned into the pGEM-T Easy Vector (Promega, Madison, WI, USA). The amplified 3′UTR was excised from the pGEM-T Vector and cloned downstream of the Firefly luciferase coding region into the pMIR-REPORT™ vector (Applied Biosystems, Foster City, CA). HEK293 cells were plated into 96-well plates at a density of 1.5 × 10⁴ cells/well in 100 μl of DMEM medium with 10% FBS. After 24 h, co-transfections were carried out in 50 μl of DMEM medium without serum using 150 ng of the respective pMIR reporter construct, 50 ng of Renilla luciferase control vector (pGL4.74, Promega, Madison, WI, USA) and 0.5 μl of Lipofectamine 2000 (Invitrogen, Karlsruhe, Germany) per transfection according to the manufacturer’s instructions. After 6 h, the transfection mix was removed and cells were incubated with new DMEM medium. 24 hours after treatment, cells were harvested and assayed for Firefly and Renilla luciferase activities using the Dual-Glo Luciferase Assay System (Promega, Madison, WI, USA) on a Lumat LB 9501 Luminometer (Berthold, Bad Wildbad, Germany). To correct for variable transfection efficiency, Firefly luciferase activity was normalized to Renilla luciferase activity.
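The normalization step amounts to dividing each well's Firefly signal by its Renilla signal and then comparing the mean ratios between constructs. A minimal sketch is given below; the luminometer readings are invented purely for illustration and are not data from this study, and the function names are hypothetical.

```python
from statistics import mean
from typing import Iterable, Tuple

Well = Tuple[float, float]  # (Firefly reading, Renilla reading) of one well


def normalized_activity(firefly: float, renilla: float) -> float:
    """Firefly signal relative to the co-transfected Renilla control."""
    if renilla <= 0:
        raise ValueError("Renilla reading must be positive")
    return firefly / renilla


def mean_normalized(wells: Iterable[Well]) -> float:
    """Mean normalized activity over replicate wells."""
    return mean(normalized_activity(f, r) for f, r in wells)


def percent_difference(test: Iterable[Well], reference: Iterable[Well]) -> float:
    """Percent change of the test construct relative to the reference."""
    return (mean_normalized(test) / mean_normalized(reference) - 1.0) * 100.0


if __name__ == "__main__":
    # HYPOTHETICAL readings in relative light units, not measured values.
    a_allele_wells = [(1250.0, 100.0), (1000.0, 80.0)]
    g_allele_wells = [(1000.0, 100.0), (500.0, 50.0)]
    change = percent_difference(a_allele_wells, g_allele_wells)
    print(f"A allele vs G allele: {change:+.1f}% reporter activity")
```

Normalizing per well before averaging keeps a single well with unusually high transfection efficiency from dominating the comparison.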
Deviation from the Hardy–Weinberg equilibrium was tested with a web tool by Rodriguez et al. Linkage disequilibrium and haplotypes were assessed using Haploview. One-way ANOVA was used to analyze the overall difference of reporter activities. To correct for multiple comparisons, Holm-Sidak’s post-hoc multiple comparisons test was used for pairwise comparisons of reporter activities. All statistical analyses were performed using GraphPad Prism 6.0 (GraphPad Software, San Diego, CA, USA). Differences were regarded as significant at p < 0.05.
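A check for Hardy–Weinberg equilibrium can be illustrated with a chi-square goodness-of-fit test on genotype counts. The sketch below is a generic textbook version with one degree of freedom and is not necessarily the procedure implemented in the web tool used for this study; the function name is introduced here for illustration.

```python
import math
from typing import Tuple


def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int) -> Tuple[float, float]:
    """Chi-square test (1 degree of freedom) for Hardy-Weinberg equilibrium.

    Takes the observed counts of the two homozygous genotypes and the
    heterozygous genotype and returns (chi-square statistic, p-value).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotype must be observed")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of the first allele
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Classes with an expected count of zero (monomorphic sample) are skipped.
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of the chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value
```

Counts that match the expected proportions exactly, such as 25, 50 and 25, give a statistic of 0 and a p-value of 1, whereas a sample without any heterozygotes, such as 50, 0 and 50, is rejected at p < 0.05.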
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.