Skip to main content

Advertisement

  • Research
  • Open Access
  • Meta-analysis of glioblastoma multiforme versus anaplastic astrocytoma identifies robust gene markers

    Molecular Cancer20098:71

    https://doi.org/10.1186/1476-4598-8-71

    • Received: 23 February 2009
    • Accepted: 4 September 2009
    • Published:

    Abstract

    Background

    Anaplastic astrocytoma (AA) and its more aggressive counterpart, glioblastoma multiforme (GBM), are the most common intrinsic brain tumors in adults and are almost universally fatal. A deeper understanding of the molecular relationship of these tumor types is necessary to derive insights into the diagnosis, prognosis, and treatment of gliomas. Although genomewide profiling of expression levels with microarrays can be used to identify differentially expressed genes between these tumor types, comparative studies so far have resulted in gene lists that show little overlap.

    Results

    To achieve a more accurate and stable list of the differentially expressed genes and pathways between primary GBM and AA, we performed a meta-analysis using publicly available genome-scale mRNA data sets. There were four data sets with sufficiently large sample sizes of both GBMs and AAs, all of which coincidentally used human U133 platforms from Affymetrix, allowing for easier and more precise integration of data. After scoring genes and pathways within each data set, we combined the statistics across studies using the nonparametric rank sum method to identify the features that differentiate GBMs and AAs. We found >900 statistically significant probe sets after correction for multiple testing from the >22,000 tested. We also used the rank sum approach to select >20 significant Biocarta pathways after correction for multiple testing out of >175 pathways examined. The most significant pathway was the hypoxia-inducible factor (HIF) pathway. Our analysis suggests that many of the most statistically significant genes work together in a HIF1A/VEGF-regulated network to increase angiogenesis and invasion in GBM when compared to AA.

    Conclusion

    We have performed a meta-analysis of genome-scale mRNA expression data for 289 human malignant gliomas and have identified a list of >900 probe sets and >20 pathways that are significantly different between GBM and AA. These feature lists could be utilized to aid in diagnosis, prognosis, and grade reduction of high-grade gliomas and to identify genes that were not previously suspected of playing an important role in glioma biology. More generally, this approach suggests that combined analysis of existing data sets can reveal new insights and that the large amount of publicly available cancer data sets should be further utilized in a similar manner.

    Keywords

    • Malignant Glioma
    • Gene List
    • Glioblastoma Multiforme
    • Anaplastic Astrocytoma
    • Feature List

    Background

    High-grade gliomas, which include World Health Organization grade III astrocytomas (anaplastic astrocytoma: AA) and grade IV astrocytomas (glioblastoma multiforme: GBM), are the most common intrinsic brain tumors in adults and are almost universally fatal. GBMs are particularly invasive and aggressive. Patients diagnosed with GBM have a median survival time of one year [1], and less than 20% survive two years [2]; in contrast, the median survival for patients with AA is 30 months [1]. Nearly all GBMs (>90%) are primary, i.e. they develop de novo with no evidence of a less malignant precursor lesion, whereas secondary GBMs develop from lower-grade astrocytomas [3]. Histological criteria are currently the basis for tumor grading and prognosis, with GBM showing increased necrosis, vascular proliferation, nuclear pleomorphism, mitoses and invasiveness when compared to AA. The molecular basis for the histological and prognostic differences between grade III and grade IV astrocytomas remains an area of active investigation, e.g. one study found genes associated with necrosis in high-grade gliomas [4]. A deeper understanding of the basis for these differences may lead to new therapeutic strategies for treating these tumors.

    Differences in chromosomal alterations in AA and GBM have been described in several studies. For example, loss of heterozygosity for chromosome 10 was often observed in high-grade astrocytomas, and its frequency was found to be different between AA and GBM [5]. Aberrations involving p53, EGFR, PTEN, and other genes have also been reported as having different frequencies in AA and GBM. Importantly, differences within the same grade were also observed. Aberrations on chromosome 10, for example, were found to be an independent, adverse prognostic marker for survival, even after accounting for age and grade [5, 6]. With the advent of microarrays, molecular portraits of these tumor grades were refined, and expression profiling was found to be a better predictor of outcome than histological criteria [7, 8]. These and other studies revealed the presence of molecular subgroups of malignant gliomas. One recent study identified three molecular subclasses of GBM that were characterized by proneural, proliferative, and mesenchymal mRNA expression signatures [9], and another isolated an expression signature that distinguished survival phenotypes [10].

    Although a number of expression profiling studies have been performed on AA and GBM, they give conflicting results with regard to the list of relevant, differentially expressed genes between GBM and AA. This variability may be due to several factors. Most importantly, the sample sizes for these studies were relatively small due to the limited availability of suitable specimens and the significant costs associated with these studies. Other factors include differences in: the quality of the tissue specimens used (e.g. presence of non-tumor brain tissue or extensive necrosis), the microarray platforms used, the statistical methods employed to identify differentially expressed genes [11], and patient demographics such as age, gender, and race. Given the large number of factors that influence the list of differentially expressed genes, it is not surprising that gene lists from independent studies show little overlap. This lack of overlap has been observed in nearly all diseases in which microarrays have been employed, although the extent of the discrepancy depends on the heterogeneity of the disease [12].

    To compile the most accurate and robust list of relevant genes, we performed a meta-analysis of multiple independent publicly available data sets, mostly from the Gene Expression Omnibus (GEO). GEO is the largest public repository of microarray data; it now contains over 250,000 samples and its size is rapidly increasing [13]. While many of the published microarray data sets are centrally stored through GEO, the format and quality of the data sets are variable, and the annotations, both in terms of the probes on the platform and the sample phenotypes, are often incomplete. Thus, integrating information from these data sets requires significant bioinformatics analysis. In this study, many data sets were examined, and four data sets that satisfied our criteria for suitability in meta-analysis were selected. The resulting list of genes that are differentially expressed between AA and GBM is likely to be more robust and stable than that derived from any individual study to date.

    Results

    Identification of appropriate data sets

    To identify gene expression differences between AA and GBM, we searched GEO and many other databases containing publicly-available microarray data for data sets that contain information for both grades. One possible strategy for meta-analysis would have been to collect all data sets containing GBMs and all data sets containing AAs separately, and to then perform a single differential analysis. However, this could potentially lead to artifactual results due to methodological or technical differences among the studies, as mentioned above. Platform differences, for example, can have a significant influence on the results of microarray analyses. We have previously shown that even the differences arising from the use of successive generations of microarray platforms produced by the same company (e.g. Affymetrix) can be larger than the differences among patient samples [14]. While such artifactual effects can be reduced somewhat with careful normalization and use of robust statistics, they cannot be eliminated. A more conservative approach is to combine the information obtained at the level of "within-experiment" gene lists, so that platform-specific and other biases are reduced.

    To increase the reliability of the results, we further enforced stringent criteria for data inclusion. Distinct expression profiles exist between primary and secondary GBMs [15], and it is estimated that 95% of GBMs are primary [3]. Hence, to pinpoint targets for forced grade reduction of primary GBMs, we focus exclusively on contrasting AAs to primary GBMs. (Although there is not sufficient data for a meta-analysis of AAs versus secondary GBMs, interesting findings have come out of this comparison, e.g. [16].) We found four large in vivo expression data sets, three from GEO and one from UCLA, that assay AAs and primary GBMs. All four studies used the Affymetrix (Santa Clara, CA) human U133 platform. This similarity of expression platforms simplifies the analysis, although the same process with minor modifications would have worked well even with differing platforms. Table 1 summarizes the data used for meta-analysis. Note that this platform consistency is not by design; our search of human in vivo expression studies did not yield any other studies that assayed five or more GBMs and AAs in patient samples, regardless of platform.
    Table 1

    Summary of the data sets

     

    Petalidis

    Phillips

    Sun

    Tso

    TOTAL

    AA

    19

    21

    19

    9

    68

    GBM

    39

    56

    81

    45

    221

    TOTAL

    58

    77

    100

    54

    289

    GEO ID

    GSE1993

    GSE4271

    GSE4290

    (at UCLA)

     

    Affy chip

    U133A

    U133A and U133B

    U133 plus 2.0

    U133A

     

    Journal

    Mol Cancer Ther

    Cancer Cell

    Cancer Cell

    Cancer Research

     

    Year

    2008

    2006

    2006

    2006

     

    Statistical approach to combining data sets

    Analysis of differential expression in a single data set has been examined in great detail in the past decade [17, 18]. See [11] for a comparison of some commonly used methods. For this work, we focus on the methods for combining multiple data sets, i.e. for combining the scores of individual features across microarray experiments. Two classical statistical techniques that combine a feature's p-values directly are Fisher's method [19, 20], which relies on the sum of the logarithm of the p-values, and an alternative method proposed by Stouffer et al. (1949; cited in [21]), which transforms p-values into z-scores. Fisher's method was used, for instance, for analysis of microarray data on breast cancer [22], and both Fisher's and a weighted version of Stouffer's method were applied to study prostate cancer [23, 24]. Meta-analytic methods have also been developed specifically for genomics, many of which rely on traditional statistical approaches such as random effects [25, 26] and Bayesian modelling [27, 28], and some techniques have been advanced specifically for combining cancer microarray data [29]. Meta-analysis for genomics has accrued so much literature that there is now a book dedicated to the topic [30].

    Breitling et al. (2004) and Hong and Breitling (2007) have proposed a simple, intuitive method that evaluates genes based only on the product (or the sum) of its ranks [31, 32]. This method ranks each feature (such as a gene) within an experiment based on that feature's score (say, a t-statistic), and then combines these ranks, rather than combining the data or p-values themselves. For example, if a certain gene is the most differentially expressed gene in one experiment and is the tenth most differentially expressed gene in the three others, then its rank sum will be 1+10+10+10 = 31 and its rank product will be 1*10*10*10 = 1000, where the smaller is the rank sum or rank product, the more significant is the gene. The two approaches differ only in how they penalize the larger ranks; the rank product becomes very large even with a single high rank. Because rank-based procedures do not make assumptions about the model and parameters from which the data came, they are termed non-parametric.

    We chose to use a rank-based method because: 1) in practice, the main purpose of microarray experiments is to rank genes rather than to obtain precise estimates of their statistical significance, since the number of statistically significant genes often greatly exceeds the number of genes that can be validated [33], 2) non-parametric analyses are more robust in general, 3) the techniques and assumptions used in the estimation of p-values and the subsequent correction for multiple hypothesis testing may be different between data sets and may not be directly comparable, and 4) using non-parametric methods to rank genes has proven highly effective in the context of genomics. Although more sophisticated rank-based procedures are available [34], the rank sum and rank product methods have been shown to give good results on microarray data [32]. Because the rank sum technique is more robust than the rank product approach and is preferable when the variance of some features may be larger than others [35], we employ the rank sum procedure.

    As a complement to the ordered gene list for each study, which we derive using moderated t-statistics [36], we also quantify differentially activated pathways between GBM and AA. The benefit of testing the significance of a priori defined gene sets (which correspond to pathways in this article) is that the recognition of such pathways may allow for better elucidation of the underlying biology, improved drug target development, and greater generalizability [37]. In this work, we used a statistical method that we previously developed to identify significant gene sets while accounting for the differing sizes of gene sets and their correlation structure [38].

    Meta-analysis gene list

    Of our 22,215 probe sets (see Methods), we identified 933 with rank sum based q-values [39], an analogue of p-values when many features are being tested, below 1%. These meta-analytic statistics provide a combined ranking of significant genes while not allowing strong p-values from any individual study to dominate the results. The amount of differential expression observed is illustrated in Figure 1, which shows histograms depicting the q-values from each study on the left and the relatively conservative q-values from the meta-analysis on the right. The high proportion of genes with low q-values indicates that many more genes are found to be differentially expressed than expected by chance.
    Figure 1
    Figure 1

    Gene q-value histograms. Histograms depicting the significance levels of probe sets from each of the four studies, and from the meta-analysis.

    The top row of Figure 2 displays these findings through Concordance At the Top (CAT) plots [40], which quantify the concordance of two lists along list ranks. These plots have a straightforward interpretation. For example, if two gene lists share in common 80 of their top 100 genes, then at rank 100 their concordance would be 80%. Hence, the gene plots in Figure 2 show that no study dominates the final meta-analysis gene list and that these studies' gene lists, although far more concordant than would be expected by chance, contain ample heterogeneity. This coupling of apparent heterogeneity with abundant meta-analytic significance points to a wealth of information only available through a powerful meta-analysis.
    Figure 2
    Figure 2

    CAT plots. Concordance At the Top plots comparing ranked probe set lists from pairs of studies (top left) and from each study to the meta-analysis probe set list (top right), and similarly for pathways (bottom row).

    Information on the top 30 meta-analysis genes is displayed in Table 2, while Additional file 1 contains information for all probe sets. A survey of the literature indicates that several of these top 30 genes have been demonstrated to play an important role in the biology of malignant gliomas. These include CHI3L1 [41, 42], FN1 [43, 44], CLIC1 [45], VEGFA [46], IGFBP2 [47], ADM [48], and COL4A1/COL4A2 [49].
    Table 2

    Comparison of top genes

    Petalidis

    Phillips

    Sun

    Tso

    Meta

    Meta Gene Name

    TIMP1*

    CALR*

    SRPX2

    RPS3A*

    GLUD1*

    glutamate dehydrogenase 1

    ADCY2

    CSF1*

    ADAM12*

    SC5DL

    GLUD2

    glutamate dehydrogenase 2

    LAMC1*

    MTSS1*

    ALDH5A1*

    ZNF192*

    CLIC1*

    chloride intracellular channel 1

    ABHD6

    ANXA2P3

    LAMB1*

    ANXA1*

    COL4A1*

    collagen, type IV, alpha 1

    COL5A2

    CRHR2*

    ACTN1*

    MYST4

    CCDC109B

    coiled-coil domain containing 109B

    CSDC2

    BMP8A*

    NTSR2*

    CAPG*

    VEGFA*

    vascular endothelial growth factor A

    RBP1*

    CALCRL

    FAM69A*

    ACVR2B*

    COL4A2*

    collagen, type IV, alpha 2

    EFEMP2*

    CG030*

    PPP2R5A*

    SPP1*

    RASL10A

    RAS-like, family 10, member A

    COL4A1*

    SCD*

    HRSP12*

    CHI3L1*

    IGFBP2*

    insulin-like growth factor binding protein 2, 36kDa

    KIAA049

    SORBS1

    COL5A2

    MYOZ2

    FN1*

    fibronectin 1

    IGFBP2*

    PEX5

    KDELR3

    SLC25A21

    PPP2R5A*

    protein phosphatase 2, regulatory subunit B', alpha isoform

    LGALS1*

    ADCY2

    SERPINH1*

    TMLHE

    PSRC1*

    proline/serine-rich coiled-coil 1

    PLP2

    GLUD2

    SRF*

    GNAL

    LUZP2

    leucine zipper protein 2

    CLIC1*

    EDG8

    ZHX2

    AKT3*

    LPIN1

    lipin 1

    USH1C*

    HSDL2

    LOC727942

    TSC22D2

    ARL4C

    ADP-ribosylation factor-like 4C

    FKBP9

    PI4K2A

    FOSL2*

    RICS*

    PDIA4*

    protein disulfide isomerase family A, member 4

    BMP2*

    GLUD1*

    LOXL2*

    LUZP2

    ADM*

    adrenomedullin

    AKAP6

    DCAKD

    CHPF

    NET1*

    CSDC2

    cold shock domain containing C2, RNA binding

    LDHA*

    CCL19

    LMNA*

    GUSB*

    TMSB10*

    thymosin, beta 10

    DHTKD1

    RAB40C

    SLC16A3

    C11orf41

    CHI3L1*

    chitinase 3-like 1 (cartilage glycoprotein-39)

    ADM*

    DIP2C

    BICD1

    GLUD1*

    NET1*

    neuroepithelial cell transforming gene 1

    KIAA0746

    LOC645226*

    IQCK

    LITAF*

    MCAM*

    melanoma cell adhesion molecule

    KDELR2*

    TNKS2*

    FAM129A*

    CEP350*

    LDHA*

    lactate dehydrogenase A

    LBH

    AP2B1*

    COL4A2*

    RASL10A

    COL1A2*

    collagen, type I, alpha 2

    COL4A2*

    HIRA

    CSGlcA-T

    HLA-C*

    ALDH2*

    aldehyde dehydrogenase 2 family (mitochondrial)

    CALD1*

    MAP2K3*

    TGFB1I1*

    CLCA2*

    COL3A1

    collagen, type III, alpha 1 (Ehlers-Danlos syndrome type IV, autosomal dominant)

    COL3A1

    GRWD1

    C21orf7

    S100A11*

    LAMC1*

    laminin, gamma 1 (formerly LAMB2)

    TMED9

    LPIN1

    ARL4C

    PLCB1*

    SLC1A4*

    solute carrier family 1 (glutamate/neutral amino acid transporter), member 4

    TAGLN2*

    VPS13D*

    ZGPAT

    TRIP4*

    MSN*

    moesin

    PELO

    MUC8*

    PVR*

    SERPINA3*

    CNTN1*

    contactin 1

    The first 5 columns contain the top 30 genes from each study and from the meta-analysis. Column 6 contains gene names for the top meta-analysis genes, all of which have a q-value < 0.001. Genes in the first 4 columns that appear in the top 30 of the meta-analysis gene list are in bold, while those that are related to the PubMed query "glioma OR cancer OR astrocytoma" are given an asterisk.

    Comparison to literature

    An automated search of the PubMed database for relevant abstracts related to the top 30 genes derived from each study and from the meta-analysis (using the search term "glioma OR cancer OR astrocytoma") indicates that the meta-analysis genes are associated with the greatest number of relevant citations and contain the highest proportion of genes with relevant citations (see Table 3). For example, the gene VEGFA, which is known to be important in glioma and generates more than 750 pertinent citations on its own, is 6th in the meta-analysis list but does not fall among the top 30 genes on any of the individual studies' lists.
    Table 3

    Counts of relevant citations

     

    Petalidis

    Phillips

    Sun

    Tso

    Meta

    # citations (PubMed)

    128

    50

    58

    127

    154

    % cited (PubMed)

    53%

    47%

    57%

    67%

    73%

    % cited (Ingenuity)

    47%

    33%

    27%

    40%

    60%

    Comparison of relevant citation counts of top 30 genes from each of the four studies and from the meta-analysis.

    The largest number of relevant citations derived from the top 30 genes of any single study is 128. To ensure that VEGFA does not dominate the comparison, we assigned it the same number of citations as the second best performing gene from all of Table 2 (which is SPP1, with 41 relevant citations). After this citation reduction for VEGFA, the meta-analysis list still generates 154 glioma/cancer/astrocytoma-related citations. The meta-analysis list's top 30 genes also have more citations related to the search term (22 genes) than any of the four studies. We further substantiated the results obtained from PubMed by evaluating these same gene lists using the manually curated Ingenuity Pathways Analysis software program (Ingenuity Systems, http://www.ingenuity.com). Ingenuity found that 60% of the top 30 unique genes from the meta-analysis have known connections to cancer, which is a 13% increase above the top-performing individual study.

    Meta-analysis pathway list

    We next performed pathway analysis to determine whether the genes identified by the meta-analysis might work cooperatively. Of 178 Biocarta http://www.biocarta.com pathways, 21 had a q-value below 2%. These 21 gene sets are shown in Table 4, while Additional file 2 contains the information for all gene sets. These significance results coupled with the bottom row of Figure 2 show that, similarly to our gene-level analysis, we are able to glean insight from heterogeneous pathway lists using the rank sum method. As hoped, owing to the greater reproducibility in general of gene set analyses, the pathway lists of the individual studies also exhibit greater concordance to each other than do their gene lists (this can be witnessed by examining fixed percentiles of the gene and pathway lists in Figure 2.)
    Table 4

    Top pathways

     

    Pathway

    Size

    % up

    Change

    q-value

    HIF

    Hypoxia-Inducible Factor in the Cardiovascular System

    31

    65

    up

    0

    PROTEASOME

    Proteasome Complex

    37

    89

    up

    5.00E-04

    MYOSIN

    PKC-catalyzed phosphorylation of inhibitory phosphoprotein of myosin phosphatase

    25

    28

    down

    0.00367

    VITCB

    Vitamin C in the Brain

    28

    61

    up

    0.0055

    LYMPHOCYTE

    Adhesion Molecules on Lymphocyte

    27

    81

    up

    0.00942

    NOS1

    Nitric Oxide Signaling

    53

    25

    down

    0.00942

    P53HYPOXIA

    Hypoxia and p53 in the Cardiovascular system

    36

    72

    up

    0.00942

    SALMONELLA

    How does salmonella hijack a cell

    26

    81

    up

    0.00942

    ATM

    ATM Signaling

    36

    72

    up

    0.00942

    CASPASE

    Caspase Cascade in Apoptosis

    43

    72

    up

    0.00942

    PAR1

    Thrombin signaling and protease-activated receptors

    39

    31

    down

    0.00942

    G2

    Cell Cycle: G2/M Checkpoint

    43

    70

    up

    0.00942

    TSP1

    TSP-1 Induced Apoptosis in Microvascular Endothelial Cell

    21

    86

    up

    0.00946

    ACTINY

    Y branching of actin filaments

    31

    77

    up

    0.0109

    NEUTROPHIL

    Neutrophil and Its Surface Molecules

    21

    81

    up

    0.0152

    ATRBRCA

    Role of BRCA1, BRCA2 and ATR in Cancer Susceptibility

    40

    85

    up

    0.0152

    MONOCYTE

    Monocyte and its Surface Molecules

    30

    80

    up

    0.0155

    FAS

    FAS signaling (CD95)

    65

    62

    up

    0.0177

    PGC1A

    Regulation of PGC-1a

    54

    24

    down

    0.0184

    CELLCYCLE

    Cyclins and Cell Cycle Regulation

    37

    73

    up

    0.0186

    VEGF

    VEGF, Hypoxia, and Angiogenesis

    54

    61

    up

    0.0186

    Pathways with overall q-value < 2%. Columns are the full Biocarta pathway name, the number of genes in the pathway, the percentage of genes that are expressed more highly in GBM relative to AA, the direction of change of the pathway in GBM vs. AA, and the q-value.

    Several themes emerged from the pathway analysis. The most statistically significant gene set identified in the meta-analysis was the hypoxia-inducible factor (HIF) pathway, which has been repeatedly implicated in GBM. This pathway of 31 genes was highly upregulated in GBM and performed extremely well in all four studies, ranking #1 twice, #3, and #9. The HIF1A gene encodes a transcription factor that is induced by hypoxia and that controls the expression of a set of genes that promote angiogenesis and invasion. Importantly, several of the top 30 genes on the meta-analysis gene list are direct HIF1A transcriptional targets, including VEGFA [50, 51], ADM [52], IGFBP2 [53], LDHA [54], and FN1 [55]. VEGF, in turn, induces expression of a number of collage subtypes and extracellular matrix proteins needed for the generation of new blood vessels and for invasion [56]. Among these are several additional genes listed among the top 30 on the meta-analysis list, including LAMC, COL4A1, COL4A2, COL1A2 and FN1. TMSB10 can also be regulated by VEGF [57]. When considered together, these genes suggest differential activation of the HIF1A/VEGF network in GBM when compared to AA. Figure 3 illustrates the interrelationship between HIF1A, VEGFA (whose pathway is ranked 21st), and related genes that are found among the top 30 on the meta-analysis list.
    Figure 3
    Figure 3

    Relationship of HIF1A and VEGFA to top meta-analysis genes. Relationships from the literature among HIF1A, VEGFA, and some of the top 30 meta-analysis genes.

    Discussion

    Because it is not feasible to control for all the factors influencing gene expression in studies of human tumor specimens, it is important to aggregate as much high-quality data as possible to eliminate these sources of bias. Given the large volume of microarray data being generated by laboratories across the world, taking advantage of these data through meta-analysis has become a fruitful and inexpensive yet under-utilized approach. In our comparison of AA and GBM, including only four, albeit very large, studies does leave our results somewhat dependent on the quality of these microarray data sets. However, our methodology disfavors genes whose top ranks are not consistent. As future high-quality data sets become available, they can be incorporated into this framework to validate and improve the stability and accuracy of these results without the worry that such additions will lead to dramatic alterations in the ordering of features. Such benefits offer practical, concrete reasons for our choice of meta-analytic methodology and provide promising evidence for applying this analysis workflow to other pressing conditions.

    Other similar studies may further benefit from using survival time as the phenotype of interest. Molecular signatures have been found to be better predictors of survival than histological grade in some cases [7, 8], i.e. the survival associated with tumors whose molecular profile was an "exception" to their histological grade was more strongly dictated by gene expression profile than by grade. However, Petalidis et al. [58] demonstrated that molecular signatures derived from histological grading of gliomas can be robust prognostic indicators whose accuracy in delineating survival subclasses may outperform classifiers trained on survival data, histological grade per se, and tumor subtypes defined by other studies. Nonetheless, meta-analyses that exploit survival data could potentially provide a more relevant list of candidate genes and a more powerful molecular classification of tumors. In this study, we had to restrict our focus to tumor grade because not all of our data sets had survival data available. This underscores the need for careful clinical annotation of the samples in these studies, even if a study does not involve survival analysis.

    Although there is already a deep literature on the molecular properties of glioblastoma multiforme, The Cancer Genome Atlas project (TCGA) chose GBM as its first cancer to study [59]. TCGA is an ongoing effort coordinated by the NIH in which numerous groups from many institutions collaboratively utilize the gamut of genome analysis technologies to accelerate our understanding of the molecular basis of cancer http://cancergenome.nih.gov/. It will be important to integrate our findings with those of TCGA (which compares GBM to normal tissue) and to identify pathways that are differentially expressed between GBM and AA with the hope of targeting these pathways therapeutically and increasing the survival of patients with GBM so that it approaches that of AA. Evidence that this approach is a useful one can be found in the fact that experimental and clinical studies have shown that agents that target the HIF1A/VEGF network can decrease tumor growth and prolong survival in both animals and humans [60, 61].

    Conclusion

    We have identified >900 probe sets and >20 pathways whose expression is statistically significantly different between GBM and AA. These feature lists are likely to be more accurate and stable because of the greater sensitivity and specificity that result from integration of data. Further, both the top genes and pathways implicate HIF1A/VEGF network activation as a major contributor to the increased growth and invasion displayed by GBM when compared to AA. The importance of these pathways is also evidenced by the utility of VEGF and HIF1A inhibitors in decreasing glioma growth and prolonging survival in vivo. This type of meta-analysis could be utilized to aid in the diagnosis and prognosis of malignant gliomas, and in the development of new therapies for these devastating tumors.

    Methods

    Data description and processing

    Both the Human Genome U133A and U133B Affymetrix platforms contain >22,000 probe sets with no overlap between these two arrays. The Human Genome U133 Plus 2.0 array is composed of all of the probe sets on each of these two arrays as well as 9,921 new probe sets, giving it >54,000 probe sets in total. To accommodate all four of our studies, we used the 22,215 probe sets from the U133A array. Note that these are one-channel microarrays, so that only one sample is hybridized to each microarray. Hence, no controls were involved in our direct comparison of AAs to GBMs and there is no bias due to different controls used.

    Petalidis et al. (2008) identified molecular signatures from primary human astrocytic tumors that define survival prognostic subclasses [58]. Phillips et al. (2006) determined molecular subclasses of human gliomas useful in prediction of prognosis and disease progression [9]. Sun et al. (2006) examined stem cell factor in primary human gliomas [62]. Tso et al. (2006) identified glioblastoma associated genes in primary and secondary human gliomas [15] and deposited the data at UCLA: http://genomics.ctrl.ucla.edu/~snelson/PublicDATASETS/Tso_CancerResearch_2006/. Although the data set of Freije et al. [7] would have satisfied our criteria, its samples heavily overlap with those of Tso et al. [15].

    Only the studies of Phillips et al. [9] and Tso et al. [15] had raw data (Affymetrix CEL) files available. We preprocessed these using RMA normalization [63] from the affy package [64], which is the same method employed by Petalidis et al. [58]. Sun et al. [62] already applied the normalization procedure of Li and Wong [65]. These differences in normalization technique, however, do not pose a hazard to this analysis due to our combination of "within-experiment" feature lists. All data was put on a log base 2 scale.

    Statistical analysis

    The implementation of the meta-analysis followed several steps. Firstly, features (either probe sets or pathways) were scored within each study. Secondly, features were ranked within each study by the magnitude (i.e. absolute value) of their respective statistic (say, a t-statistic), where the statistic closest to zero was given rank one, while that furthest from zero received the largest rank. Negative signs were then given to ranks corresponding to negative statistics to allow for asymmetric (i.e. if there are more upregulated than downregulated features, or vice versa) feature lists. Thirdly, a feature's ranks were summed across the four studies, assigning each feature a single rank sum. Lastly, these rank sums were compared to null ranks sums, derived by randomly permuting column labels and re-running the analysis, to obtain q-values. Note that according to this method, rank sums with larger magnitude are more significant.

    To create per-study gene lists, we employed the empirical Bayes package limma [66], which offers a moderated t-statistic [36] for each gene, along with its associated p-value and conservatively estimated [67] q-value. The empirical Bayes methodology, and this package in particular, have been found in independent bioinformatics comparisons to be highly robust [11] and a preferred analysis method for Affymetrix GeneChips [68]. Annotation was derived from the Affymetrix HG-U133A annotation files in CSV format, downloaded from NetAffx Analysis Center http://www.affymetrix.com/analysis.

    Gene sets were derived from the Gene Set Enrichment Analysis (GSEA) Molecular Signature Database v2.5 [69], where we extracted Biocarta pathways from the "C2: curated gene sets" collection that hold between 20 and 500 genes. This gave 178 Biocarta pathways. Gene set elements were converted from gene symbols to U133A probe sets using GSEA's chip2chip tool [37, 69]. Analysis of gene sets was performed using our SigPathway Bioconductor package [38], which compares each gene set to a column and row permutation null distribution separately, giving two normalized enrichment scores per gene set. These enrichment scores were used separately for the column permutation and row permutation q-values in Additional file 2. Otherwise, within-experiment gene set rank was computed using the minimum (in magnitude) of these two scores for the overall q-value.

    To compare the gene lists to the literature, we were able to automate our search of relevant citation counts for top genes by using the hgu133a package, which maps Affymetrix probe sets to Entrez Gene identifiers to PubMed identifiers, and the annotate package, which allows searching of PubMed abstracts. All statistical analysis was done in the R software [70] using packages from the Bioconductor project [71].

    Declarations

    Acknowledgements

    This work was supported by the National Center for Biomedical Computing grant (U54LM008748) to PJP, and the NIH Director's New Innovator Award (DP2OD002319) to MDJ.

    Authors’ Affiliations

    (1)
    Partners HealthCare Center for Personalized Genetic Medicine, Brigham and Women's Hospital, Boston, MA 02115, USA
    (2)
    Department of Neurosurgery, Brigham and Women's Hospital, Boston, MA 02115, USA
    (3)
    Center for Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
    (4)
    Children's Hospital Informatics Program, Boston, MA 02115, USA

    References

    1. Lin C, Lieu A, Lee K, Yang Y, Kuo T, Hung M, Loh J, Yen C, Chang C, Howng S, Hwang S: The conditional probabilities of survival in patients with anaplastic astrocytoma or glioblastoma multiforme. Surg Neurol. 2003, 60: 402-406. discussion 406., 10.1016/S0090-3019(03)00322-7View ArticlePubMedGoogle Scholar
    2. Mischel P, Shai R, Shi T, Horvath S, Lu K, Choe G, Seligson D, Kremen T, Palotie A, Liau L, Cloughesy T, Nelson S: Identification of molecular subtypes of glioblastoma by gene expression profiling. Oncogene. 2003, 22: 2361-2373. 10.1038/sj.onc.1206344View ArticlePubMedGoogle Scholar
    3. Ohgaki H, Kleihues P: Genetic pathways to primary and secondary glioblastoma. Am J Pathol. 2007, 170: 1445-1453. 10.2353/ajpath.2007.070011PubMed CentralView ArticlePubMedGoogle Scholar
    4. Raza S, Fuller G, Rhee C, Huang S, Hess K, Zhang W, Sawaya R: Identification of necrosis-associated genes in glioblastoma by cDNA microarray analysis. Clin Cancer Res. 2004, 10: 212-221. 10.1158/1078-0432.CCR-0155-3View ArticlePubMedGoogle Scholar
    5. Balesaria S, Brock C, Bower M, Clark J, Nicholson S, Lewis P, de Sanctis S, Evans H, Peterson D, Mendoza N, Glaser M, Newlands E, Fisher R: Loss of chromosome 10 is an independent prognostic factor in high-grade gliomas. Br J Cancer. 1999, 81: 1371-1377. 10.1038/sj.bjc.6693403PubMed CentralView ArticlePubMedGoogle Scholar
    6. Burton E, Lamborn K, Feuerstein B, Prados M, Scott J, Forsyth P, Passe S, Jenkins R, Aldape K: Genetic aberrations defined by comparative genomic hybridization distinguish long-term from typical survivors of glioblastoma. Cancer Res. 2002, 62: 6205-6210.PubMedGoogle Scholar
    7. Freije W, Castro-Vargas F, Fang Z, Horvath S, Cloughesy T, Liau L, Mischel P, Nelson S: Gene expression profiling of gliomas strongly predicts survival. Cancer Res. 2004, 64: 6503-6510. 10.1158/0008-5472.CAN-04-0452View ArticlePubMedGoogle Scholar
    8. Nutt C, Mani D, Betensky R, Tamayo P, Cairncross J, Ladd C, Pohl U, Hartmann C, McLaughlin M, Batchelor T, Black P, von Deimling A, Pomeroy S, Golub T, Louis D: Gene expression-based classification of malignant gliomas correlates better with survival than histological classification. Cancer Res. 2003, 63: 1602-1607.PubMedGoogle Scholar
    9. Phillips H, Kharbanda S, Chen R, Forrest W, Soriano R, Wu T, Misra A, Nigro J, Colman H, Soroceanu L, Williams P, Modrusan Z, Feuerstein B, Aldape K: Molecular subclasses of high-grade glioma predict prognosis, delineate a pattern of disease progression, and resemble stages in neurogenesis. Cancer Cell. 2006, 9: 157-173. 10.1016/j.ccr.2006.02.019View ArticlePubMedGoogle Scholar
    10. Marko N, Toms S, Barnett G, Weil R: Genomic expression patterns distinguish long-term from short-term glioblastoma survivors: a preliminary feasibility study. Genomics. 2008, 91: 395-406. 10.1016/j.ygeno.2008.01.002View ArticlePubMedGoogle Scholar
    11. Jeffery I, Higgins D, Culhane A: Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data. BMC Bioinformatics. 2006, 7: 359- 10.1186/1471-2105-7-359PubMed CentralView ArticlePubMedGoogle Scholar
    12. Miklos G, Maleszka R: Microarray reality checks in the context of a complex disease. Nat Biotechnol. 2004, 22: 615-621. 10.1038/nbt965View ArticlePubMedGoogle Scholar
    13. Barrett T, Troup D, Wilhite S, Ledoux P, Rudnev D, Evangelista C, Kim I, Soboleva A, Tomashevsky M, Edgar R: NCBI GEO: mining tens of millions of expression profiles--database and tools update. Nucleic Acids Res. 2007, 35: D760-765. 10.1093/nar/gkl887PubMed CentralView ArticlePubMedGoogle Scholar
    14. Hwang K, Kong S, Greenberg S, Park P: Combining gene expression data from different generations of oligonucleotide arrays. BMC Bioinformatics. 2004, 5: 159- 10.1186/1471-2105-5-159PubMed CentralView ArticlePubMedGoogle Scholar
    15. Tso C, Freije W, Day A, Chen Z, Merriman B, Perlina A, Lee Y, Dia E, Yoshimoto K, Mischel P, Liau L, Cloughesy T, Nelson S: Distinct transcription profiles of primary and secondary glioblastoma subgroups. Cancer Res. 2006, 66: 159-167. 10.1158/0008-5472.CAN-05-0077View ArticlePubMedGoogle Scholar
    16. Bozinov O, Köhler S, Samans B, Benes L, Miller D, Ritter M, Sure U, Bertalanffy H: Candidate genes for the progression of malignant gliomas identified by microarray analysis. Neurosurg Rev. 2008, 31: 83-89. discussion 89-90., 10.1007/s10143-007-0107-3View ArticlePubMedGoogle Scholar
    17. Allison D, Cui X, Page G, Sabripour M: Microarray data analysis: from disarray to consolidation and consensus. Nat Rev Genet. 2006, 7: 55-65. 10.1038/nrg1749View ArticlePubMedGoogle Scholar
    18. Cui X, Churchill G: Statistical tests for differential expression in cDNA microarray experiments. Genome Biol. 2003, 4: 210- 10.1186/gb-2003-4-4-210PubMed CentralView ArticlePubMedGoogle Scholar
    19. Fisher R: Statistical Methods for Research Workers. 1932, Edinburgh: Oliver and Boyd, 4.Google Scholar
    20. Fisher R: Combining independent tests of significance. American Statistician. 1948, 2 (5): 30-10.2307/2681650. 10.2307/2681650Google Scholar
    21. Folks J: Combination of independent tests. 1984, New York: North-Holland.Google Scholar
    22. Smith D, Saetrom P, Snøve OJ, Lundberg C, Rivas G, Glackin C, Larson G: Meta-analysis of breast cancer microarray studies in conjunction with conserved cis-elements suggest patterns for coordinate regulation. BMC Bioinformatics. 2008, 9: 63- 10.1186/1471-2105-9-63PubMed CentralView ArticlePubMedGoogle Scholar
    23. Rhodes D, Barrette T, Rubin M, Ghosh D, Chinnaiyan A: Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer. Cancer Res. 2002, 62: 4427-4433.PubMedGoogle Scholar
    24. Setlur S, Royce T, Sboner A, Mosquera J, Demichelis F, Hofer M, Mertz K, Gerstein M, Rubin M: Integrative microarray analysis of pathways dysregulated in metastatic prostate cancer. Cancer Res. 2007, 67: 10296-10303. 10.1158/0008-5472.CAN-07-2173View ArticlePubMedGoogle Scholar
    25. Choi J, Yu U, Kim S, Yoo O: Combining multiple microarray studies and modeling interstudy variation. Bioinformatics. 2003, 19 (Suppl 1): i84-90. 10.1093/bioinformatics/btg1010View ArticlePubMedGoogle Scholar
    26. Stevens J, Doerge R: Meta-analysis combines affymetrix microarray results across laboratories. Comp Funct Genomics. 2005, 6: 116-122. 10.1002/cfg.460PubMed CentralView ArticlePubMedGoogle Scholar
    27. Wang J, Coombes K, Highsmith W, Keating M, Abruzzo L: Differences in gene expression between B-cell chronic lymphocytic leukemia and normal B cells: a meta-analysis of three microarray studies. Bioinformatics. 2004, 20: 3166-3178. 10.1093/bioinformatics/bth381View ArticlePubMedGoogle Scholar
    28. Jung Y, Oh M, Shin D, Kang S, Oh H: Identifying differentially expressed genes in meta-analysis via Bayesian model-based clustering. Biom J. 2006, 48: 435-450. 10.1002/bimj.200410230View ArticlePubMedGoogle Scholar
    29. Ma S, Huang J: Regularized gene selection in cancer microarray meta-analysis. BMC Bioinformatics. 2009, 10: 1- 10.1186/1471-2105-10-1PubMed CentralView ArticlePubMedGoogle Scholar
    30. Guerra R, Allison D, Goldstein D: Meta-analysis and Combining Information in Genetics and Genomics. 2009, Taylor & Francis, Inc.Google Scholar
    31. Breitling R, Armengaud P, Amtmann A, Herzyk P: Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments. FEBS Lett. 2004, 573: 83-92. 10.1016/j.febslet.2004.07.055View ArticlePubMedGoogle Scholar
    32. Hong F, Breitling R: A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments. Bioinformatics. 2008, 24: 374-382. 10.1093/bioinformatics/btm620View ArticlePubMedGoogle Scholar
    33. Smyth G, Yang Y, Speed T: Statistical issues in microarray data. Functional Genomics: Methods and Protocols. Edited by: Brownstein M, Khodursky A. 2003, 224: [Methods in Molecular Biology]., Totowa, NJ: Humana Press.View ArticleGoogle Scholar
    34. Zintzaras E, Ioannidis J: Meta-analysis for ranked discovery datasets: theoretical framework and empirical demonstration for microarrays. Comput Biol Chem. 2008, 32: 38-46. 10.1016/j.compbiolchem.2007.09.003View ArticlePubMedGoogle Scholar
    35. Breitling R, Herzyk P: Rank-based methods as a non-parametric alternative of the T-statistic for the analysis of biological microarray data. J Bioinform Comput Biol. 2005, 3: 1171-1189. 10.1142/S0219720005001442View ArticlePubMedGoogle Scholar
    36. Smyth G: Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004, 3: Article3.Google Scholar
    37. Mootha V, Lindgren C, Eriksson K, Subramanian A, Sihag S, Lehar J, Puigserver P, Carlsson E, Ridderstråle M, Laurila E, Houstis N, Daly M, Patterson N, Mesirov J, Golub T, Tamayo P, Spiegelman B, Lander E, Hirschhorn J, Altshuler D, Groop L: PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003, 34: 267-273. 10.1038/ng1180View ArticlePubMedGoogle Scholar
    38. Tian L, Greenberg S, Kong S, Altschuler J, Kohane I, Park P: Discovering statistically significant pathways in expression profiling studies. Proc Natl Acad Sci USA. 2005, 102: 13544-13549. 10.1073/pnas.0506577102PubMed CentralView ArticlePubMedGoogle Scholar
    39. Storey J, Tibshirani R: Statistical significance for genomewide studies. Proc Natl Acad Sci USA. 2003, 100: 9440-9445. 10.1073/pnas.1530509100PubMed CentralView ArticlePubMedGoogle Scholar
    40. Irizarry R, Warren D, Spencer F, Kim I, Biswal S, Frank B, Gabrielson E, Garcia J, Geoghegan J, Germino G, Griffin C, Hilmer S, Hoffman E, Jedlicka A, Kawasaki E, Martínez-Murillo F, Morsberger L, Lee H, Petersen D, Quackenbush J, Scott A, Wilson M, Yang Y, Ye S, Yu W: Multiple-laboratory comparison of microarray platforms. Nat Methods. 2005, 2: 345-350. 10.1038/nmeth756View ArticlePubMedGoogle Scholar
    41. Nigro J, Misra A, Zhang L, Smirnov I, Colman H, Griffin C, Ozburn N, Chen M, Pan E, Koul D, Yung W, Feuerstein B, Aldape K: Integrated array-comparative genomic hybridization and expression array profiles identify clinically relevant molecular subtypes of glioblastoma. Cancer Res. 2005, 65: 1678-1686. 10.1158/0008-5472.CAN-04-2921View ArticlePubMedGoogle Scholar
    42. Pelloski C, Mahajan A, Maor M, Chang E, Woo S, Gilbert M, Colman H, Yang H, Ledoux A, Blair H, Passe S, Jenkins R, Aldape K: YKL-40 expression is associated with poorer response to radiation and shorter overall survival in glioblastoma. Clin Cancer Res. 2005, 11: 3326-3334. 10.1158/1078-0432.CCR-04-1765View ArticlePubMedGoogle Scholar
    43. Castellani P, Borsi L, Carnemolla B, Birò A, Dorcaratto A, Viale G, Neri D, Zardi L: Differentiation between high- and low-grade astrocytoma using a human recombinant antibody to the extra domain-B of fibronectin. Am J Pathol. 2002, 161: 1695-1700.PubMed CentralView ArticlePubMedGoogle Scholar
    44. Liu C, Yao J, Mercola D, Adamson E: The transcription factor EGR-1 directly transactivates the fibronectin gene and enhances attachment of human glioblastoma cell line U251. J Biol Chem. 2000, 275: 20315-20323. 10.1074/jbc.M909046199View ArticlePubMedGoogle Scholar
    45. Kang M, Kang S: Pharmacologic blockade of chloride channel synergistically enhances apoptosis of chemotherapeutic drug-resistant cancer stem cells. Biochem Biophys Res Commun. 2008, 373: 539-544. 10.1016/j.bbrc.2008.06.070View ArticlePubMedGoogle Scholar
    46. Im S, Gomez-Manzano C, Fueyo J, Liu T, Ke L, Kim J, Lee H, Steck P, Kyritsis A, Yung W: Antiangiogenesis treatment for gliomas: transfer of antisense-vascular endothelial growth factor inhibits tumor growth in vivo. Cancer Res. 1999, 59: 895-900.PubMedGoogle Scholar
    47. Dunlap S, Celestino J, Wang H, Jiang R, Holland E, Fuller G, Zhang W: Insulin-like growth factor binding protein 2 promotes glioma development and progression. Proc Natl Acad Sci USA. 2007, 104: 11736-11741. 10.1073/pnas.0703145104PubMed CentralView ArticlePubMedGoogle Scholar
    48. Ouafik L, Sauze S, Boudouresque F, Chinot O, Delfino C, Fina F, Vuaroqueaux V, Dussert C, Palmari J, Dufour H, Grisoli F, Casellas P, Brünner N, Martin P: Neutralization of adrenomedullin inhibits the growth of human glioblastoma cell lines in vitro and suppresses tumor xenograft growth in vivo. Am J Pathol. 2002, 160: 1279-1292.PubMed CentralView ArticlePubMedGoogle Scholar
    49. Mahesparan R, Read T, Lund-Johansen M, Skaftnesmo K, Bjerkvig R, Engebraaten O: Expression of extracellular matrix components in a highly infiltrative in vivo glioma model. Acta Neuropathol. 2003, 105: 49-57.PubMedGoogle Scholar
    50. Forsythe J, Jiang B, Iyer N, Agani F, Leung S, Koos R, Semenza G: Activation of vascular endothelial growth factor gene transcription by hypoxia-inducible factor 1. Mol Cell Biol. 1996, 16: 4604-4613.PubMed CentralPubMedGoogle Scholar
    51. Büchler P, Reber H, Büchler M, Shrinkante S, Büchler M, Friess H, Semenza G, Hines O: Hypoxia-inducible factor 1 regulates vascular endothelial growth factor expression in human pancreatic cancer. Pancreas. 2003, 26: 56-64. 10.1097/00006676-200301000-00010View ArticlePubMedGoogle Scholar
    52. Garayoa M, Martínez A, Lee S, Pío R, An W, Neckers L, Trepel J, Montuenga L, Ryan H, Johnson R, Gassmann M, Cuttitta F: Hypoxia-inducible factor-1 (HIF-1) up-regulates adrenomedullin expression in human tumor cell lines during oxygen deprivation: a possible promotion mechanism of carcinogenesis. Mol Endocrinol. 2000, 14: 848-862. 10.1210/me.14.6.848View ArticlePubMedGoogle Scholar
    53. Feldser D, Agani F, Iyer N, Pak B, Ferreira G, Semenza G: Reciprocal positive regulation of hypoxia-inducible factor 1alpha and insulin-like growth factor 2. Cancer Res. 1999, 59: 3915-3918.PubMedGoogle Scholar
    54. Semenza G, Jiang B, Leung S, Passantino R, Concordet J, Maire P, Giallongo A: Hypoxia response elements in the aldolase A, enolase 1, and lactate dehydrogenase A gene promoters contain essential binding sites for hypoxia-inducible factor 1. J Biol Chem. 1996, 271: 32529-32537. 10.1074/jbc.271.51.32529View ArticlePubMedGoogle Scholar
    55. Krishnamachary B, Berg-Dixon S, Kelly B, Agani F, Feldser D, Ferreira G, Iyer N, LaRusch J, Pak B, Taghavi P, Semenza G: Regulation of colon carcinoma cell invasion by hypoxia-inducible factor 1. Cancer Res. 2003, 63: 1138-1143.PubMedGoogle Scholar
    56. Infanger M, Grosse J, Westphal K, Leder A, Ulbrich C, Paul M, Grimm D: Vascular endothelial growth factor induces extracellular matrix proteins and osteopontin in the umbilical artery. Ann Vasc Surg. 2008, 22: 273-284. 10.1016/j.avsg.2007.11.002View ArticlePubMedGoogle Scholar
    57. Vasile E, Tomita Y, Brown L, Kocher O, Dvorak H: Differential expression of thymosin beta-10 by early passage and senescent vascular endothelium is modulated by VPF/VEGF: evidence for senescent endothelial cells in vivo at sites of atherosclerosis. FASEB J. 2001, 15: 458-466. 10.1096/fj.00-0051comView ArticlePubMedGoogle Scholar
    58. Petalidis L, Oulas A, Backlund M, Wayland M, Liu L, Plant K, Happerfield L, Freeman T, Poirazi P, Collins V: Improved grading and survival prediction of human astrocytic brain tumors by artificial neural network analysis of gene expression microarray data. Mol Cancer Ther. 2008, 7: 1013-1024. 10.1158/1535-7163.MCT-07-0177PubMed CentralView ArticlePubMedGoogle Scholar
    59. Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature. 2008, 455: 1061-1068.Google Scholar
    60. Jensen R, Ragel B, Whang K, Gillespie D: Inhibition of hypoxia inducible factor-1alpha (HIF-1alpha) decreases vascular endothelial growth factor (VEGF) secretion and tumor growth in malignant gliomas. J Neurooncol. 2006, 78: 233-247. 10.1007/s11060-005-9103-zView ArticlePubMedGoogle Scholar
    61. Vredenburgh J, Desjardins A, Herndon Jn, Dowell J, Reardon D, Quinn J, Rich J, Sathornsumetee S, Gururangan S, Wagner M, Bigner D, Friedman A, Friedman H: Phase II trial of bevacizumab and irinotecan in recurrent malignant glioma. Clin Cancer Res. 2007, 13: 1253-1259. 10.1158/1078-0432.CCR-06-2309View ArticlePubMedGoogle Scholar
    62. Sun L, Hui A, Su Q, Vortmeyer A, Kotliarov Y, Pastorino S, Passaniti A, Menon J, Walling J, Bailey R, Rosenblum M, Mikkelsen T, Fine H: Neuronal and glioma-derived stem cell factor induces angiogenesis within the brain. Cancer Cell. 2006, 9: 287-300. 10.1016/j.ccr.2006.03.003View ArticlePubMedGoogle Scholar
    63. Irizarry R, Bolstad B, Collin F, Cope L, Hobbs B, Speed T: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003, 31: e15- 10.1093/nar/gng015PubMed CentralView ArticlePubMedGoogle Scholar
    64. Gautier L, Cope L, Bolstad B, Irizarry R: affy--analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004, 20: 307-315. 10.1093/bioinformatics/btg405View ArticlePubMedGoogle Scholar
    65. Li C, Wong W: Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. Proc Natl Acad Sci USA. 2001, 98: 31-36. 10.1073/pnas.011404098PubMed CentralView ArticlePubMedGoogle Scholar
    66. Smyth G: Limma: linear models for microarray data. Bioinformatics and Computational Biology Solutions using R and Bioconductor. Edited by: Gentleman R, Carey V, Dudoit Irizarry R, Huber W. 2005, New York: Springer.Google Scholar
    67. Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society Series B. 1995, 57: 289-300.Google Scholar
    68. Choe S, Boutros M, Michelson A, Church G, Halfon M: Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset. Genome Biol. 2005, 6: R16- 10.1186/gb-2005-6-2-r16PubMed CentralView ArticlePubMedGoogle Scholar
    69. Subramanian A, Tamayo P, Mootha V, Mukherjee S, Ebert B, Gillette M, Paulovich A, Pomeroy S, Golub T, Lander E, Mesirov J: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005, 102: 15545-15550. 10.1073/pnas.0506580102PubMed CentralView ArticlePubMedGoogle Scholar
    70. , : R: A language and environment for statistical computing. 2008, Vienna: R Foundation for Statistical Computing.Google Scholar
    71. Gentleman R, Carey V, Bates D, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini A, Sawitzki G, Smith C, Smyth G, Tierney L, Yang J, Zhang J: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004, 5: R80- 10.1186/gb-2004-5-10-r80PubMed CentralView ArticlePubMedGoogle Scholar

    Copyright

    © Dreyfuss et al; licensee BioMed Central Ltd. 2009

    This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

    Advertisement