Skip to main content

Kinome expression profiling and prognosis of basal breast cancers



Basal breast cancers (BCs) represent ~15% of BCs. Although overall poor, prognosis is heterogeneous. Identification of good- versus poor-prognosis patients is difficult or impossible using the standard histoclinical features and the recently defined prognostic gene expression signatures (GES). Kinases are often activated or overexpressed in cancers, and constitute targets for successful therapies. We sought to define a prognostic model of basal BCs based on kinome expression profiling.


DNA microarray-based gene expression and histoclinical data of 2515 early BCs from thirteen datasets were collected. We searched for a kinome-based GES associated with disease-free survival (DFS) in basal BCs of the learning set using a metagene-based approach. The signature was then tested in basal tumors of the independent validation set.


A total of 591 samples were basal. We identified a 28-kinase metagene associated with DFS in the learning set (N = 73). This metagene was associated with immune response and particularly cytotoxic T-cell response. On multivariate analysis, a metagene-based predictor outperformed the classical prognostic factors, both in the learning and the validation (N = 518) sets, independently of the lymphocyte infiltrate. In the validation set, patients whose tumors overexpressed the metagene had a 78% 5-year DFS versus 54% for other patients (p = 1.62E-4, log-rank test).


Based on kinome expression, we identified a predictor that separated basal BCs into two subgroups of different prognosis. Tumors associated with higher activation of cytotoxic tumor-infiltrative lymphocytes harbored a better prognosis. Such classification should help tailor the treatment and develop new therapies based on immune response manipulation.


Breast cancer (BC) is heterogeneous. Gene expression profiling has identified molecular subtypes with different biological features and different outcome [15], including basal BCs. Basal BCs, which represent ~15-20% of invasive BCs are high-grade tumors, frequently do not express hormone receptors (HR) and ERBB2, and have the worst prognosis overall [6, 7]. Yet, basal tumors show prognostic heterogeneity. Both the standard histoclinical features and the recently defined prognostic gene expression signatures (GES) fail to identify patients who will relapse and patients who will not respond to chemotherapy [8]. Defining the molecular bases of this heterogeneity should help better understand these tumors, identify new therapeutic targets and more reliable predictors of survival and therapeutic response.

Kinases, which constitute ~1.7% of human genes [9], are activated or overexpressed in cancers [10], and constitute current or future targets for successful therapies [11]. Previously, we identified a 16-kinase GES that improved the prognostic classification of luminal BCs [12]. A similar approach was successfully applied to 44 estrogen receptor (ER)-negative BCs, including ERBB2-positive tumors and less than 50% of basal tumors [13]. To our knowledge, the "kinome approach" has never been applied to basal BCs only.

We tested the hypothesis that the expression of kinase genes may distinguish good- from poor-prognosis basal tumors.


Patients' selection

Institut Paoli-Calmettes (IPC) and public retrospective data sets from BC samples profiled using oligonucleotide microarrays were collected (Additional file 1, Table S1). All were pre-treatment samples of invasive non-inflammatory and non-metastatic adenocarcinomas. Microarray data from our set are available through Gene Expression Omnibus (series entry GSE21653).

The "IPC" training set included 261 patients who underwent initial surgery in our institution between 1992 and 2007. Each patient gave written informed consent and this study has been approved by our institutional ethics committee. All samples were histologically reviewed in a standardized fashion by a pathologist (JJ) to asses the ER, progesterone receptor (PR) and ERBB2 status by immunohistochemistry (IHC), and the percent of cancer cells (superior to 60%). Antibodies used (Dak®) were SP1 clone for ER, PgR636 clone for PR and Herceptest™ for ERBB2. The cut-off for positivity was 1% of stained tumor cells for HR, and ERBB2 status (0-3+ score, DAKO HercepTest kit scoring guidelines) was defined as positive if 3+ or 2+ with amplification confirmed by in situ hybridization. Vascular invasion and lymphocytic infiltration were assessed by Hematoxylin and Eosin Staining (HES).

Twelve pooled public data sets constituted the validation set including a total of 2254 samples [5, 7, 1423]. DFS was the best annotated survival information among these sets and was chosen as survival end-point.

Gene expression data analysis

Details are given in Additional file 2 (Supplementary Material). Data sets were processed as previously described [24]. Briefly, for the Agilent sets, we applied quantile normalization to available processed data. Regarding the Affymetrix sets, we used Robust Multichip Average (RMA) with the non-parametric quantile algorithm as normalization parameter [25]. Quantile normalization or RMA was done in R using Bioconductor and associated packages. The five molecular subtypes were determined using the single sample predictor (SSP) classifier [26].

Other analyses were centered on 771 kinase and kinase-interacting genes, based on an update of the initial kinome description [9, 13]. This list was matched with genes available on the Affymetrix U133 Plus 2.0 microarrays used to profile the IPC tumor set, finally retaining 661 genes (Additional file 3, Table S2). Analyses were both unsupervised and supervised. Supervised t-test analysis searched for genes upregulated in basal samples compared to at least one of the four other molecular subtypes, with 5% significance and a false discovery rate (FDR) lower than 5%. To circumvent the problem of dissymmetry of variables with a number of samples inferior to the number of genes being tested [14, 2731], we grouped the resulting genes with correlated and interdependent expression (gene subsets) in a single "metagene". Metagene expression value is the mean of the normalized expression values of all genes in the respective gene subset. Each metagene is treated as if it were a single gene, thereby reducing data dimension. We defined our metagenes by hierarchical clustering using data median-centered on genes, Pearson correlation as similarity metrics and centroid linkage clustering [32]. We identified robust gene clusters (minimal cluster size and minimal Pearson correlation were 15 and 0.6, respectively) using the quality-threshold (QT) clustering method [32]. A metagene was then computed for each selected cluster, and its prognostic incidence (as continuous value) evaluated using a Cox regression univariate analysis. Once a metagene associated with DFS (5% level significance) was defined, its expression value was used to divide the training set into two subgroups then tested for association with DFS. The cut-off was defined as the best threshold dividing the population into two subgroups with the greater DFS difference, "Metagene-Low" (expression value inferior to the threshold) and "Metagene-High" (expression value above) subgroups. This cut-off was applied to basal tumors of each validation series, and the define subgroups were then pooled before prognostic analysis.

We tested the prognostic value of two recently reported classifiers associated with survival in basal BCs: the medullary BC (MBC) classifier [33] and the HER2-derived prognostic predictor (HDPP) [34] associated with survival in both ERBB2+ and basal tumors. We also tested three multigene signatures identified as prognostic in breast cancer, independently of molecular subtypes: the Genomic Grade Index [16], the 76-gene signature [15], and the 70-gene signature [5]. Ontology analysis was done using Ingenuity Pathway Analysis (IPA) software (Redwood City, CA, USA) [35]. We also determined if immune signatures [36] were overrepresented in one prognostic subgroup using the gene set enrichment analysis (GSEA) algorithm and 1000 permutations [37].

Statistical analysis

Correlations between sample groups and histoclinical factors were calculated with the Fisher's exact test and the t-test when appropriate. DFS was calculated from the date of diagnosis until date of first relapse or death using the Kaplan-Meier method, and follow-up was measured to the date of last news for event-free patients. Survival curves were compared with the log-rank test. Univariate and multivariate prognostic analyses used the Cox regression method. Univariate analyses tested classical histoclinical factors: age (≤50 years vs. > 50), pathological tumor size (pT≤20 mm vs. > 20), lymph node status (pN positive vs. negative), SBR grade (I vs. II-III), IHC ER status (negative vs. positive), peritumoral vascular invasion (negative vs. positive) and lymphocytic infiltrate. Data regarding the delivery of adjuvant chemotherapy and hormone therapy were also analyzed. Analyses included also binary classifications based on the immune metagene, the MBC and HDDP classifiers (good vs. poor-prognosis subgroups). Multivariate analyses tested all variables with a p-value inferior to 0.05 in univariate analysis and excluded patients with one or more missing data. All statistical tests were two-sided at the 5% level of significance. Analyses were done using the survival package (version 2.30), in the R software (version 2.9.1). Our analysis adhered to the reporting recommendations for tumor marker prognostic studies (REMARK) [38].


Identification of a prognostic kinase expression signature

Five hundreds and ninety-one out of 2515 tumors were basal, including 73/261 in our IPC series and 518/2254 in the public sets (Table 1). These tumors displayed classical histoclinical features of basal BC (Additional file 4, Table S3). Clinical outcome, available for 2109 patients, correlated with subtypes with 5-year DFS of 83% for luminal A, 60% for luminal B, 77% for normal-like, 61% for basal, and 61% for ERBB2-overexpressing.

Table 1 Histoclinical features of basal-like tumors (IPC and validation series)

The 73 IPC basal tumors were used as training set for identifying a prognostic kinase GES from the 661-gene list. Supervised analysis identified 581 genes differentially expressed in basal versus at least one other subtype, including 360 genes overexpressed in basal tumors (Additional file 3, Table S2). Within this series most of the patients (90%) received adjuvant chemotherapy. Twenty-five patients developed relapse or death with a median time-to-relapse of 19 months, and forty-eight patients remained disease-free with a median follow-up of 64 months. The 5-year DFS was 63%. Hierarchical clustering of these tumors with the 360-gene set (Figure 1A) revealed two main clusters, I (n = 24) and II (n = 49), with 5-year DFS superior in cluster I (77% versus 56%; p = 0.22, log-rank test; Figure 1B). QT clustering identified three gene clusters with a major role in this discrimination (Figure 1A, and Additional file 5, Table S4). One included 21 genes not related to any specific biologic function. A second cluster was associated with the cell cycle. The third cluster (thereafter designed immune cluster) contained 28 genes, which for many were involved in immune signaling (e.g. BLK, BTK, FYN, SYK, ITK, JAK3, LCK, LCP2, PRKCB, and ZAP70). Visually, lower expression of this cluster was associated with more relapses (Figure 1A). We built a metagene for each gene cluster, and analyzed their correlation with DFS using univariate Cox regression analysis. Only the immune metagene correlated with DFS (HR = 0.32, 95%CI [0.17-0.68], p = 2.4E-3, Wald test; Additional file 6, Table S5). Resampling with 100,000 iterations showed only a 0.8% probability to find a metagene built from 28 random genes with similar or better prognostic correlation than the immune metagene.

Figure 1
figure 1

Hierarchical clustering of basal breast cancer. (A) Unsupervised hierarchical clustering of 73 non-metastatic non-inflammatory basal BCs from IPC with 360 genes coding for kinase or kinase-interacting proteins overexpressed in basal tumors. Each row represents a gene and each column a sample. The expression level of each gene in each sample is relative to its median abundance across the samples and is depicted according to the color scale shown under the matrix. Red and green indicate expression levels respectively above and below the median. Relapses are indicated in the stripe under the dendrogram: white for no relapse during follow-up, and grey for relapse. Two tumor clusters (I and II) are delineated by the vertical green line. To the right, vertical colored bars indicate the three clusters identified by the QT clustering method: purple, immune-related cluster; green, biologically unspecific cluster; red, proliferation-related cluster. (B) Kaplan-Meier disease-free survival curves for cluster I patients (n = 24), and cluster II patients (n = 49).

We defined two subgroups of basal tumors according to the immune metagene expression value: "Immune-High" if above the value threshold (n = 25) and "Immune-Low" if under (n = 48). No histoclinical factor including the lymphocyte infiltrate was different between the two subgroups (Additional file 7, Table S6). Survival was different, with 91% 5-year DFS in "Immune-High" subgroup versus 49% in "Immune-Low" (p = 0.005, log-rank test, Figure 2). On univariate analysis (Table 2), two factors were associated with DFS: vascular invasion (HR = 2.32, 95%CI [1.04-5.18], p = 0.04, Wald test) and immune metagene expression (HR = 0.21, 95%CI [0.06-0.70], p = 0.01, Wald test). They remained significant on multivariate analysis (Table 2).

Figure 2
figure 2

Disease-free survival and basal subgroups in the learning set. Kaplan-Meier disease-free survival curves of basal BC patients in the IPC series according to the subgroups "Immune-High" (n = 25) and "Immune-Low" (n = 48).

Table 2 Univariate and multivariate analyses by Cox regression of basal tumors, IPC series

We also performed a similar analysis on genes underexpressed in basal tumors, but it did not allow the identification of any robust gene clusters.

Validation of the prognostic signature

The expression of the immune metagene was studied in the independent panel of 518 basal tumors not used to define the predictor. Follow-up for DFS was annotated for 380 patients: 158 developed relapse or death with a median time-to-relapse of 30 months, and 222 remained disease-free with a median follow-up of 93 months. The 5-year DFS was 60%. At least 25 out of 28 (mean = 27) genes included in the immune metagene were common to each separate set (Additional file 1, Table S1). A total of 122 patients were defined as "Immune-High" and 396 as "Immune-Low". Their histoclinical features (including the lymphocyte infiltrate available in 56 out of 518 tumors) were well balanced, except SBR grading, more frequently II-III in the "Immune-High" subgroup (Table 3). The 5-year DFS was 78% in the "Immune-High" subgroup and 54% in the "Immune-Low" one (p = 1.62E-04, log-rank test, Figure 3A), confirming the prognostic value of the immune metagene. Analysis by data set showed that the mean difference of 5-year DFS between "Immune-high" and "Immune-low" cases was 25% (95%CI, [13 - 37], p = 0.0038, T-test).

Table 3 Histoclinical comparison of the two basal subgroups defined with the immune metagene in the independent validation series
Figure 3
figure 3

Disease-free survival and basal subgroups in the validation set. Kaplan-Meier disease-free survival curves of basal BC patients in the independent validation series according to the subgroups "Immune-High" and "Immune-Low". (A) in all patients (95 versus 285 patients respectively), and (B) in patients having received no systemic adjuvant therapy (39 versus 148 patients respectively).

On univariate analysis (Table 4), two factors correlated with DFS: lymph node involvement (HR = 1.53, 95%CI [1.04-2.25]; p = 0.03, Wald test) and immune metagene expression (HR = 0.45, 95%CI [0.29-0.69]; p = 2.9E-04, Wald test). On multivariate analysis, both remained significant.

Table 4 Univariate and multivariate (with and without MBC-based classifier) DFS analyses by Cox regression of basal tumors: public series

Comparison with existing classifiers

Two prognostic multigenic models have been reported in basal BC: the MBC and HDPP classifiers [33, 34]. We assessed their prognostic value in the present 518 basal tumors. On univariate analysis, the MBC classifier correlated with DFS, with a HR for relapse of 0.59 (95% CI [0.43-0.82], p = 0.0017) for good-prognosis patients as compared with poor-prognosis patients. In multivariate analysis including this classifier, our immune metagene classifier and lymph node status showed that both genomic classifiers were significant, whereas node involvement was not (Table 4), suggesting that the multigenic models have independent prognostic value. The HDPP classifier confirmed its prognostic value for ERBB2-overexpressing tumors in our series (n = 214), but not in the 518 basal samples: 5-year DFS was 63% for the good-prognosis patients versus 61% for the poor-prognosis patients (p = 0.62, log-rank test).

We also assessed the prognostic impact of three published major prognostic expression signatures recently reported in early breast cancer. In each data set, each sample was assigned a good or a poor prognosis based on each signature. Data sets were then pooled, and survival was compared between the predicted good-prognosis and poor-prognosis subgroups. Univariate DFS analysis performed in the basal subtype showed that none of these classifiers was associated with survival (Table 5). These results show the absence of informative value of these signatures in the basal subtype, by contrast with our classifier.

Table 5 Comparison of the prognostic value of the immune-metagene classifier with three available signatures, Disease-free survival, Cox univariate analysis

Prognostic and/or predictive value of the immune classifier?

To determine the link of the immune metagene with metastatic risk and/or with response to chemotherapy, we analyzed - within the series of 518 basal BCs - the 187 cases with available follow-up who had not received any adjuvant systemic therapy. In this set, "Immune-High" patients had a longer DFS than "Immune-Low" patients with 5-year DFS of 82 vs. 55% respectively (p = 4.75E-03, log-rank test; Figure 3B).

Next, we studied the capacity of our model to predict pathological complete response (pCR) after anthracycline-based neoadjuvant chemotherapy. Information was available for two data sets with the following regimens: weekly paclitaxel and fluorouracil-doxorubicin-cyclophosphamide (55 patients with pCR and 70 without) [22], and fluorouracil-epirubicin-cyclophosphamide or docetaxel followed with docetaxel-epirubicin (34 patients with pCR and 99 without) [23]. We identified 98 basal cases out of the 258 included samples. "Immune-High" patients experienced more pCR (59%) than "Immune-Low" patients (43%), but the difference was not significant (Odds ratio = 1.87, 95%CI [0.57-6.40], p = 0.29, Fisher's exact test).

Altogether, these observations suggested that the immune metagene is associated with relapse risk, whereas its association with response to chemotherapy deserves to be tested in larger series.

The immune kinase metagene correlates with cytotoxic T-cell response

We next sought to elucidate the type of immune response associated with our metagene. Ontology analysis of the 28 genes using IPA software confirmed association with many pathways involved in immune response [35], particularly in lymphocyte activation processes, such as "T-cell receptor signaling", "CD28 signaling in T helper cells", "NK cell signaling", "PLC signaling", "Role of NFAT in regulation of the immune response", "NF-kB signaling", or "IL2 signaling" (Additional file 8 - Table S7, and Additional file 9 - Figure S1). The upregulation of BTK, CD3E, FYN, ITK, LCK, LCP2, PRKCs, SYK, ZAP70 and JAK3 clearly identified an activated profile of the lymphocytic lineage.

To better explore the molecular differences between "Immune-High" and "Immune-Low" basal BCs, we searched for the genes differentially expressed between the two subgroups in the IPC series using the whole genome and not only the kinome. Supervised analysis (0.1% FDR) identified 532 differential genes. Most of them (n = 506) were overexpressed in "Immune-High" samples (Additional file 10, Table S8A). Ontology analysis showed that these genes were particularly involved in immune response, and more specifically in adaptive immunity (Additional file 10, Table S8B). To confirm this observation, we applied GSEA using reported T-cell, CD8+ T-cell and B-cell expression signatures [36]. As shown in Additional file 11 (Figure S2), an enrichment of genes involved in T-cell, CD8+ T-cell and B-cell signatures was found in "Immune-High" cases.


Basal BCs are poor-prognosis tumors, which require both improvement of our ability to predict the clinical outcome for better tailoring treatment and identification of new therapeutic targets. Their prognosis is heterogeneous, and it is currently impossible to predict which patients will or will not relapse using classical histoclinical factors or the recently reported prognostic GES, notably those currently tested in clinical trials [39]. In the same way, the HDDP classifier [34] identified using ERBB2+ tumors, failed to classify basal samples. Prognostic analyses should be done per subtype [40].

Analysis of kinase and kinase-related genes might help develop new targeted therapies. We report a kinase-based model that divides basal BCs into two subgroups with balanced histoclinical factors but different survival (25% difference for 5-year DFS). This model is based on the expression of an immune 28-gene metagene. Identified in a learning set, its prognostic value was confirmed in an independent data set of 518 cases. The model outperformed the individual current prognostic factors on multivariate analysis, both in the learning and validation sets. Patients with high expression of the immune metagene had a better DFS than other patients. This prognostic value remained when applied to patients treated without any adjuvant chemotherapy, suggesting a link with the metastatic potential. An additional link with chemosensitivity cannot be excluded as "Immune-High" patients experienced a higher, but non significant, pCR rate than "Immune-Low" patients.

The favorable prognostic impact of the immune response, particularly the T-cell response, has been reported in ER-negative [8, 13, 14, 26, 4143] or ERBB2+ BCs [8, 28, 31, 44]. Similar finding was reported in 97 triple-negative BCs [45] in which increased expression of interferon-related genes tended to confer better prognosis. In our previous study [33] and the present one, we focused on basal BC only, since this subtype is even more homogeneous than the triple-negative group [46]. In our previous study, we defined a 368-gene prognosticator, which confirmed the positive influence of TH1 cells and high cytotoxic activity. This model outperformed two immune signatures in multivariate analysis of DFS [28, 42]. We showed here that both the immune kinase model and this previous model maintain their prognostic value in multivariate analysis, suggesting their independence. It is of note that our "immune-metagene" model presented a prognostic value in luminal B (p = 0.03, Wald test) and ERBB2-overexpressing cases (p = 0.02, Wald test), but not in luminal A and normal-like samples (p = 0.58 and 0.98, respectively, Wald test). Moreover, it is worth noting that previously published signatures (Genomic grade index, 70-gene signature, and 76-gene signature), mainly based on proliferation, failed to separate good from poor prognosis basal breast cancers.

Ingenuity analysis of both the 28 genes and the genes differentially expressed between the two subgroups defined by our kinase immune metagene confirmed that the differences between these histoclinically similar subgroups are in immune genes. Upregulated kinome-genes suggest the presence of an activated lymphocyte infiltrate in "Immune-High" patients. This lymphocyte-activated status is due to stimulations by cytokines (JAK3, STAT1, STAT4, TBX21 and TH1 cytokine receptors), by T-cell receptor (T-cell receptor chains [alpha, beta and gamma], CD3E, CD3D, CD247/CD3Z, CD28, CD27, CD2, CD8A, CD4, LAG3, MAL, LAT2, PIM2), by B-cell receptor (CD19, CD79b, CD27, CD40, IGJ, IGK@, IGH@, BTK, BLNK, BANK1), and by anti-tumor receptors (KLRK1, KLRB1, GAB3, SLAMF1, SLAMF6-8). The lymphocyte infiltrate is strictly TH1-biased with the overexpression of IL2RG, IL23RB and IL7R involved in lymphocyte survival, of IL12RB1, IL15RA, IL18BP, and IL21R TH1-biased receptors, of STAT1, STAT4, and TBX21 TH1 transcription factors, and of several interferon-inducible molecules (GVIN1, ISG20, GBP2, IRF1, IRF4, IRF7, and IRF8). This agrees with increased levels of cytotoxic granules and pore-forming molecules (VAMP1, GZMA, GZMB, GZMH, GZMK, GNL, PRF1, CFLAR, CASP1, and CASP10). Interestingly, there are also several genes encoding activated memory lymphocyte recruitment such as IL16, XCL1, CCL5, CCR5, CXCL9, CXCR3, CCL19, CCR7, and CXCR6 (mostly helper and cytotoxic T-cells), and CXCL13, CXCR5 (activated B-cells), among which some are strictly produced by activated T-cells, such as CCL4 and CCL5. Finally, we also found transcripts involved in lymphocyte migration and/or activation (ITGAL and ITGB2 heterodimers, ITGA4, ITGAX, ITGB7, SELL, SELP, SELPL, and CD69).

Thus, we show that the immune response, and notably the adaptive cytotoxic TH1-cell response [47], influence survival of basal BC patients. Despite the small size of the independent population with lymphocyte infiltrate data available, which does not allow to really conclude about the impact of the quantity of lymphocyte infiltrate on the expression of immune response-related genes, the absence of correlation between the immune metagene and lymphocyte infiltration in our cohort and in two independent data sets [5, 8] as well as the function of genes, suggest that this influence does not depend on the degree of lymphocyte infiltrate, but on the efficiency of its cytotoxic activation status. The differential expression of these "immune genes" is probably also due to a variable expression of epithelial-derived molecules [13, 42, 48], which activate (in "Immune-High" cases) or repress (in "Immune-Low" cases) the local immune response to the tumor. These hypotheses deserve further investigation to understand the respective role of tumor-infiltrative lymphocytes and cancer cells on cancer history.


In conclusion, we propose a robust prognostic subdivision of basal BC based on the expression of 28 genes, involved in immune response and notably the cytotoxic T-cell response. Tumors associated with higher activation of cytotoxic tumor-infiltrative lymphocytes have a better prognosis, and are likely to better respond to chemotherapy. Such classification should help tailor treatment. Furthermore, since adaptive immunity seems to play a pivotal role [49] immune response manipulation might be an efficient way of treating or preventing these poor-prognosis tumors [47, 50].



breast cancer


disease-free survival


estrogen receptor


false discovery rate


gene expression signature


gene set enrichment analysis


Hematoxylin and Eosin Staining


HER2-derived prognostic classifier


hormone receptor




Ingenuity Pathway Analysis


Institut Paoli-Calmettes


medullary breast cancer


progesterone receptor




robust multichip average


single sample predictor.


  1. Bertucci F, Finetti P, Cervera N, Maraninchi D, Viens P, Birnbaum D: Gene expression profiling and clinical outcome in breast cancer. Omics. 2006, 10: 429-443. 10.1089/omi.2006.10.429

    Article  CAS  PubMed  Google Scholar 

  2. Sorlie T, Tibshirani R, Parker J, Hastie T, Marron JS, Nobel A, Deng S, Johnsen H, Pesich R, Geisler S, Demeter J, Perou CM, Lønning PE, Brown PO, Børresen-Dale AL, Botstein D: Repeated observation of breast tumor subtypes in independent gene expression data sets. Proc Natl Acad Sci USA. 2003, 100: 8418-8423. 10.1073/pnas.0932692100

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  3. Sørlie T, Wang Y, Xiao C, Johnsen H, Naume B, Samaha RR, Børresen-Dale AL: Distinct molecular mechanisms underlying clinically relevant subtypes of breast cancer: gene expression analyses across three different platforms. BMC Genomics. 2006, 7: 127- 10.1186/1471-2164-7-127

    Article  PubMed Central  PubMed  Google Scholar 

  4. Perou CM, Sørlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, Fluge O, Pergamenschikov A, Williams C, Zhu SX, Lønning PE, Børresen-Dale AL, Brown PO, Botstein D: Molecular portraits of human breast tumours. Nature. 2000, 406: 747-752. 10.1038/35021093

    Article  CAS  PubMed  Google Scholar 

  5. van 't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, Schreiber GJ, Kerkhoven RM, Roberts C, Linsley PS, Bernards R, Friend SH: Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002, 415: 530-536. 10.1038/415530a

    Article  PubMed  Google Scholar 

  6. Bertucci F, Finetti P, Cervera N, Esterni B, Hermitte F, Viens P, Birnbaum D: How basal are triple-negative breast cancers?. Int J Cancer. 2008, 123: 236-240. 10.1002/ijc.23518

    Article  CAS  PubMed  Google Scholar 

  7. van de Vijver MJ, He YD, van't Veer LJ, Dai H, Hart AA, Voskuil DW, Schreiber GJ, Peterse JL, Roberts C, Marton MJ, Parrish M, Atsma D, Witteveen A, Glas A, Delahaye L, van der Velde T, Bartelink H, Rodenhuis S, Rutgers ET, Friend SH, Bernards R: A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med. 2002, 347: 1999-2009. 10.1056/NEJMoa021967

    Article  CAS  PubMed  Google Scholar 

  8. Desmedt C, Haibe-Kains B, Wirapati P, Buyse M, Larsimont D, Bontempi G, Delorenzi M, Piccart M, Sotiriou C: Biological processes associated with breast cancer clinical outcome depend on the molecular subtypes. Clin Cancer Res. 2008, 14: 5158-5165. 10.1158/1078-0432.CCR-07-4756

    Article  CAS  PubMed  Google Scholar 

  9. Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S: The protein kinase complement of the human genome. Science. 2002, 298: 1912-1934. 10.1126/science.1075762

    Article  CAS  PubMed  Google Scholar 

  10. Futreal PA, Coin L, Marshall M, Down T, Hubbard T, Wooster R, Rahman N, Stratton MR: A census of human cancer genes. Nat Rev Cancer. 2004, 4: 177-83. 10.1038/nrc1299

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  11. Krause DS, Van Etten RA: Tyrosine kinases as targets for cancer therapy. N Engl J Med. 2005, 353: 172-187. 10.1056/NEJMra044389

    Article  CAS  PubMed  Google Scholar 

  12. Finetti P, Cervera N, Charafe-Jauffret E, Chabannon C, Charpin C, Chaffanet M, Jacquemier J, Viens P, Birnbaum D, Bertucci F: Sixteen-kinase gene expression identifies luminal breast cancers with poor prognosis. Cancer Res. 2008, 68: 767-776. 10.1158/0008-5472.CAN-07-5516

    Article  CAS  PubMed  Google Scholar 

  13. Speers C, Tsimelzon A, Sexton K, Herrick AM, Gutierrez C, Culhane A, Quackenbush J, Hilsenbeck S, Chang J, Brown P: Identification of novel kinase targets for the treatment of estrogen receptor-negative breast cancer. Clin Cancer Res. 2009, 15: 6327-6340. 10.1158/1078-0432.CCR-09-1107

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  14. Schmidt M, Böhm D, von Törne C, Steiner E, Puhl A, Pilch H, Lehr HA, Hengstler JG, Kölbl H, Gehrmann M: The humoral immune system has a key prognostic impact in node-negative breast cancer. Cancer Res. 2008, 68: 5405-5413. 10.1158/0008-5472.CAN-07-5206

    Article  CAS  PubMed  Google Scholar 

  15. Wang Y, Klijn JG, Zhang Y, Sieuwerts AM, Look MP, Yang F, Talantov D, Timmermans M, Meijer-van Gelder ME, Yu J, Jatkoe T, Berns EM, Atkins D, Foekens JA: Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet. 2005, 365: 671-679.

    Article  CAS  PubMed  Google Scholar 

  16. Sotiriou C, Wirapati P, Loi S, Harris A, Fox S, Smeds J, Nordgren H, Farmer P, Praz V, Haibe-Kains B, Desmedt C, Larsimont D, Cardoso F, Peterse H, Nuyten D, Buyse M, Van de Vijver MJ, Bergh J, Piccart M, Delorenzi M: Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. J Natl Cancer Inst. 2006, 98: 262-272. 10.1093/jnci/djj052

    Article  CAS  PubMed  Google Scholar 

  17. University of North Carolina Microarray Database.

  18. Ivshina AV, George J, Senko O, Mow B, Putti TC, Smeds J, Lindahl T, Pawitan Y, Hall P, Nordgren H, Wong JE, Liu ET, Bergh J, Kuznetsov VA, Miller LD: Genetic reclassification of histologic grade delineates new clinical subtypes of breast cancer. Cancer Res. 2006, 66: 10292-10301. 10.1158/0008-5472.CAN-05-4414

    Article  CAS  PubMed  Google Scholar 

  19. Desmedt C, Piette F, Loi S, Wang Y, Lallemand F, Haibe-Kains B, Viale G, Delorenzi M, Zhang Y, d'Assignies MS, Bergh J, Lidereau R, Ellis P, Harris AL, Klijn JG, Foekens JA, Cardoso F, Piccart MJ, Buyse M, Sotiriou C, : Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the TRANSBIG multicenter independent validation series. Clin Cancer Res. 2007, 13: 3207-3214. 10.1158/1078-0432.CCR-06-2765

    Article  CAS  PubMed  Google Scholar 

  20. Pawitan Y, Bjöhle J, Amler L, Borg AL, Egyhazi S, Hall P, Han X, Holmberg L, Huang F, Klaar S, Liu ET, Miller L, Nordgren H, Ploner A, Sandelin K, Shaw PM, Smeds J, Skoog L, Wedrén S, Bergh J: Gene expression profiling spares early breast cancer patients from adjuvant therapy: derived and validated in two population-based cohorts. Breast Cancer Res. 2005, 7: R953-964. 10.1186/bcr1325

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  21. Miller LD, Smeds J, George J, Vega VB, Vergara L, Ploner A, Pawitan Y, Hall P, Klaar S, Liu ET, Bergh J: An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival. Proc Natl Acad Sci USA. 2005, 102: 13550-13555. 10.1073/pnas.0506230102

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  22. Hess KR, Anderson K, Symmans WF, Valero V, Ibrahim N, Mejia JA, Booser D, Theriault RL, Buzdar AU, Dempsey PJ, Rouzier R, Sneige N, Ross JS, Vidaurre T, Gómez HL, Hortobagyi GN, Pusztai L: Pharmacogenomic predictor of sensitivity to preoperative chemotherapy with paclitaxel and fluorouracil, doxorubicin, and cyclophosphamide in breast cancer. J Clin Oncol. 2006, 24: 4236-4244. 10.1200/JCO.2006.05.6861

    Article  CAS  PubMed  Google Scholar 

  23. Bonnefoi H, Potti A, Delorenzi M, Mauriac L, Campone M, Tubiana-Hulin M, Petit T, Rouanet P, Jassem J, Blot E, Becette V, Farmer P, André S, Acharya CR, Mukherjee S, Cameron D, Bergh J, Nevins JR, Iggo RD: Validation of gene signatures that predict the response of breast cancer to neoadjuvant chemotherapy: a substudy of the EORTC 10994/BIG 00-01 clinical trial. Lancet Oncol. 2007, 8: 1071-1078. 10.1016/S1470-2045(07)70345-5

    Article  CAS  PubMed  Google Scholar 

  24. Theillet C, Adelaide J, Louason G, Bonnet-Dorion F, Jacquemier J, Adnane J, Longy M, Katsaros D, Sismondi P, Gaudray P: FGFRI and PLAT genes and DNA amplification at 8p12 in breast and ovarian cancers. Genes Chromosomes Cancer. 1993, 7: 219-226. 10.1002/gcc.2870070407

    Article  CAS  PubMed  Google Scholar 

  25. Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003, 4: 249-264. 10.1093/biostatistics/4.2.249

    Article  PubMed  Google Scholar 

  26. Hu Z, Fan C, Oh DS, Marron JS, He X, Qaqish BF, Livasy C, Carey LA, Reynolds E, Dressler L, Nobel A, Parker J, Ewend MG, Sawyer LR, Wu J, Liu Y, Nanda R, Tretiakova M, Ruiz Orrico A, Dreher D, Palazzo JP, Perreard L, Nelson E, Mone M, Hansen H, Mullins M, Quackenbush JF, Ellis MJ, Olopade OI, Bernard PS, Perou CM: The molecular portraits of breast tumors are conserved across microarray platforms. BMC Genomics. 2006, 7: 96- 10.1186/1471-2164-7-96

    Article  PubMed Central  PubMed  Google Scholar 

  27. Rosenwald A, Wright G, Chan WC, Connors JM, Campo E, Fisher RI, Gascoyne RD, Muller-Hermelink HK, Smeland EB, Giltnane JM, Hurt EM, Zhao H, Averett L, Yang L, Wilson WH, Jaffe ES, Simon R, Klausner RD, Powell J, Duffey PL, Longo DL, Greiner TC, Weisenburger DD, Sanger WG, Dave BJ, Lynch JC, Vose J, Armitage JO, Montserrat E, López-Guillermo A: The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. N Engl J Med. 2002, 346: 1937-1947. 10.1056/NEJMoa012914

    Article  PubMed  Google Scholar 

  28. Rody A, Holtrich U, Pusztai L, Liedtke C, Gaetje R, Ruckhaeberle E, Solbach C, Hanker L, Ahr A, Metzler D, Engels K, Karn T, Kaufmann M: T-cell metagene predicts a favorable prognosis in estrogen receptor-negative and HER2-positive breast cancers. Breast Cancer Res. 2009, 11: R15- 10.1186/bcr2234

    Article  PubMed Central  PubMed  Google Scholar 

  29. Davicioni E, Anderson JR, Buckley JD, Meyer WH, Triche TJ: Gene expression profiling for survival prediction in pediatric rhabdomyosarcomas: a report from the children's oncology group. J Clin Oncol. 2010, 28: 1240-1246. 10.1200/JCO.2008.21.1268

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  30. Lenz G, Wright G, Dave SS, Xiao W, Powell J, Zhao H, Xu W, Tan B, Goldschmidt N, Iqbal J, Vose J, Bast M, Fu K, Weisenburger DD, Greiner TC, Armitage JO, Kyle A, May L, Gascoyne RD, Connors JM, Troen G, Holte H, Kvaloy S, Dierickx D, Verhoef G, Delabie J, Smeland EB, Jares P, Martinez A, Lopez-Guillermo A: Stromal gene signatures in large-B-cell lymphomas. N Engl J Med. 2008, 359: 2313-2323. 10.1056/NEJMoa0802885

    Article  CAS  PubMed  Google Scholar 

  31. Bertucci F, Borie N, Roche H, Bachelot T, Le Doussal JM, Macgrogan G, Debono S, Martinec A, Treilleux I, Finetti P, Esterni B, Extra JM, Geneve J, Hermitte F, Chabannon C, Jacquemier J, Martin AL, Longy M, Maraninchi D, Fert V, Birnbaum D, Viens P: Gene expression profile predicts outcome after anthracycline-based adjuvant chemotherapy in early breast cancer. Breast Cancer Res Treat. 2011, 127: 363-373. 10.1007/s10549-010-1003-z

    Article  CAS  PubMed  Google Scholar 

  32. Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA. 1998, 95: 14863-14868. 10.1073/pnas.95.25.14863

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  33. Sabatier R, Finetti P, Cervera N, Lambaudie E, Esterni B, Mamessier E, Tallet A, Chabannon C, Extra JM, Jacquemier J, Viens P, Birnbaum D, Bertucci F: A gene expression signature identifies two prognostic subgroups of basal breast cancer. Breast Cancer Res Treat. 2011, 126: 407-420. 10.1007/s10549-010-0897-9

    Article  CAS  PubMed  Google Scholar 

  34. Staaf J, Ringnér M, Vallon-Christersson J, Jönsson G, Bendahl PO, Holm K, Arason A, Gunnarsson H, Hegardt C, Agnarsson BA, Luts L, Grabau D, Fernö M, Malmström PO, Johannsson OT, Loman N, Barkardottir RB, Borg A: Identification of subtypes in human epidermal growth factor receptor 2--positive breast cancer reveals a gene signature prognostic of outcome. J Clin Oncol. 2010, 28: 1813-1820. 10.1200/JCO.2009.22.8775

    Article  PubMed  Google Scholar 

  35. Ingenuity Pathways Analysis v8.0-2602 (IPA).

  36. Palmer C, Diehn M, Alizadeh AA, Brown PO: Cell-type specific gene expression profiles of leukocytes in human peripheral blood. BMC Genomics. 2006, 7: 115- 10.1186/1471-2164-7-115

    Article  PubMed Central  PubMed  Google Scholar 

  37. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005, 102: 15545-15550. 10.1073/pnas.0506580102

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  38. McShane LM, Altman DG, Sauerbrei W, Taube SE, Gion M, Clark GM, : Reporting recommendations for tumor marker prognostic studies. J Clin Oncol. 2005, 23: 9067-9072. 10.1200/JCO.2004.01.0454

    Article  PubMed  Google Scholar 

  39. Sotiriou C, Piccart MJ: Taking gene-expression profiling to the clinic: when will molecular signatures become relevant to patient care?. Nat rev cancer. 2007, 7: 545-553. 10.1038/nrc2173

    Article  CAS  PubMed  Google Scholar 

  40. Pusztai L: Gene expression profiling of breast cancer. Breast Cancer Res. 2009, 11 (Suppl 3): S11- 10.1186/bcr2430

    Article  PubMed Central  PubMed  Google Scholar 

  41. Calabrò A, Beissbarth T, Kuner R, Stojanov M, Benner A, Asslaber M, Ploner F, Zatloukal K, Samonigg H, Poustka A, Sültmann H: Effects of infiltrating lymphocytes and estrogen receptor on gene expression and prognosis in breast cancer. Breast Cancer Res Treat. 2009, 116: 69-77. 10.1007/s10549-008-0105-3

    Article  PubMed  Google Scholar 

  42. Teschendorff AE, Miremadi A, Pinder SE, Ellis IO, Caldas C: An immune response gene expression module identifies a good prognosis subtype in estrogen receptor negative breast cancer. Genome Biol. 2007, 8: R157- 10.1186/gb-2007-8-8-r157

    Article  PubMed Central  PubMed  Google Scholar 

  43. Teschendorff AE, Caldas C: A robust classifier of high predictive value to identify good prognosis patients in ER-negative breast cancer. Breast Cancer Res. 2008, 10: R73- 10.1186/bcr2138

    Article  PubMed Central  PubMed  Google Scholar 

  44. Alexe G, Dalgin GS, Scanfeld D, Tamayo P, Mesirov JP, DeLisi C, Harris L, Barnard N, Martel M, Levine AJ, Ganesan S, Bhanot G: High expression of lymphocyte-associated genes in node-negative HER2+ breast cancers correlates with lower recurrence rates. Cancer Res. 2007, 67: 10669-10676. 10.1158/0008-5472.CAN-07-0539

    Article  CAS  PubMed  Google Scholar 

  45. Kreike B, van Kouwenhove M, Horlings H, Weigelt B, Peterse H, Bartelink H, van de Vijver MJ: Gene expression profiling and histopathological characterization of triple-negative/basal-like breast carcinomas. Breast Cancer Res. 2007, 9: R65- 10.1186/bcr1771

    Article  PubMed Central  PubMed  Google Scholar 

  46. Bertucci F, Finetti P, Cervera N, Charafe-Jauffret E, Buttarelli M, Jacquemier J, Chaffanet M, Maraninchi D, Viens P, Birnbaum D: How different are luminal A and basal breast cancers?. Int J Cancer. 2009, 124: 1338-1348. 10.1002/ijc.24055

    Article  CAS  PubMed  Google Scholar 

  47. Disis ML, Park KH: Immunomodulation of Breast Cancer via Tumor Antigen Specific Th1. Cancer Res Treat. 2009, 41: 117-121. 10.4143/crt.2009.41.3.117

    Article  PubMed Central  PubMed  Google Scholar 

  48. Charafe-Jauffret E, Ginestier C, Monville F, Finetti P, Adélaïde J, Cervera N, Fekairi S, Xerri L, Jacquemier J, Birnbaum D, Bertucci F: Gene expression profiling of breast cell lines identifies potential new basal markers. Oncogene. 2006, 25: 2273-2284. 10.1038/sj.onc.1209254

    Article  CAS  PubMed  Google Scholar 

  49. Disis ML: Immune regulation of cancer. J Clin Oncol. 2010, 28: 4531-5438. 10.1200/JCO.2009.27.2146

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  50. Disis ML, Bernhard H, Jaffee EM: Use of tumour-responsive T cells as cancer treatment. Lancet. 2009, 373: 673-683. 10.1016/S0140-6736(09)60404-9

    Article  PubMed Central  CAS  PubMed  Google Scholar 

Download references


Our work is supported by Institut Paoli-Calmettes, Inserm, Institut National du Cancer (Tr 2008), Association pour le Recherche contre le Cancer, Ligue Nationale contre le Cancer (label DB), and Fondation pour la Recherche Médicale (RS 2009). This paper is dedicated to the memory of our friend Stéphane RAYNAUD.

Author information

Authors and Affiliations


Corresponding author

Correspondence to François Bertucci.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

RS and FB designed the concept of the study. FB, EL and PV were responsible for provision of study patients. JJ was responsible for pathologic examination. RS, PF and NC were responsible for samples and data gathering, nucleic acids extraction and microarray experiments. RS, PF and FB were responsible for statistical analysis and interpretation. RS, FB and DB wrote the final manuscript. All authors read and approved the final manuscript.

Electronic supplementary material

Additional file 1: Table S1: Description of the breast cancer data sets. (XLS 24 KB)

Additional file 2: Supplementary materials. (DOC 53 KB)

Additional file 3: Table S2: List of 661 kinase and kinase-related genes analyzed. (XLS 290 KB)

Additional file 4: Table S3: Molecular subtypes and histoclinical features of the pooled data sets. (XLS 22 KB)

Additional file 5: Table S4: Description of the three genes clusters identified with QT clustering. (XLS 29 KB)


Additional file 6: Table S5: Univariate DFS analysis of metagenes by Cox regression of basal tumors: IPC series. (XLS 18 KB)


Additional file 7: Table S6: Histoclinical comparison of the two basal subgroups defined with the immune metagene (IPC series). (XLS 22 KB)

Additional file 8: Table S7: Ingenuity canonical pathways associated with the immune-related cluster. (XLS 28 KB)


Additional file 9: Figure S1: Biological network of genes included in or associated with our 28-gene model. A fine-tuning between inhibitor (phosphatases) and activator (kinases) signals regulates lymphocyte anti-tumor immunity. AK and Pyk2 are two of the major kinases that become tyrosine phosphorylated following lymphocyte stimulation. Both are associated to Lck. Lck (lymphocyte specific kinase) and Fyn are cytoplasmic tyrosine kinases of the Src family expressed in T-cells and natural killer (NK) cells, under the T cell receptor (TCR) or Natural cytotoxicity receptor (NCR). Their activity is critical for T and NK cell receptors-mediated signaling, leading to normal T- and NK-cell development and activation. Increased Fyn transcript and protein content in T cells can be observed with high T cell activity. Square 1. LAT is a linker protein essential for activation of T lymphocytes. Its rapid tyrosine-phosphorylation upon TCR stimulation recruits downstream signaling molecules for membrane targeting and activation. LAT is a substrate for Syk/Zap70 kinase and an immediate substrate for both Lck and Syk kinases. Its phosphorylation is an early event leading to T-cell activation. Both Lck and Syk phosphorylate the ITAM-like motifs on LAT, which is essential for induction of the interaction of LAT with downstream signaling molecules such as Grb2, PLC-γ1 and for activation of MAPK-ERK pathways. ZAP70 is thus at the crossroad of several signaling pathways that control lymphocyte development and function and cell survival in response to a wide variety of activator signals coming from the NCR, TCR or other receptor involved in anti-tumor immunity. Square 2. Cytokines receptors express at the membrane also regulate lymphocyte activation through the JAK-STAT signaling pathway. Square 3. In B, T and NK cells, the inhibition of these kinases is mostly mediated by protein tyrosine phosphatases (PTP), regrouping members of the SHP family (SHP-1, SHP-2) or LYP family. These proteins inhibit effector phase functions by dephosphorylating a wide spectrum of phospho-proteins involved in hematopoietic cell signaling. (PPT 1009 KB)


Additional file 10: Table S8: Genes differentially expressed between the "Immune-High" and "Immune-Low" basal tumor subgroups in the IPC set. (A) Summary of the 532 genes differentially expressed (Student's t-test). (B) Canonical pathways associated with the genes overexpressed in "Immune-High" tumors in the IPC set. (XLS 189 KB)


Additional file 11: Figure S2: Correlation of basal breast cancer subgroups (IPC series) and leukocyte cell-type gene expression signatures (GSEA algorithm). (A) Results of GSEA with the three tested. NES, normalized enrichment score; FDR, false discovery rate. (B) Enrichment plots for the three significant signatures: T-cell, CD8+ T-cell, and B-cell (from left to right). (PPT 204 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Sabatier, R., Finetti, P., Mamessier, E. et al. Kinome expression profiling and prognosis of basal breast cancers. Mol Cancer 10, 86 (2011).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: