Global proteomic analysis of oral cavity squamous cell carcinoma was performed to identify changes that reflect patient outcomes.
To identify differentially expressed proteins associated with patient outcomes and to explore the use of imaging mass spectrometry as a clinical tool to identify clinically relevant proteins.
Two-dimensional separation of digested peptides generated from 43 specimens with high-resolution mass spectrometry identified proteins associated with disease-specific death, distant metastasis, and loco-regional recurrence. RNA expressions had been correlated to protein levels to test transcriptional regulation of clinically relevant proteins. Imaging mass spectrometry explored an alternative platform for assessing clinically relevant proteins that would complement surgical pathologic diagnosis.
Seventy-two peptide features were found to be associated with 3 patient outcomes: disease-specific death (9), distant metastasis (16), and loco-regional recurrence (39); 8 of them were associated with multiple outcomes. Functional ontology revealed major changes in cell adhesion and calcium binding. Thirteen RNAs showed strong correlation with their encoded proteins, implying transcriptional control. Reduction of DSP, PKP1, and TRIM29 was associated with significantly shorter time to onset of distant metastasis. Reduction of PKP1 and TRIM29 correlated with poorer disease-specific survival. Additionally, S100A8 and S100A9 reductions were verified for their association with poor prognosis using imaging mass spectrometry, a platform more adaptable for use with surgical pathology.
Using global proteomic analysis, we have identified proteins associated with clinical outcomes. The list of clinically relevant proteins observed will provide a means to develop clinical assays for prognosis and optimizing treatment selection.
Head and neck squamous cell carcinoma (HNSCC) is the seventh most prevalent cancer worldwide, representing a major international health problem.1 Although treatment paradigms have evolved significantly over time, there has been relatively little change in 5-year survival since the 1970s, with the exception of survival among patients with human papillomavirus–driven oropharynx cancer.2 To improve patient survival, significant effort has been made to guide treatment decisions based on using histologic criteria to prognosticate, but these efforts have been to no avail to date.3,4 As a result, some patients may receive additional treatment to eliminate the possibility of metastasis, and consequently develop unnecessary morbidity. On the other hand, less aggressive therapy runs the risk of the development of local or regional persistent or recurrence disease, which carries significant mortality.5 Hence, there is an urgent need of innovative technology and biomarkers to better evaluate tumor aggressiveness and potential for metastasis, as well as to predict response to treatments. Although effort has been made to classify HNSCCs using genomic patterns with the goal of guiding treatment decisions,6–8 proteomic analysis reflects the functional phenotype more directly than profiling DNA or RNA aberrations alone. For example, changes such as DNA copy number amplification could be canceled by transcriptional silencing via DNA methylation, or increased mRNA expression could be reversed by posttranscriptional silencing as with microRNAs.9 Therefore, efforts to use proteomics as an alternative classification method have increased.10–13
Many previous attempts have been made to capture these differential protein expression patterns using patient tissue or plasma, or even closely related cancer cell lines.14–18 Many proteins have been detected that show strong potential to be developed further as biomarkers for clinical applications; these previous studies have had some limitations. Generally these studies have had small cohort sizes with limited clinical information. They have been focused on diagnostic markers that simply distinguish tumor from normal tissue. They have been restricted to relatively high-abundance proteins because of the limited nature of the separation method used, such as 2-dimensional gel electrophoresis.19 Our study differs in that we have used (1) an extensive tissue repository, with some cases having more than 10 years of detailed follow-up data; (2) a label-free quantitative proteomic approach to compare tumor tissues, and (3) 2-dimensional separation prior to mass spectrometry (MS) detection to provide wider proteome coverage. The MS–based proteomic analysis platform used in this study has been validated previously with 2 different cell lines.20 This platform enables the study of several thousand proteins per experiment, yielding a robust data set from which to assess differential expression of proteins associated with different physiological events during tumorigenesis.
Here, we present protein expression that we have found to be associated with patient outcomes in oral cavity squamous cell carcinoma (OCSCC) identified from global proteomic analysis. Previously, we have found that HNSCC produces site-specific molecular signatures by anatomic site for the oral cavity, oropharynx, and larynx.21 Hence, we focused this study solely on OCSCC. We also characterized the regulation of proteome changes at the transcriptional level as a part of a systems biology approach by integrating our global proteomic results with RNA expression data.
MATERIALS AND METHODS
Characteristics of Patient Population
Patients recruited for this study were undergoing treatment for histologically confirmed OCSCC at Montefiore Medical Center (Bronx, New York). All patients consented to participate in this study under protocols approved by the institutional review board. Only patients undergoing treatment with curative intent and no prior treatment for OCSCC were included in the study. From each patient, we obtained invasive primary tumor tissue at the time of surgical treatment. Patients were assessed, as appropriate, following their initial surgery. Medical records were kept noting the disease status at follow-up, the presence (or absence) of disease, the location and extent of failure, and death. The preliminary patient characteristics are described in Table 1. De-identified clinical and research data were used for the study. Primary tumor taken at surgery underwent multiple analyses including routine histopathology, genomic, transcriptional, and proteomic analyses. All OCSCC tissue samples were processed for total RNA, genomic DNA, and protein, and these banked samples were the source of the material to be used in this study.
Quality Control for Tumor Tissues
Study samples derived from resection specimens were taken fresh. Approximately 0.5 cm3 of tissue from invasive tumor was harvested in each case. This was further trisected and the middle one-third served as tumor control. Percentage of tumor and extent of necrosis were estimated by determining the number of microscopic fields containing tumor or necrotic tissue in relation to the total number of microscopic fields of the control sample on the contiguous slide. Normal mucosa was taken at a site at a significant distance away from the tumor identified by the surgeon (excisional biopsy) intraoperatively, or by the pathologist (resection specimen) when processing the specimen for frozen section or final diagnosis.
RNA Extraction and Detection
The RNA expression profiles used in the current study are a selected subset from our larger head and neck profiling project. All of the tissues were snap frozen within 30 minutes of surgical resection and kept at −80°C until analysis. Tissues were homogenized using a Brinkmann model PT 10/35 tissue homogenizer (KINEMATICA Inc, Bohemia, New York). RNA was isolated from tumor specimens in 1 mL volume of TRIzol (15596; Invitrogen, Carlsbad, California) following the manufacturer's suggested protocol and quantified on a NanoDrop ND-1000 spectrophotometer (NanoDrop, Wilmington, Delaware). Five hundred nanograms of total RNA was subjected to a linear amplification and biotin labeling protocol (Illumina TotalPrepRNA Amplification Kit; Ambion, Grand Island, New York) and used on Illumina HumanHT-12 v3 Expression BeadChips (Illumina, San Diego, California). These chips contained 48 000 probes targeting genes and known alternative splice variants from the RefSeq database release 17 and UniGene build 188, as well as more than 1000 control beads to assay sample RNA quality, labeling reaction success, hybridization stringency, and signal generation. All expression data were quantile normalized and background subtracted before analysis using BeadStudio software (Illumina).
Protein Extraction and Digestion
Proteins were extracted and digested as previously reported.20 Briefly, the protein pellets left after RNA isolation were reconstituted, reduced with Tris(2-carboxyethyl)phosphine, and digested successively with cyanogen bromide, endoproteinase Lys-C, and trypsin. Using the same tissue pieces for extractions facilitated the accurate comparison of RNA and protein expression levels. The concentration of the digested peptides was measured using a NanoDrop spectrophotometer at A280, and a 100-μg aliquot was used for strong cation-exchange separation.
Two-Dimensional Liquid Chromatography–MS and Peptide Feature Array Generation
Digested peptide mixtures obtained were separated into 6 fractions using strong cation-exchange spin columns (polysulfoethyl A resin, 12 μm, 300 Å; PolyLC, Columbia, Maryland). Peptides were eluted stepwise with aqueous ammonium acetate solutions at salt concentrations of 60, 70, 80, 120, 200, and 300 mM in 25% acetonitrile at pH adjusted to 3 with formic acid. A mixture of 4 standard peptides (with different retention times and masses)20 was added to each salt cut fraction as an internal control and analyzed using liquid chromatography–MS (LC-MS). The peptides served as markers for retention time alignment of the LC-MS spectra and used for correcting mass measurement drift. Each fraction was separated online by reversed-phase chromatography using a microflow UltiMate 3000 HPLC System (Dionex Corporation, Sunnyvale, California) interfaced with a QStar Pulsar i mass spectrometer (AB/Sciex, Foster City, California). Once acquired, the data were analyzed as previously described, as is shown in Figure 1.20 Peak picking was performed with a program built in house,22,23 which reduced the raw spectra to peptide feature lists containing monoisotopic mass, retention times, and intensities. Next, to alleviate any between-run retention time variation, retention times in different runs were aligned by first correcting with internal standards added as described previously,20 then by robust spline fitting. Subsequently, peptide features from different samples were matched using accurate mass and time tag approach with 0.3 Da and 1 minute tolerance each. Peptide feature arrays were constructed where each row contained the intensity value of each peptide obtained from different samples and statistical analysis performed.
Identification of Significant Peptides With Liquid Chromatography–Tandem MS
The generated peptide feature arrays contain peptide masses only. The sequence of each peptide in the array is unknown. Once associated with clinical outcomes, target lists of masses and retention times are generated for peptide sequence identification. Patient samples containing these targeted peptides and having the highest intensities were reanalyzed by liquid chromatography–tandem MS as described in Figure 2. Nanospray liquid chromatography–tandem MS analysis was performed on an Agilent 1100 HPLC system (Agilent, Palo Alto, California) and a QStar XL mass spectrometer (AB/Sciex). Acquired tandem MS spectra were manually curated to ensure high spectral quality and peptide identification. Tandem MS spectra were searched against the human International Protein Index database (ver. 3.79) for sequence identification using the Mascot algorithm (Matrix Science, Boston, Massachusetts) and adjusted for a false discovery rate of 1%.
Imaging In Situ MS for Additional Verification
Samples were prepared as 8-μm sections from 5 frozen OCSCC specimens and thaw mounted directly onto indium tin oxide–coated conductive glass slides (Delta Technologies, Loveland, Colorado). Sections were dried under vacuum for 30 minutes followed by fixation in graded ethanol (70%, 95%, and 98%; 30 seconds each) baths. Samples were dried again under vacuum for 10 minutes and spray coated with 10 mg/mL sinapinic acid (Fluka, St Louis, Missouri) in 60% acetonitrile and 0.2% trifluoroacetic acid aqueous solution using ImagePrep (Bruker Daltonics, Fremont, California). Matrix-assisted laser desorption/ionization (MALDI) mass spectra were acquired directly from the tissue sections with an ultrafleXtreme MALDI-TOF/TOF instrument (Bruker Daltonics). Data acquisition was performed in positive linear mode at constant laser power, optimized for detection of peaks in the 4- to 25-kDa range. A total of 800 laser shots were accumulated from each pixel of 200 × 200 μm. The data were baseline subtracted and rendered into an ordered array of spectra with xy coordinates. Images were visualized with selected ion images using FlexImaging (Bruker Daltonics). Once the MS spectra were acquired, the slides were washed with ethanol to remove sinapinic acid, stained with hematoxylin and eosin, and photomicrographed at high resolution (Mirax; 3DHISTECH Kft, Budapest, Hungary). Photomicrographs were examined and then marked with regions of interest and overlaid with the images extracted from MS spectra for evaluation.
Identification of Proteins Discovered on Imaging In Situ MS
Imaging in situ MS (IMS) generates an array of proteins with molecular weight versus intensities. Hence, an additional step is required to find the identity of the peaks with the certain molecular weight measured in IMS. To achieve this, remaining tissues (∼50 mg) after making sections were mechanically homogenized in 1 mL of 60% acetonitrile and 0.2% trifluoroacetic acid aqueous solution on ice and ultracentrifuged at 40 000g and 4°C for 20 minutes. Soluble protein concentration was determined using a 2xD Quant Kit (GE Healthcare Life Sciences, Pittsburgh, Pennsylvania). About 30 μg proteins from each sample was separated on 16.5% Mini-PROTEAN Tris/Tricine Urea Precast Gel (Biorad, Hercules, California) with peptide molecular weight marker (Sigma, St Louis, Missouri) to target small protein ranges better. Entire lanes were excised using disposable grid cutters (The Gel Company, San Francisco, California) to produce 40 bands each, and band slices were placed in 96-well plates. Slices in the molecular weight range of 3 to 25 kDa were digested with trypsin. Tryptic peptides were separated on an online nano-HPLC interfaced to a QStar XL mass spectrometer (AB/Sciex). MS and alternating MS/MS data were acquired in data-dependent acquisition mode to acquire m/z and peptide sequences to be searched against SwissProt_AC 2013_04 using Mascot (Matrix Science) version 2.4.1. Protein hits were filtered to less than 1% false discovery rate and included those that were identified with at least 2 unique peptides.
Statistical Analysis of Protein and RNA Expression
As part of our initial screen for candidate proteins with possible clinical relevance, we restricted our analyses to a subset (n = 28) of OCSCC patients for whom we had at least 2 years of clinical follow-up, and assessed differences in peptide levels isolated from the 6 salt cuts between patients with and without evidence of disease progression within 2 years of diagnosis. Clinical outcome measures included disease-related mortality, defined as death with active HNSCC disease (either persistence, resistance, or metastasis) present; presence or absence of distant metastasis (DM); and presence or absence of a local/regional recurrence. Student t tests were performed using the aforementioned clinical parameters comparing mean peptide levels measured in OCSCC tumor samples. After the identities of the significantly associated peptides were established, proteins for further analysis were selected based upon the presence of multiple peptides with clinical associations corresponding to the same protein or protein family if a specific isotype could not be clearly determined. RNA expression data for each of these proteins were also examined using a gene expression array database generated on the same patient population. Pearson correlation analysis was done to identify RNAs that have strong correlation with protein expression (Figure 3, A through C). The protein and RNA expression data were transformed to remove the scalar differences using the function [X − M]/[(Q3 − Q1)/1.35] where X is the intensity, M is the median and (Q3 − Q1) represents the interquartile range. RNAs that showed significant correlations were then analyzed to determine whether significant associations were also observed with clinical outcomes that were in line with those observed with the matching proteins. Finally, univariate Kaplan-Meier analysis was performed to reassess the clinical associations on the full cohort of 43 OCSCC samples, taking into account time to outcome for cancer mortality, DM, and local/regional recurrence. Patients were classified into 2 groups based on their tumor protein/RNA expression levels at diagnosis using quartile cut points.
RESULTS
Patient Characteristics
Table 1 shows the population demographics and clinicopathologic characteristics for the patient cohort used in the current study. All of the tumors in this study were squamous cell carcinomas of the oral cavity. Most of the cases presented with a TNM stage of III or IV (81%; n = 35) with 67% (n = 29) being positive for lymph node metastasis. The sex ratio was 3:1 male to female with a mean age at diagnosis of 62.6 years (±14.9 SD).
Seventy-two Peptides Derived From 7269 Features Are Found to Be Associated With Patient Outcomes
Comparative LC-MS approaches rely on the observation that the peak intensity is proportional to the concentration of the peptide in the sample in most cases.24 Therefore, measuring peak intensities and comparing these among multiple LC-MS data from different samples leads to a relative quantitation of peptide concentrations among samples. Figure 1 shows this procedure used to generate, separate, analyze, and identify peptide features enabling detection of differential expression of proteins among samples. After removing bins with confounding values due to peak overlap, this process generated an array with 7269 peptide features. To focus on features with potential clinical relevance, the lists from each of the strong cation-exchange fractions were examined for clinical association by taking each peptide and separating it into 2 populations based upon the binary value of the clinical parameter. Twenty-eight OCSCC specimens for which we had at least 24 months of clinical follow-up data were tested for 3 outcomes; disease-specific death (DSD), DM, and loco-regional recurrence. The intensity values from the MS run for each patient were compared using a Student t test and those with a significant difference between the 2 populations were kept. Overall, 72 unidentified features were selected that discriminated clinical outcomes with a P value of at least .01. Nine of the features associated only with DSD, 16 with DM, and 39 with loco-regional recurrence, as is shown in Table 2. Eight peptides among the 72 were associated with more than one clinical outcome, which is to be expected because many of the same pathways could lead to the same clinical outcome. Selected patient samples with the highest levels of the clinically significant features were reanalyzed for sequence identification of the peptides as shown in Figure 2. All 72 peptides were down-regulated in patients with poor outcomes by 2- to 200-fold from the mean of each group.
Functional Classification of Peptides Associated With Patient Outcomes
Proteins containing the digested peptide features associated with the different clinical parameters were independently separated into functional groups using the PANTHER classification system project25 based upon their gene ontologies as well as Qiagen's (Valencia, California) Ingenuity Pathway Analysis software. Most of the proteins identified as having clinical associations in the current study related to structural components of the cytoskeleton and calcium ion binding (Figure 4, A through C) from the PANTHER analysis, whereas the Ingenuity Pathway Analysis functional classification system showed significant functional clusters in inflammatory disease, cell death and survival, cancer, and cellular growth and proliferation (Table 3). Down-regulation of members of the structural molecule group would result in major changes in the structural architecture of the cell affecting cell-cell and cell-matrix interactions, intracellular trafficking, and cell signaling. The largest related group of proteins consisted of members of the keratin (KRT) family, specifically KRT 5, 8, 14, and 16 and one or more members of the KRT 6 family. Most of these have known functions in the differentiation state of keratinocytes or have been seen to be coregulated in genetic syndromes or cancer.26–31 Another group of proteins that had a significant association with DM are associated with cell-cell contact. Desmoplakin (DSP) and plakophilins (PKPs) are proteins associated with desmosome formation although they also have been shown to have other functions, for example transcriptional regulation.32 The remaining proteins, specifically TPM1, ACTB, ACTN4, LMNA, and TUBB, are important elements in the formation of the cytoskeleton itself. Reorganization of the cytoskeleton using proteins in this class is a vital process in the modulation of cell shape and migration during oncogenesis.
The binding class was comprised of 2 major functional subclasses: cytoskeletal structural element binding and calcium binding. As expected many proteins possessed both functions and are part of the cytoskeletal components mentioned above. Examples of these would be DSP, filamin-A, α-actinin-4, and spectrin α chain. Other calcium ion binding proteins include calmodulin or S100A8 and S100A9, which complex to form calprotectin, which explains almost identical localization in Figure 5, a through e and f through j. Ca2+ homeostasis has been demonstrated to be a key element in mechanisms such as migration and differentiation and its disruption is a common theme in events involving tumorigenesis.33 Some of the proteins in these 2 categories are implicated in several signaling pathways, for example HMGB (p53 pathway) and HSPB1 (p38 MAPK and VEGF signaling pathway).
Protein and RNA Expressions of TRIM29, DSP, and PKP1 Are Correlated and Their Reductions Are Associated With Poor Prognosis
To gain insights into possible regulatory mechanisms, we analyzed our RNA expression data for the peptides that exhibited clinical associations. For the current study we limited the analysis to a single clinical outcome, DM, to focus on studying OCSCC's predilection to early DM. Thirty-three of the 43 patients in the current study had corresponding RNA expression data on the identical tumor samples. There is no obligatory correlation between RNA expression levels and protein levels because posttranscriptional modulation and epigenetic influence, such as suppression via microRNA targeting, can reduce protein translation in the presence of high RNA levels. A correlation analysis between clinically significant peptide features from the array and RNA expression levels from the same tumor samples yielded 17 peptides, which directly correlated with RNA expression, suggesting regulation of these proteins at the transcription level. Because outliers can distort the size of the correlation coefficient, we examined scatter plots of protein expression versus RNA expression to see if any outliers were present, and whether the relationship was linear or curved. Examples of the correlation between RNA and protein expression for 3 peptides are shown in Figure 3: DSP, the tripartite motif-containing protein 29 (Trim29), and PKP1. Correlation of protein and RNA expression was not skewed by outliers and is evenly distributed across the range of expression. Mantel-Haenszel hazard ratios for either RNA or protein demonstrate statistically significant differences in 7 of the 17 peptides for time to onset of DM. These differences would not have been observed by RNA expression analysis alone, as is shown in Table 4, which demonstrates the advantage of using the proteomic strategy. Kaplan-Meier survival analysis (Figure 6, A through F) on time to DM demonstrated that both low RNA and peptide levels of Trim29, PKP1, or DSP are associated with decreased time to DM. Even though the initial t tests did not show a significant association between these 3 peptides and DSD, we also performed Kaplan-Meier survival analysis on time to death from OCSCC for these 3 proteins and observed a similar trend in the case of Trim29 and PKP1 but not for DSP (Figure 7, A through F).
Low S100A8 Expression Is Associated With Poor Prognosis and Verified Using Imaging Mass Spectral Assessment of Patient Tissues
Correlating RNA and protein expressions for validation can only be applied to the limited number of proteins under transcriptional control. Both S100A8 and S100A9 mRNA expression levels did not correlate with the corresponding protein levels. However, low S100A8 and S100A9 expressions were clearly associated with a poor prognosis in the global proteomic study, and this has been verified using an IMS assessment of patient tissue samples. Unlike LC-MS, IMS can analyze whole proteins directly from tissue sections as expression maps and provides another platform for validation for proteins. We provide an example of the proteins S100A8 and S100A9 as IMS expression maps in Figure 5, a through e and f through j. Morphologic details of the tissues are illustrated in Figure 5, k through o, and zoomed-in areas around the black arrows in Figure 5, k through o, are shown in Figure 5, p through t. When comparing tissue sections from 5 patients with different outcomes using IMS, the images from 2 patients with poor prognosis (Figure, 5 a, b, f, and g) had significantly lower expression of S100A8 and S100A9 than the images from 3 patients (Figure 5, c through e and h through j) who survived without loco-regional recurrence or DM, consistent with the global proteomic study (Table 2).
COMMENT
Technological advances have allowed us to begin to see the global changes in gene and protein expressions that commonly occur in cancer. The challenge will be to successfully integrate this knowledge to identify markers that could be used to aid in the treatment of cancer. Correlating markers with clinical outcomes will eventually aid physicians in the treatment of their patients with the goal of a more individualized approach to prognosis and therapy. As technology has advanced, we are now gaining a more thorough understanding of how transcriptional regulation through mechanisms like epigenetic DNA modifications or translational regulation via mechanisms like microRNA silencing can alter protein expression and ultimately cell phenotype. To date the major focus for the identification of biomarkers has been RNA expression. The problem with this approach is that clinical outcomes ultimately arise from changes in protein expression or posttranslational modifications. There can be many factors that can diminish RNAs usefulness as a surrogate marker for protein expression. Small noncoding RNA like microRNA can effectively neutralize increases in mRNA levels by interfering with translation. A single microRNA can regulate the effective concentrations of many different mRNAs, which in turn may each be under independent regulatory feedback loops. The more degrees of separation between a marker and an outcome, the greater the probability the marker will be affected by multiple agents and not perform well in a larger independent population. In the current study we have taken a global proteomic approach that bypasses many of these problems because clinical outcomes have to arise from a change in proteins. Because an exhaustive analysis of all of the mechanisms involved in the regulation of the proteins, that is, digested peptides discovered in this study is well beyond the scope of a single report, we have taken the first steps to integrate these analyses in an ad hoc manner to select potentially transcriptionally regulated targets and demonstrate the ability of global RNA and proteomics to verify and complement the strengths of each platform while minimizing the weaknesses.
Reduced Cell-Cell Adhesion Through Down-Regulated DSP, PKP1, and KRTs Contributes to Poor Prognosis
We observed that patients in our cohort who developed DM had significantly lower DSP at both the RNA and peptide levels. Low levels of DSP were also associated with DSD. Desmosomes are highly organized intercellular junctions that provide mechanical integrity to tissues by anchoring intermediate filaments to sites of strong adhesion.34 DSP and PKPs are major components of desmosomes that couple cytoskeletal elements to the plasma membrane at the cell-cell or cell-substrate adhesions.34 DSP binds KRTs via its C terminus, facilitating strong intercellular adhesion necessary to maintain tissue morphogenesis and architecture.32 DSP and PKP reduction have also been linked to dramatic loss of desmosomal functions in many diseases including cancers. For instance, DSP has been shown to function as a tumor suppressor in non–small cell lung cancer.35
Yang et al35 reported that DSP levels are reduced via DNA methylation as a primary mechanism in 73% of the lung cancer lines and 61% of the primary tumors they examined. We did not observe a strong relationship between DSP expression and DNA methylation in our cohort (data not shown), which suggests an alternative mechanism of silencing in OCSCC. They went on to demonstrate that overexpression of DSP led to increased plakoglobin (γ-catenin). In turn, it led to a decrease in TCF/LEF transcriptional activity and decreased expression of target genes such as Axin2 and MMP14. Silencing DSP via interfering RNA led to the opposite state for the previously mentioned genes. In the current study, tumors that led to DM also had low levels of DSP as well as low RNA levels of γ-catenin and elevated RNA levels of β-catenin. However, we did not observe any consistent statistically significant changes in the RNA expression levels of TCF/LEF, Axin2, or MMP14.
Plakophilins are proteins also involved in desmosome formation.36 In the current study, lower levels of PKP1 protein and RNA were associated with DM and disease-specific survival. Sobolik-Delmaire et al37 demonstrated that unlike the other members of its family, PKP1 is not a major element in the assembly of desmosomes. Yet it is translocated into the nucleus, where it can bind single-stranded DNA even though it lacks a canonical nuclear localization sequence. South et al38 showed that PKP1 appears to play a role in whether the desmosomes formed are calcium dependent or independent. This in turn affects keratinocyte migration, with cells expressing low levels of PKP1 having increased calcium dependent desmosomes and greater migratory capacity.
Several KRTs were identified in our study as having an association with clinical outcome. Among these, Krt5 and Krt14 are known to be coregulated as part of an obligatory acidic/basic heteropolymer expressed in the basal proliferating layer of stratified epithelia. Wang et al31 noted that loss of Krt5 and Krt6 staining could distinguish between breast papillomas and papillary carcinomas. Alam et al39 showed that loss of Krt14 expression can lead to decreased proliferation in the presence of a reciprocal increase in activated Notch1. We did not observe an increase in Notch1; however, Notch1 is deactivated in many squamous cell carcinomas of the head and neck, and lower levels of Krt14 and 5 are associated with earlier time to onset of DM in our cohort.40,41 Indeed, some level of quiescence could protect clinically occult micrometastases from destruction during adjuvant therapy.
Interestingly, 2 other KRTs, Krt16 and Krt6A or B, are also coregulated as part of an obligatory acidic/basic heteropolymer. Mutations in these KRTs have been shown to cause pachyonychia congenita, a rare genodermatosis characterized by dystrophic, thickened nails and painful palmoplantar keratoderma. Overall, down-regulation of DSP, PKP1, and several KRTs has been observed to associate with DM and DSD in this study.
These observations suggest that in squamous cell carcinoma of the oral cavity, cells that have lower levels of these proteins may have a loss in desmosomal integrity as well as a shift to mitotic quiescence while exhibiting an increased migratory capacity.
Suppressed TRIM29 Expression May Affect Anchorage-Independent Growth, Contributing to DM
TRIM29 is a member of the TRIM protein family containing both zinc finger and leucine zipper motifs but not a RING finger domain. It is also the ataxia-telangiectasia group D–complementing gene42 and its expression confers resistance to ionizing radiation.43 In this study, expression of TRIM29 protein and its encoding RNA are down-regulated in patients with DM, indicating that this decrease in expression is regulated at the transcriptional level. Suppressed expression of TRIM29 has been observed in other cancers such as breast and prostate cancers,44 implying its relevance to tumorigenesis. Hosoi et al44 have shown that expression of TRIM29 leads to suppression of anchorage-independent growth (AIG) in multiple cancer cell lines. AIG is a crucial step in the acquisition of malignancy for cells to have the potential to migrate through the body, colonize other tissues, and grow metastatically.45 This observation with our study suggests that suppressed TRIM29 may promote AIG, contributing to poorer outcome in OCSCC.
Additionally, TRIM 29 has been shown to interact with the p53 pathway, either by a direct interaction with p53 itself46 and/or by interacting with the Tat-interacting protein 60 (Tip60), resulting in suppression of p53 activity.47 Inactivation of the p53 pathway has widespread effects in HNSCC tumorigenesis.48 Our data also suggest involvement of p53 pathway indirectly with 14 proteins (bold in Table 2). P53 was indicated as their top upstream regulator in Ingenuity Pathway Analysis. In the case of TRIM 29, its reduction was associated with poorer prognosis, even though it can function as both an oncogenic protein and a tumor suppressor,49 possibly because of the state of p53, that is, posttranslational modification and/or mutation, directly involved in the OCSCC tumorigenesis.
Reduced Calcium-Binding Proteins Associated With Poor Prognosis
The discussion has focused on the 17 peptides showing strong correlation with mRNA expression, implying a transcriptional regulatory mechanism. However, that is not meant to imply that proteins with weaker correlations to their mRNAs lack importance in OCSCC progression. They may be regulated at the translational level by the actions of microRNAs or at the posttranslational level by functions like targeted degradation due to modifications such as ubiquitination, or they could be affected by environmental factors such as Ca2+ homeostasis. According to the functional classification described above, one of the major functional subclasses belongs to calcium-binding proteins. They are a large and heterogeneous group of proteins that participate in numerous cellular functions (eg, Ca2+ homeostasis and Ca2+ signaling pathways).50 S100A8, S100A9, CALM, ACTN4, FLNA, SPTAN1, ANAXA2, and DSP are calcium-binding proteins associated with poor prognosis in this study. Three of these, S100A8, S100A9, and CALM, are EF hand proteins. EF hand proteins bind Ca2+ with high affinity to helix-loop-helix motifs that are repeated from 2 to 12 times. They not only buffer Ca2+ but also transduce calcium signals by changing conformation after binding. The current study shows that they are markedly down-regulated in the tumors of patients with poor prognosis. This observation from global proteomics data has also been independently verified using IMS, which also demonstrated the loss of S100A8 and S100A9 in the patient with poor outcome (Figure 5, a through e and f through j). The S100A8 and S100A9 complex calprotectin has been suggested to act as a switch between differentiation and proliferation. The switch is believed to be triggered by calcium binding, which releases S100A8 from the complex, allowing it to interact with telomerase,51 disruption of which appears to be required in essentially all tumors for immortalization.
Patient Outcome–Associated Proteins Generated From Global Proteomic Analysis Can Be Potential Prognostic Indicators
It has been shown in the present study that global proteomic analysis can produce a set of peptides associated with clinical outcomes for OCSCC. Overall, it is clear that our global proteomic analysis has revealed outcome-associated proteins that may directly or indirectly be involved in regulatory mechanisms affecting prognosis. With further validation we propose that these outcome-associated proteins generated from global proteomic analysis may be used as prognostic indicators. Additionally, we provide proof-of-concept data for a simple biomarker assay, demonstrating that S100A8 and S100A9 are reduced as measured using IMS on a tumor section taken from a sample associated with poor outcome.
Future studies will be focused on validating these clinically significant peptides, and we plan to assess their clinical utility using more cost- and time-effective analysis platforms. Multiple-reaction monitoring using a triple-quadrupole mass spectrometer52 is a potential option, because it can be developed into an assay that screens multiple peptides at once. This will also permit us to test the combination of proteins as a panel to improve the sensitivity and specificity of the assay as development proceeds. The additional cohort can be tested using IMS as well, because it acquires protein expression directly from frozen section samples. Imaging in situ MS is more adaptable to the clinical environment because of its potential to provide data that rival immunohistochemistry data for reliability and reproducibility, with the added benefit of the ability to screen a few hundred proteins at once with a very rapid processing time. Overall, our study is a step toward the development of a proteomics approach to prognostication in OCSCC.
This study was supported by grants from the US National Institute of Health (NIH), R33CA103547 (Dr Prystowsky), R21CA103547 (Dr Prystowsky), S10RR021056 (Dr Angeletti), S10RR025128 (Dr Angeletti), S10RR015859 (Dr Angeletti), and P30CA013330 (Dr Goldman). It was also supported in part by CTSA grants UL1RR025750, KL2RR025749, and TL1RR025748 from the National Center for Research Resources. The authors would like to thank Rani Sellers, DVM, PhD, at the Albert Einstein College of Medicine histotechnology and comparative pathology facility for her assistance. The authors also thank Shannon Cornett, PhD, at Bruker Daltonics for helpful discussions.
References
Author notes
The authors have no relevant financial interest in the products or companies described in this article.