Objectives To characterise the mRNA expression patterns of early and advanced stage colorectal adenocarcinomas of Malaysian patients.
Design Comparative expression analysis.
Setting and participants We performed a combination of annealing control primer (ACP)-based PCR and reverse transcription-quantitative real-time PCR for the identification of differentially expressed genes (DEGs) associated with early and advanced stage primary colorectal tumours. We recruited four paired samples from patients with colorectal cancer (CRC) of Dukes’ A and B for the preliminary differential expression study, and a total of 27 paired samples, ranging from CRC stages I to IV, for subsequent confirmatory test. The tumouric samples were obtained from the patients with CRC undergoing curative surgical resection without preoperative chemoradiotherapy. The recruited patients with CRC were newly diagnosed with CRC, and were not associated with any hereditary syndromes, previously diagnosed cancer or positive family history of CRC. The paired non-cancerous tissue specimens were excised from macroscopically normal colonic mucosa distally located from the colorectal tumours.
Primary and secondary outcome measures The differential mRNA expression patterns of early and advanced stage colorectal adenocarcinomas compared with macroscopically normal colonic mucosa were characterised by ACP-based PCR and reverse transcription-quantitative real-time PCR.
Results The RPL35, RPS23 and TIMP1 genes were found to be overexpressed in both early and advanced stage colorectal adenocarcinomas (p<0.05). However, the ARPC2 gene was significantly underexpressed in early colorectal adenocarcinomas, while the advanced stage primary colorectal tumours exhibited an additional overexpression of the C6orf173 gene (p<0.05).
Conclusions We characterised two distinctive gene expression patterns to aid in the stratification of primary colorectal neoplasms among Malaysian patients with CRC. Further work can be done to assess and compare the mRNA expression levels of these identified DEGs between each CRC stage group, stages I–IV.
- PUBLIC HEALTH
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/
Statistics from Altmetric.com
Strengths and limitations of this study
This regional-based study has a relatively small sample size due to the lack of a designated Tissue Bank and colorectal cancer (CRC) patient volunteers.
All subjects were newly diagnosed with CRC, and were not associated with any hereditary syndromes, previously diagnosed cancer or positive family history of CRC.
The findings of this study are considered preliminary.
Cancer staging is vital for patient management, especially in prognosis prediction and planning of treatment intervention.1 This is especially so in the colorectal cancer (CRC) staging system. As such, there have been many noteworthy improvements since the introduction of the classical Dukes’ staging system, followed by the modified Astler-Coller staging system; to the latest seventh edition of tumour-node-metastasis (TNM) staging system published by the American Joint Committee on Cancer (AJCC).2–4 The TNM staging system allows the incorporation of various clinical information (which are obtained through histopathological examination, radiological imaging and surgical findings), for accurate CRC stratification.5 However, these clinical assessments are greatly dependent on the expertise of pathologists, radiologists and clinicians.
The TNM classification is applicable for both clinical (cTNM) and pathological (pTNM) staging of primary colorectal tumours. Typically, it involves the assessment on the depth of bowel wall invasion at the time of diagnosis and the presence of regional lymph nodes metastases, as well as the presence of distant organ metastasis.4 As a potentially worse patient outcome with more advanced disease stage is the core concept in cancer staging, AJCC revises the TNM classification system every few years with an attempt to formulate it for more accurate patient prognostication.5 The latest seventh edition has further detailed the subclassification of the pN category and the assessment of discontinuous/satellite tumour foci. However, these revisions have increased the complexity and subjectivity during evaluation, and thus might lead to interobserver variability and hamper its efficiency in routine clinical practise.5 ,6 In addition, current clinicopathological parameters are insufficient to address the great biological and genetic heterogeneity of CRC in patients’ outcome and treatment response prediction. From the perspective of clinical oncology, the integration of molecular biomarkers into existing clinicopathological assessment will further refine the cancer management in future.
Over the past decades, many researchers have attempted to establish gene expression signatures specifically for the diagnosis, prognostication and recurrence prediction of sporadic CRC. Transcriptional profiling promises a fairly dynamic view on the cellular functions, regulatory mechanisms and biochemical pathways involved in the disease pathogenesis and progression.7 Various gene expression profiling techniques ranging from differential display, SAGE to microarrays, have been utilised. Despite its wide application in gene expression profiling, microarray experiments have been subjected to various sources of variability, false-positives as well as statistical and bioinformatic challenges. To date, none of the molecular markers described has been validated and employed in routine clinical practise owing to the poor reproducibility of the identified differentially expressed genes (DEGs) between different profiling platforms.8 Although the KRAS mutation and mismatch repair status have shown promising prognostic and predictive values, they have yet to be incorporated into either routine pathological reporting systems or TNM staging systems.5
As most of the molecular studies on CRC were based in Western populations and different molecular changes were thought to underlie the development of sporadic CRC in populations with different genetic backgrounds, we aimed to investigate the changes in mRNA expression patterns in primary sporadic colorectal tumours with regard to our Malaysian patients. In our study, we employed a combined approach of a two-step ACP-based PCR and real-time reverse transcription PCR to characterise the gene expression patterns for early and advanced stage sporadic colorectal adenocarcinomas.
Materials and methods
Patient selection and specimen collection
All patients presented with histologically confirmed colorectal adenocarcinomas and were staged accordingly to the AJCC TNM staging system (table 1). The staging of cancer was performed by taking into consideration their histopathological reports, CT images, morphological evaluations during surgery and serum carcinoembryonic antigen levels. All patients were newly diagnosed with CRC, and were not associated with any hereditary syndromes, previously diagnosed cancer or positive family history of CRC. Initially, four patients with CRC stages I–III were recruited for the preliminary ACP-based PCR analysis, while another 27 patients with CRC stages I–IV were recruited for subsequent reverse transcription-quantitative real-time PCR (RT-qPCR) analysis. The patients’ group was composed of the three main ethnic groups in the Malaysian population, that is, Chinese, Malays and Indians, in order to ensure a better representative of the study population.
The subjects were admitted to the University Malaya Medical Centre, Kuala Lumpur, Malaysia, and underwent curative surgical resection between 2010 and 2011. None had received preoperative chemoradiotherapy. The study protocol was approved by the Ethics Committee Board of University Malaya Medical Centre (Ref. No. 654.1), and written informed consent was obtained from all study patients. The tumouric specimens were excised from the primary colorectal tumours, while the non-cancerous tissue specimens were obtained from distally located macroscopically normal colonic mucosa. Both colorectal tumour and paired non-cancerous tissue specimens were immersed in RNAlater RNA Stabilisation Reagent (Qiagen) immediately after excision and stored at −80°C.
Total RNA extraction
Total RNA was extracted from homogenised colonic tissues with the RNeasy Plus Mini Kit (Qiagen) according to manufacturer's instructions. Subsequently, the RNA yield and integrity were ascertained via Agilent 2100 Bioanalyser in conjunction with Agilent RNA 6000 Nano Kits (Agilent Technologies). The values of RIN were then determined in order to assess the integrity of the isolated total RNA. In this study, only RNA samples with RIN values of 8.0–10.0 and rRNA ratios (28S/18S) of 1.5–2.5 were selected for successive applications.
ACP-based PCR analysis
First-strand cDNA synthesis
The synthesis of first-strand cDNA was performed according to the manufacturer's protocol for the GeneFishing DEG Premix Kit (Seegene), as follows: 3 µg of total RNA was added with 2 µL of 10 µM dT-ACP1 (5′-CTGTGAATGCTGCGACTACGATXXXXX(T)18-3′) and RNase-free water to a final volume of 9.5 µL. The mixture was then incubated at 80°C for 3 min, followed by chilling on ice for another 2 min. Subsequently, 4 µL of 5X RT buffer (Mbiotech), 5 µL of 2 mM dNTP (Fermentas), 0.5 µL of 40 U/µL RNase inhibitor (Mbiotech) and 1 µL of 200 U/µL M-MLV reverse transcriptase (Mbiotech) were added. This mixture was then incubated at 42°C for 90 min, heated at 94°C for another 2 min and chilled on ice for 2 min. Finally, 80 µl of DNase-free water was added to dilute the synthesised cDNA. The first-strand cDNA was stored under −20°C until further analysis.
ACP-based GeneFishing PCR
First, all four cDNA samples within each CRC and control group samples were pooled together in equal amounts. The characterisation of DEGs was then conducted via ACP-based PCR based on 20 arbitrary ACP primers (Cat. No. K1021) in a thermal cycler (Mastercycler Gradient, Eppendorf) according to the manufacturer's protocol (GeneFishing DEG Premix Kit, Seegene). Initially, the synthesis of second-strand cDNA was commenced in a one-cycle first-stage PCR: 94°C for 5 min, 50°C for 3 min and 72°C for 1 min. Next, the constructed second-strand cDNA was subjected to second-stage PCR with 40 cycles of a denaturing step at 94°C for 40 s, annealing step at 65°C for 40 s and extension step at 72°C for 40 s. Finally, a final extension step at 72°C for 5 min was carried out. The amplified products were then separated on 3% (w/v) agarose gels stained with ethidium bromide.
Cloning and sequencing
The identified differentially expressed bands were extracted from the agarose gel by using the PureLink Quick Gel Extraction Kit (Invitrogen). Each of these extracted DNA fragments was then individually cloned with the use of the TOPO TA Cloning Kit for Sequencing (Invitrogen). Subsequently, the plasmid containing the inserted DNA fragment was extracted from clones of interest via PureLink Quick Plasmid Miniprep Kit (Invitrogen). The isolated cloned plasmids were then sequenced with the ABI 3730xl DNA Analyser (Applied Biosystems). Finally, all the sequences obtained were analysed and matched for similarities with reference to the BLAST programme under the NCBI database.
The total RNA isolated from 27 paired samples was reverse transcribed to first-strand cDNA, with the following protocol: 3 µg of total RNA was added with 2 µL of 0.5 µg/µL oligo(dT)12–18 (Invitrogen) and RNase-free water to a final volume of 9.5 µL. The reaction mixture was then incubated at 80°C for 3 min, followed by chilling on ice for another 2 min. Next, 4 µL of 5X first-strand buffer (Invitrogen), 5 µL of 2 mM dNTP (Fermentas), 0.5 µL of 40 U/µL RNaseOUT recombinant RNase inhibitor (Invitrogen) and 1 µL of 200 U/µL M-MLV reverse transcriptase (Invitrogen) were added to the mixture. Finally, the reaction mixture was incubated at 42°C for 90 min, heated at 94°C for another 2 min and chilled on ice for 2 min. The synthesised first-strand cDNA was stored under −20°C until further usage.
The relative expression of identified DEGs in all paired colorectal tumours and control samples was determined via ΔΔCT method. The RT-qPCR was performed in a singleplex reaction containing 50 ng first-strand cDNA under universal thermal cycling conditions with the ABI 7500 Fast Real-Time PCR System (Applied Biosystems). Both ACTB (Assay ID: Hs99999903_m1) and GAPDH (Assay ID: Hs99999905_m1) were used as reference genes and are commercially available as TaqMan Pre-designed Assays (Applied Biosystems). Prior to the analysis of gene expression, the amplification efficiency for all target and reference genes assays was measured by using the standard curve method with 2-log measurements. The amplification efficiency value of 90–110% was acceptable (Applied Biosystems). In this relative quantification method, the values obtained represented the fold change in gene expression of the colorectal tumours, which was normalised with both reference genes, in relative to the calibrator (control sample).9
The difference in the expression level between primary colorectal tumour and paired non-cancerous tissues was analysed by using Real-Time StatMiner software (Integromics). The distribution of the ΔCT values obtained for each DEGs within each CRC and control group were tested for normality via the Shapiro-Wilk test. Subsequently, the paired t test was performed to assess the statistical significance of the observed differential expression patterns.
DEGs between colorectal tumours and non-cancerous colonic tissues
This preliminary study was conducted on paired samples pooled from four patients with CRC stages I–III. In ACP-based GeneFishing PCR, 20 sets of arbitrary ACP primers were used to randomly amplify gene products in both colorectal tumours and normal colonic samples. Upon visualisation on agarose gels, a total of 13 differentially expressed bands were observed by means of comparing bands intensity between the tumouric and non-cancerous samples, as shown in figure 1. These bands were further sequenced for gene identification, and 16 DEGs were successfully reported. Among them, 13 DEGs were overexpressed in colorectal tumours, while 3 DEGs were underexpressed, as listed in table 2.
Differential ability of the identified DEGs on early and advanced colorectal neoplasia
Following the identification of DEGs, the gene sequences obtained were then used to design primers and TaqMan probes for RT-qPCR analysis by Applied Biosystems, as listed in table 3. In an attempt to assess the differential ability of identified DEGs on early and advanced colorectal adenocarcinomas, the recruited paired samples were further stratified into two groups according to the cancer stage. Among them, 13 patients with stages I and II were grouped as early stage CRC, while the advanced stage CRC group comprised 14 patients with stages III and IV.
The analysis of RT-qPCR results was performed via Real-Time StatMiner software by importing the raw Ct data. The within-group correlation of these ΔCT values was then determined by calculating the median absolute deviation for all the samples within the same experimental group. The biological samples that do not correlate well with other samples in the same group, were detected as group outliers and excluded from subsequent analysis. Both ACTB and GAPDH were used for normalisation in computing the ΔCT (figure 2) and values by using the following formulas (table 4):
The relative fold change in the mRNA expression level between the colorectal tumours and adjacent normal colonic mucosa was shown as the values. The statistical significance of the observed fold change in expression was determined by paired t test for all the DEGs. A p value of less than 0.05 is considered as statistically significant (table 4).
In both early and advanced stage CRC groups, the expression of 4 of 16 DEGs was reported to be significantly differed between tumouric and non-cancerous tissues. Remarkably, the combination of this panel of four genes is different among two groups. The RPL35, RPS23 and TIMP1 genes were found to be overexpressed in early and advanced colorectal neoplasms (p<0.05) (figures 3 and 4). It is interesting to note that the under-expression of ARPC2 gene (p<0.05) was only observed in early stage colorectal tumours (figure 3). However, the C6orf173 gene was found to be overexpressed (p<0.05) in advanced colorectal adenocarcinomas, but not in early stage colorectal tumours (figure 4).
Our current study has revealed two distinctive four-gene signatures for both early and advanced stage colorectal adenocarcinomas. The early stage sporadic CRC was characterised by the overexpression of RPL35, RPS23 and TIMP1 genes, as well as underexpression of ARPC2 gene. However, the advanced primary colorectal tumours were reported with overexpression of C6orf173, RPL35, RPS23 and TIMP1 genes. Although the relative fold change for ARPC2, RPL35 and RPS23 genes is below 2, the individual result does not affect the analysis as gene expression patterns of all four genes in combination were proposed to distinguish between the early and advanced stage colorectal neoplasms. The potential involvement of these DEGs and their altered expression levels in CRC were further supported by previous researches.
In fact, several proto-oncogenes and tumour suppressors are previously reported to regulate the ribosome production, that is, the RB,10 TP53,11 PTEN genes,12 as well as the MYC gene family.13 It is suggested that the alterations in ribosome biogenesis might affect the translation of genes that are involved in neoplastic transformation. In addition, the additional extra-ribosomal functions of the ribosomal proteins (r-proteins) in cellular apoptosis, cellular proliferation, cellular transformation, genes transcription, mRNA translation, DNA repair and inflammation, might also trigger and support the neoplastic development.14 Hence, the overexpression of r-proteins-encoding genes observed in colorectal adenocarcinomas is not unexpected.15–17 Our current study has revealed the significant overexpression of two r-proteins that were not previously described in colorectal tumours, that is, the RPL35 and RPS23. The observed fold changes for the RPL35 and RPS23 mRNA levels were comparable between the early and advanced stage colorectal tumours in our sample cohort. This was in agreement with previous reports by Barnard et al18 and Frigerio et al,19 where the changes in the mRNA expression levels of the r-proteins were irrespective of the cancer stage. The hypothesis that the same ribosomal protein may contribute in different stages of cancer progression with their hitherto unknown extra-ribosomal roles might provide an explanation to these observations.20
However, our current study also demonstrated an over-expression of the TIMP1 gene in both early and advanced stage primary colorectal tumours. This finding is supported by Zeng et al,21 where the overexpression of TIMP1 was reported in all stages of primary colorectal tumours. Under normal physiological conditions, the proteolytic activities of MMPs are kept at bay by their natural inhibitors, the TIMPs.22 Previous studies have reported the overexpression of MMPs in early and advanced stage colorectal tumours, as well as other cancer types,23–25 which is in accordance to their biological roles. Hence, a similar scenario is expected for TIMPs and indeed, their suppressive role in tumour invasion and metastasis has been demonstrated in various cancer models.26 However, more recent studies have revealed a direct correlation between TIMP1 expression and tumour aggressiveness in cancer, including CRC.21 ,27 These findings, which are contradictory to its protease-inhibiting function, have suggested a possible tumour-promoting role of TIMP1 in tumorigenesis. It is postulated that the TIMP1 exhibits the abilities to inhibit tumour cell apoptosis and promote tumour angiogenesis, as well as other growth-factor-like effects.28 In our current study, the observed comparable overexpression of TIMP1 in early and advanced stage sporadic colorectal neoplasms was in line with its MMP inhibitory and MMP-independent tumour-promoting activities.
In cancer biology, the expression of mRNAs and proteins of the ARP2/3 complex is often studied due to its role in cell migration, which contributes to cancer invasion and metastasis if aberrantly regulated.29 We have detected a significant underexpression of ARPC2 in our cohort of early stage primary colorectal tumours. Surprisingly, this finding is contradictory with the role played by ARPC2 in cancer invasion and metastasis theoretically. Previously, Kaneda et al has reported the decreased expression of all the seven genes encoding the subunits of ARP2/3 complex in human gastric cancers. Among them, the Arp2, ARPC2 and ARPC3 showed the most prominent reduction in their expression levels.30 The exact mechanism underlying this observation still remains unknown, but the epigenetic alteration might potentially provide an explanation for it. For instance, promoter hypermethylation that causes gene silencing is responsible for the reduced expression of ARPC1 in human gastric cancer.31 Similarly, the epigenetic study might also offer a clue for the underexpression of ARPC2 in colorectal neoplasms.
C6orf173, which is also known as CUG2 or CENP-W, is a novel oncogene that has been found to be upregulated in many human cancer tissues. Its high expression level is profoundly reported in tumours of the ovary, liver, lung, pancreas, breast, colon, rectum and stomach. The CENP-W is a new member of the constitutive centromere-associated network, which specifically interacts with the CENP-T and plays an important role in mitosis.32 In our current study, the CENP-W is over-expressed in advanced colorectal adenocarcinomas. This finding correlates to its function in kinetochore assembly, where its aberrant expression might lead to abnormal cell division and aneuploidy in cancer.32 In our study, the overexpression of CENP-W was observed in early and advanced cohort of colorectal neoplasms but only statistically significant in the latter group. Given the fact that aneuploidy is constantly associated with a greater proportion of advanced CRC cases, the aberrant expression of CENP-W might potentially relate to a poorer prognosis of CRC.33
In conclusion, we have characterised two distinctive gene expression patterns, which comprise the ARPC2, C6orf173, RPL35, RPS23 and TIMP1 genes, for the stratification of primary colorectal adenocarcinomas among Malaysian CRC patients. It was postulated that the actin cytoskeleton might play an important role in determining the dysplastic cell morphology during the early development of CRC, while the aberration in the assembly of functional kinetochore might be crucial for the aneuploidy of the advanced stage colorectal tumours. Nevertheless, the findings of this study were considered preliminary owing to the relatively small sample size. The main reason for this is the lack of a designated Tissue Bank in our institution. Moreover, the lack of patient with CRC volunteers and our stringent criteria for patient selection have also limited the availability of suitable specimens within the short sample collection period.
However, our identified mRNA expression patterns specific for early and advanced stage colorectal tumours are still convincing with our stringent sample selection criteria, high specificity primers and probes as well as reliable statistical analysis. In future, the validation of these DEGs should be performed on a larger set of clinical samples, and extensive inter-laboratory testing of their differential abilities on each CRC stage is also desired. In addition, we should also integrate other imaging and histological information to complement our identified gene expression patterns, which then hold promises for better stratification of tumours.
Contributors KHC, KLG, IH, HCC and ACR had the original idea for this work and gained funding in collaboration with PCL. TPL carried out the experiment. TPL, KHC, PCL, HCC and LHL were involved in the data analysis. TPL wrote the first draft of this paper and all authors subsequently assisted in redrafting and have approved the final version.
Funding This study was supported by FS176/2007C, PS172/2008C and Research Collaborative Grant, CG041-2013 from the University of Malaya.
Competing interests None.
Patient consent Obtained.
Ethics approval Ethics Committee Board of University of Malaya Medical Center (Ref. No. 654.1).
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement No additional data are available.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.