- Research
- Open access
- Published:
Computational approach for assessing the involvement of SMYD2 protein in human cancers using TCGA data
Journal of Genetic Engineering and Biotechnology volume 21, Article number: 122 (2023)
Abstract
Background
SMYD2 is a protein of the SET and MYND domain-containing family SMYD. It can methylate the lysine residue of various histone and nonhistone cancer-related proteins and plays a critical role in tumorigenesis. Although emerging evidence supports the association of SMYD2 in the progression of cancers, but its definitive effect is not yet clear. Therefore, further study of the gene in relation with cancer progression needs to be conducted. In the current study, investigators used TCGA data to determine the potential carcinogenic effect of SMYD2 in 11 cancer types. The transcriptional expression, survival rate, mutations, enriched pathways, and Gene Ontology of the SMYD2 were explored using different bioinformatics tools and servers. In addition, we also examined the correlation between SMYD2 gene expression and immunocyte infiltration in multiple cancer types.
Results
Findings revealed that higher expression of SMYD2 was significantly correlated with cancer incidents. In CESC and KIRC, the mRNA expression of SMYD2 was significantly correlated with overall survival (OS). In BRCA, KIRC, COAD, and HNSC, the mRNA expression of SMYD2 was significantly correlated with disease-free survival (DFS). We detected 15 missense, 4 truncating, 4 fusions, and 1 splice type of mutation. The expression of SMYD2 was significantly correlated with tumor purity and immunocyte infiltration in six cancer types. The gene GNPAT was highly associated with SMYD2. Significant pathways and Gene Ontology (GO) terms for co-expressed genes were associated to various processes linked with cancer formation.
Conclusion
Collectively, our data-driven results may provide reasonably comprehensive insights for understanding the carcinogenic effect of SMYD2. It suggests that SMYD2 might be used as a significant target for identifying new biomarkers for various human tumors.
Background
Cancer has become a serious health burden which surpassed cardiovascular diseases as the second largest cause of death worldwide [1]. According to the latest press release by the WHO in 2020, there were 19.3 million new cancer patients diagnosed, and approximately, 10 million cancer deaths occurred globally [2]. The evolutions in genetic and epigenetic parameters promote tumorigenesis. Differentially expressed genes linked to cancer patient survival could be exploited as diagnostic markers for early cancer detection [3, 4]. Therefore, cancer investigation, identification of related biomarkers, and development of methods for active prevention are essential requirements for the early screening of cancer.
Protein methyltransferase (PMT) is a catalytic enzyme that helps in the transfer of the methyl group to its substrate with the help of methyl donor S-adenosyl-L-methionine (SAM). It plays a significant role in the regulation of epigenetic mechanisms and is involved in the methylation of various substrates [5, 6]. PMTs play a crucial role in transcriptional events through histone methylation and nonhistone methylation at the position of arginine or lysine residues. Protein lysine methyltransferases (PKMT) are a type of PMT that helps to transfer a methyl group to the lysine residue of the substrate protein. It has been described that overexpression of proteins from PKMTs was linked with different types of human cancers [7, 8].
SMYD2 is a protein from PKMTs implicated in tumorigenesis and can influence gene transcription through lysine methylation [9]. Numerous studies have revealed the activity of SMYD2 methylation to nonhistone proteins such as P53 and RB1 [10, 11]. The SMYD2-specific nonhistone substrates are significantly associated with the carcinogenicity [12, 13]. Numerous tumor-causing proteins, for example, P53 [14], heat shock protein (HSP90) [15], retinoblastoma (Rb) [16, 17], ERα [18], PTEN [19], PARP1 [20], and STAT3 [21] being methylate by SMYD2. Therefore, it has been evidenced that SMYD2 is an onco-related protein that can affect the function of cancer suppressor proteins. The data analysis demonstrated that higher expression of SMYD2 is present in a variety of human cancers, like breast, bladder, colorectal, cervical, esophageal, lymphoma, ovarian, head and neck, and pancreatic cancer [14, 16, 22].
In the present analysis, we systematically explored the SMYD2 expression and its clinical outcomes to evaluate its potential marker for cancer treatment. Various expression and patient survival datasets available on several online platforms were used for this analysis. We measured multiple factors, such as the difference in gene expression, survival value, gene mutations, phosphorylation, methylation, immune infiltration, and functional enrichment analysis to explore the potential molecular mechanisms of the oncogenic role of SMYD2 on pathogenesis. Collectivelly, we identified that SMYD2 was not only potential biomarker but also may be of promising therapeutic target for multiple cancers.
Methods
Gene expression analysis
Expression analysis of SMYD2 was performed by using two web servers TIMER2 (http://timer.cistrome.org/) [23] and GEPIA2 [24] (http://gepia2.cancer-pku.cn/#analysis). Both utilized the tumor and non-tumor expression data from The Cancer Genome Atlas (TCGA). Genotype-tissue expression (GTEx) data was used to perform the expression difference between the tumor tissues and normal tissues using statistical method analysis of variance (ANOVA) with the help of GEPIA2. Parameters for the assessment method were under the setting of log2FC of 1, p-value of 0.01, and “Match TCGA normal and GTEx data.” Moreover, the violin plots for the SMYD2 expression level in diverse stages of pathology (stages I–V) of all TCGA tumors were also obtained via the GEPIA2 [24]. The log2 [TPM (Transcripts per million) +1] transformed data were applied for the construction of violin plots.
Survival prognosis analysis
Kaplan–Meier (K-M) plots were calculated to perform the survival analysis of cancer patients in TCGA cancers with the help of the GEPIA2 server [24]. The median score was used as a cutoff to divide the high-expression and low-expression cohorts. Then, samples with expression level higher than 50% were considered high-expression cohorts, and lower than 50% were considered low-expression cohort. With the help of K-M plots, we analyzed the overall survival (OS) and disease-free survival (DFS) for 11 cancer types. The log-rank test also called Mentel-Cox test was utilized for hypothesis test and p-value < 0.05 was considered statistically significant for all survival analyses. Additionally, the hazard ratio (HR) with 95% confidence intervals was also computed.
Promoter methylation analysis
The promoter methylation analysis in multiple cancer patients has been studied in TCGA dataset by using the UALCAN (http://ualcan.path.uab.edu/analysis-prot.html) [25]. It provides the facilities to estimate the cancer-related multi-omics data and assists to analyze the expression of proteins present in the Clinical Proteomic Tumor Analysis Consortium (CPTAC) as well as TGCA data. To generate the analysis results, default parameters were used. The statistical method Wilcoxon rank-sum test was used for the methylation differential analysis.
Genetic alteration analysis
The cBioPortal (https://www.cbioportal.org/) has a large-scale web resource for cancer research [26, 27]. This study is based on TCGA data, so genetic alteration was performed by selecting the “TCGA Pan-Cancer Atlas Studies.” All TCGA tumors were examined for the frequency of alteration, type of mutations, and DNA copy number alterations by using 4617 samples. We also analyzed the differences between TCGA cancer patients with and without SMYD2 mutations in terms of overall, disease-free, disease-specific, and progression-free survival. To create the K-M graphs, the log rank of the p-value was utilized with the significant level of <0.05.
Immune cell infiltration analysis
By choosing the immune cells such as cancer-associated fibroblasts cell and CD8 + T cells, the TIMER2 web server [23] was performed the association analysis between immune cell infiltration and SMYD2 expression across all tumor types. Immune infiltration was estimated using a variety of methods such as CIBERSORT-ABS, TIMER, CIBERSORT, EPIC, QUANTISEQ, MCPCOUNTER, and XCELL. The partial correlation and p-values were calculated by using the correlation test Spearman with the purity-adjustment parameter. The p-value < 0.05 was considered as statistically significant. The correlation output data was represented with the help of a scatter plot and heat maps.
SMYD2-related gene enrichment analysis
Using the targeted data of normal and tumor tissues, the GEPIA2 server was utilized for finding the top 100 targeted genes associated with SMYD2. In this study, Enrichr web (https://maayanlab.cloud/Enrichr/) [28] was used for pathways and Gene Ontology (GO) analysis. Reactome 2022 and the Kyoto Encyclopedia of Genes and Genomes (KEGG) 2021 databases were utilized to define the signaling pathways. The SMYD2-correlated genes were categorized into three processes such as biological processes, cellular components, and molecular functions using different GO terms. The Cox p-value < 0.05 was considered statistically significant. The q-value (adjusted p-value) was calculated using the Benjamini–Hochberg method. Top ten enriched terms for input genes were displayed on bar charts based on the − log10 (p-value).
Results
Analysis of gene expression data
To check the association of SMYD2 with cancers, the gene expression profile was analyzed in various normal and cancer types of tissues. The SMYD2 was overexpressed in most types of cancer (p < 0.001) as compared to the corresponding normal tissues (Fig. 1A). The X-axis shows the SMYD2 expression in log2 fold change values, whereas Y-axis shows the tissue types where SMYD2 is expressed. Analysis of TCGA datasets by using the GEPIA2 database also showed similar SMYD2 expression in bladder carcinoma, colon adenocarcinoma, diffuse large B-cell lymphoma, cervical squamous cell carcinoma, liver hepatocellular carcinoma, pancreatic adenocarcinoma, rectum adenocarcinoma, thymoma, uterine corpus endometrial carcinoma, skin cutaneous melanoma, and uterine carcinosarcoma (Fig. 1B). From here, we selected only the cancers reported with the overexpression of SMYD2 and associated with cancer progression. These cancer types are bladder, cervical, colon, breast, lymphoid, esophageal, liver, head and neck, kidney, ovarian, and pancreatic cancer.
We next used GEPIA2 to compare the expression difference of SMYD2 in the GTEx dataset as a control. Tumors and normal tissues of 11 cancer types, such as “breast invasive carcinoma (BRCA), bladder urothelial carcinoma (BLCA), colon adenocarcinoma (COAD), cervical squamous cell carcinoma, endocervical adenocarcinoma (CESC), lymphoid neoplasm diffuse large B-cell lymphoma (DLBC), head and neck squamous cell carcinoma (HNSC), esophageal carcinoma (ESCA), kidney renal clear cell carcinoma (KIRC),”liver hepatocellular carcinoma (LIHC), pancreatic adenocarcinoma (PAAD), and ovarian serous cystadenocarcinoma (OV), were considered for the evaluation of SMYD2 expression differences. The box plots were created to represent the expression of normal and tumor tissues. In comparison to normal tissues, all tumor types showed higher expression. The significant expression difference (p-value < 0.01) was observed in the BLCA, CESC, COAD, DLBC, and PAAD (Fig. 2).
Furthermore, we used to study the correlation between the cancer pathological stages (stages I–V) and SMYD2 expression. The violin plots for the pathological stages of all 11 cancer types are shown in Fig. 3. The width of each violin corresponds to the density of data points at that particular stage, with wider violins indicating a higher data density. The expression of SMYD2 varied more than six orders of magnitude in maximum cancer types. The height of each violin indicates the range of values observed for that stage. This analysis suggested that SMYD2 has a promoting role in the occurrence and progression of cancer.
Analysis of survival data
To explore the critical efficiency of SMYD2 in the survival of various cancer cases, we used GEPIA2 to evaluate survival data and establish an association between cancer patient survival and RNA expression SMYD2. The K-M plots for overall survival and disease-free survival analysis for all types of cancer were analyzed. A high level of SMYD2 in CESC (HR 2.3, p = 0.00045) and a lower level of SMYD2 in KIRC (HR 0.49, p = 6E-06) were significantly correlated with the OS of cancer patients (Fig. 4). The lower level of SMYD2 in BRCA (HR 0.68, p = 0.046) and KIRC (HR 0.63, p = 0.013) and a higher level of SMYD2 in COAD (HR 2, p = 0.0061) and HNSC (HR 1.6, p = 0.008) were significantly correlated with DFS (Fig. 5). The significantly correlated high level of SMYD2 expression indicated that patients with higher SMYD2 expression tend to live longer than those with lower SMYD2 expression, vice versa. The low SMYD2 expression group had a greater survival rate as compared to the higher expression group in maximum cancer types for both the OS and DFS.
Analysis of DNA methylation data
DNA methylation is an important epigenetic regulator of gene expression [29, 30]. Multiple malignancies have been found to have distinct and abnormal hypermethylation of CpG-rich regions (called CpG islands) or whole-genome hypermethylation [31, 32]. Hence, we discovered a possible correlation between SMYD2 expression and methylation in a range of cancers. The level of DNA methylation is represented in terms of beta values, which range from 0 (non-methylated) to 1 (fully methylated). The beta value towards one signifies the higher methylation level. In BRCA, CESC, COAD, ESCA, and KIRC cancers, the promoter methylation level was found higher. The reduced methylation level was observed in BLCA, HNSC, LIHC, and PAAD tumors compared to their counterparts of normal tissues in the UALCAN analysis using the TCGA dataset (Fig. 6).
Analysis of genetic alteration data
The genetic alteration analysis of SMYD2 in several cancers was performed using cBioPortal. The SMYD2 gene mutations were searched in 4617 cancer samples from 11 different pan-cancer studies, including breast, cervical, bladder, colon, esophageal, kidney, head and neck, liver, lymphoid, pancreatic, and ovarian cancer. In SMYD2 433 amino acid long sequence, a total of 24 different mutations were detected, in which missense mutations (15 in number) are the most common form of genetic alteration (Fig. 7A). Maximum alteration frequency of SMYD2 (> 10%) seems in lymphoid cancer patients. The amplification type of copy number alterations was the main type in the lymphoid, breast, and liver cancer cases (Fig. 7B) (Supplementary Table 1). The deletion type of mutation was detected in several cancer patients. Additionally, as seen in the TCGA dataset, amplification and gain were more common (Fig. 7C). Moreover, the possible link between genetic variants of SMYD2 and survival prognosis across all TCGA cancer was also investigated. The survival analysis in comparison to SMYD2 with and without alteration was analyzed in disease-specific, overall, progression-free, and disease-free survival (Fig. 7D). The maximum survival difference was observed in disease-free survival analysis.
Analysis of immune-infiltration data
Tumor-infiltrating immune cells are a chief constituent of the tumor microenvironment, and they play a crucial role in cancer progression, invasion, and metastasis [33, 34]. Cancer-related fibroblasts correlated to cancer in the stroma of the tumor microenvironment have been discovered to play a role in the functional regulation of immune cells infiltrating malignancies [35,36,37]. Here, we used various algorithms such as CIBERSORT, TIMER, XCELL, MCPCOUNTER, CIBERSORT-ABS, EPIC, and QUANTISEQ to study the correlation between immune cell infiltration and SMYD2 expression in multiple cancers types.
Analysis revealed that statistically significant positive correlation (p < 0.05 and Rho > 0) was detected between CD8 + T-cell and SMYD2 expression for BLCA (Rho = 0.208, p = 5.37E-05), BRCA (Rho = 0.115, p = 0.0002), DLBC (Rho = 0.340, p = 0.031), KIRC (Rho = 0.262, p = 1.12E-08), and LIHC (Rho = 0.129, p = 0.016). The statistically significant negative correlation (p < 0.05 and Rho < 0) was observed between the expression of SMYD2 and immune infiltration of CD8+ T cell in BRCA basal (Rho = − 0.160, p = 0.034), CESC (Rho = − 0.137, p = 0.023), COAD (Rho = − 0.171, p = 0.004), ESCA (Rho = − 0.287, p = 9.50E-05), HNSC (Rho = − 0.220, p = 7.69E-07), and PAAD (Rho = − 0.195, p = 0.010) cancers of TCGA based on most or at least one algorithm (Supplementary Table 2). Furthermore, for BLCA (Rho = 0.208, p = 5.37E-05), BRCA-LumA (Rho = 0.208, p = 5.37E-05), BRCA-LumB (Rho = 0.237, p = 0.0009), CESC (Rho = 0.217, p = 0.0002), ESCA (Rho = 0.449, p = 2.67E-10), HNSC (Rho = 0.162, p = 0.0002), HNSC-HPV − (Rho = 0.240, p = 1.08E-06), and LIHC, a statistically significant positive correlation was found between the infiltration value of cancer-related fibroblasts and SMYD2 expression. The scatter plot for these tumors was created by using one algorithm as shown in Fig. 8. A statistically negative association was detected for DLBC (Rho = − 0.336, p = 0.031), KIRC (Rho = − 0.135, p = 0.004), and PAAD Rho = − 0.200, p = 0.008) of TCGA tumors based on most or almost all algorithms (Supplementary Table 2).
Enrichment analysis of SMYD2-related genes
To explore the SMYD2-related genes, GEPIA2 server was utilized to identify the first 100 correlated genes with SMYD2 by combining all TCGA tumor expression data (Supplementary Table 3). The highest correlation was found in glyceronephosphate O-acyltransferase (GNPAT) (r = 0.42), insulin-induced gene 2 (INSIG2) (r = 0.41), and Egl-9 family hypoxia-inducible factor 1 (EGLN1) (r = 0.40). Finally, we used the list of associated genes with SMYD2 in various cancers for an ontology-level analysis to identify the putative signaling pathways.
The first ten pathways from REACTOME and their interrelated genes were associated with the activation of arylsulfatases, RAB geranylgeranylation, metabolism of protein, post-chaperonin tubulin folding pathway, metal ion SLC transporters, cargo concentration in the ER, protein folding, gamma carboxylation and hypusin formation, post-translation protein modification, and glycosphingolipid metabolism. All these pathways showed a significant correlation with SMYD2 (Fig. 9A). The top ten KEGG pathways are mainly associated with a HIF-1 signaling pathway, glycolysis/gluconeogenesis, central carbon metabolism in cancer, thiamine metabolism, selenocompund metabolism, glycosaminoglycan degradation, one carbon pool by folate, histidine metabolism, renin-angiotensin system, and beta-alanine metabolism (Fig. 9B). A significant association was observed in the HIF-1 signaling pathway. Additionally, we also analyzed the GO terms for genes associated with SMYD2 to see their functions in biological processes, molecular functions, and cellular components. The recommended GO features mainly were involved in mitochondrial transport and oxaloacetate metabolic process in the biological processes category (Fig. 9C), guanosinediphosphate in the molecular function category (Fig. 9D), and an integral component of the mitochondrial membrane in the process of a cellular component category (Fig. 9E). All significant terms involved in pathways and GO functions along with p-value and q-value (adjusted p-value) are shown in Supplementary Table 4.
Discussion
Cancer ranked as the second most prominent cause of global death that accounts for approximately 10 million deaths in 2020 worldwide [2]. The main cause for a higher number of cancer deaths is poor prognosis and advanced disease [38, 39]. So, the identification of effective biomarkers would be beneficial for early-stage diagnosis of cancer patients and will also help improve the treatment efficacy. Additionally, the proposed biological regulatory entities will help in the prognosis of multiple human malignancies. SMYD2 is a protein that takes part in the epigenetic modifications of the tumor suppressor gene and affects tumor transcription regulation by promoting nonhistone protein methylation [40]. Thus, it is important to explore how SMYD2 is associated with malignancies through common molecular mechanisms. Liu et al. studied the correlation of proteins from the SMYD family with cancer patients [40]. When we performed a literature review, we could not identify any integrated multicenter SMYD2 cancer analysis. Therefore, in the present study, we executed a multi-omics analysis to better understand the role of SMYD2 in diverse cancers.
In this systematic bioinformatics investigation of public datasets, we examined the expression value of SMYD2 across the tumors present in TCGA. The overexpression of SMYD2 was present in tumor tissues of all cancer types except a few compared to normal tissues. Furthermore, the overexpression of SMYD2 was observed in TCGA tumor tissues when GTEx data was used as a control. We found higher SMYD2 mRNA expression in all studied cancers. Overexpression of SMYD2 has been observed in several types of cancer, including breast cancer, lung cancer, and hepatocellular carcinoma [22]. The analysis of survival prognosis for the SMYD2 gene proposed discrete conclusions for tumors. Here, the GEPIA2 program was used to perform a statistical correlation between SMYD2 expressions and overall/disease-free survival rate of cancer patients. Results showed that a lower SMYD2 expression group has been significantly associated with a higher survival rate in the maximum type of cancer patients for both OS and DFS. The overexpression of the SMYD2 gene in tumor tissues was induced by a genetic mutation, CNAs, and epigenetic control. Moreover, mutational analysis from TCGA data suggested that CNA amplification was the most common type of alteration. The maximum alterations (> 10%) for SMYD2 were observed in lymphoid cancer. Further, correlated genes with SMYD2 were examined, and the top 100 genes were taken for pathways and GO analysis. In the REACTOME pathway analysis, the top pathways were associated with carcinogenesis [41,42,43,44]. For example, the activation of arylsulfatases is often decreased in cancer cells, leading to an accumulation of sulfated glycosaminoflycans (GAGs) in the extracellular matrix (ECM). This altered ECM composition can promote tumor growth and invasion by altering cell adhesion, migration, and signaling [45]. The abnormal RAB geranylgeranylation can be associated with cancer development and progression [46]. Furthermore, a study has shown that the post-chaperonin pathway for tubulin folding is associated with cancer [47]. The KEGG pathway analysis indicated a significant correlation with the HIF-1 signaling pathway involved in tumor progression and metastasis [48, 49]. The enriched GO terms involved in transport and metabolic process in the biological process category, guanosinediphosphate in molecular function category, and essential component of the mitochondrial membrane in the cellular component category were observed.
The present research shows the promising association of SMYD2 in multiple cancers through diverse publicly available bioinformatics tools and servers. Our integrated analysis shows that SMYD2 would be a potential biomarker for a wide range of cancers. The regulatory effect of SMYD2 on diverse cancers is different, though further experimental studies are desired to understand the complete molecular analysis of SMYD2 to identify its more effective biomarker role for cancer. Thus, SMYD2 can be used for the diagnosis of several cancers. Additionally, in vivo and in vitro research is necessary to clarify SMYD2 as a potential biomarker for cancer.
Conclusion
In this comprehensive analysis, various bioinformatics databases and tools were used to elicit the SMYD2 expression, prognostics value, DNA methylation, mutation, CNAs, and correlated genes of SMYD2 in various human cancers. This comprehensive analysis shows that SMYD2 is significantly associated with multiple cancers. Heterogeneous data in TCGA were analyzed using extensive statistical and computational procedures that allowed us to reveal novel promising parameters for each examined cancer type. Additionally, our findings will give an enhanced understanding of the role of SMYD2 in the process of tumorigenesis and metastasis. The pan-cancer analysis provides a potential mechanism that suggested the expression of SMYD2 might modulate tumors. However, because these findings were based on data analysis, more experimental verification will be needed. In conclusion, SMYD2 would be a possible biomarker and a significant drug target for the prevention and management of human cancers.
Availability of data and materials
All data generated or analyzed during this study are included in this published article and its supplementary information files.
Abbreviations
- TCGA:
-
The Cancer Genome Atlas
- PMT:
-
Protein methyltransferase
- PKMT:
-
Protein lysine methyltransferases
- GTEx:
-
Genotype-tissue expression
- K-M:
-
Kaplan-Meier
- OS:
-
Overall survival
- DFS:
-
Disease-free survival
- BRCA:
-
Breast invasive carcinoma
- BLCA:
-
Bladder urothelial carcinoma
- COAD:
-
Colon adenocarcinoma
- CESC:
-
Cervical squamous cell carcinoma and endocervical adenocarcinoma
- DLBC:
-
Lymphoid neoplasm diffuse large B-cell lymphoma
- HNSC:
-
Head and neck squamous cell carcinoma
- ESCA:
-
Esophageal carcinoma
- KIRC:
-
Kidney renal clear cell carcinoma
- LIHC:
-
Liver hepatocellular carcinoma
- PAAD:
-
Pancreatic adenocarcinoma
- OV:
-
Ovarian serous cystadenocarcinoma
References
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A (2018) Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68:394–424. https://doi.org/10.3322/caac.21492
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A et al (2021) Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71:209–49. https://doi.org/10.3322/caac.21660
Sehgal M, Gupta R, Moussa A, Singh TR (2015) An integrative approach for mapping differentially expressed genes and network components using novel parameters to elucidate key regulatory genes in colorectal cancer. PLoS One 10:e0133901. https://doi.org/10.1371/journal.pone.0133901
O’Cathail SM, Wu C-H, Lewis A, Holmes C, Hawkins MA, Maughan T (2020) NRF2 metagene signature is a novel prognostic biomarker in colorectal cancer. Cancer Genet 248–249:1–10. https://doi.org/10.1016/j.cancergen.2020.08.006
Copeland RA, Solomon ME, Richon VM (2009) Protein methyltransferases as a target class for drug discovery. Nat Rev Drug Discov 8:724–732. https://doi.org/10.1038/nrd2974
Yadav AK, Gupta PK, Singh T (2023) PMTPred: Machine Learning Based Prediction of Protein Methyltransferases using the Composition of k-spaced Amino Acid Pairs. BioRxiv 2023–09. https://doi.org/10.1101/2023.09.20.558595
Hamamoto R, Nakamura Y (2016) Dysregulation of protein methyltransferases in human cancer: an emerging target class for anticancer therapy. Cancer Sci 107:377–384. https://doi.org/10.1111/cas.12884
Yadav AK, Singh TR (2021) Novel structural and functional impact of damaging single nucleotide polymorphisms (SNPs) on human SMYD2 protein using computational approaches. Meta Gene 28:100871. https://doi.org/10.1016/j.mgene.2021.100871
Greer EL, Shi Y (2012) Histone methylation: a dynamic mark in health, disease and inheritance. Nat Rev Genet 13:343–357. https://doi.org/10.1038/nrg3173
Eggert E, Hillig RC, Koehr S, Stöckigt D, Weiske J, Barak N et al (2016) Discovery and characterization of a highly potent and selective aminopyrazoline-based in vivo probe (BAY-598) for the protein lysine methyltransferase SMYD2. J Med Chem 59:4578–4600. https://doi.org/10.1021/acs.jmedchem.5b01890
Hamamoto R, Saloura V, Nakamura Y (2015) Critical roles of non-histone protein lysine methylation in human tumorigenesis. Nat Rev Cancer 15:110–124. https://doi.org/10.1038/nrc3884
Nguyen H, Allali-Hassani A, Antonysamy S, Chang S, Chen LH, Curtis C et al (2015) LLY-507, a Cell-active, potent, and selective inhibitor of protein-lysine methyltransferase SMYD2. J Biol Chem 290:13641–13653. https://doi.org/10.1074/jbc.M114.626861
Yadav AK, Singh TR (2021) Novel inhibitors design through structural investigations and simulation studies for human PKMTs (SMYD2) involved in cancer. Mol Simul 47:1149–1158. https://doi.org/10.1080/08927022.2021.1957882
Huang J, Perez-Burgos L, Placek BJ, Sengupta R, Richter M, Dorsey JA et al (2006) Repression of p53 activity by Smyd2-mediated methylation. Nature 444:629–632. https://doi.org/10.1038/nature05287
Voelkel T, Andresen C, Unger A, Just S, Rottbauer W, Linke WA (2013) Lysine methyltransferase Smyd2 regulates Hsp90-mediated protection of the sarcomeric titin springs and cardiac function. Biochim Biophys Acta 1833:812–822. https://doi.org/10.1016/j.bbamcr.2012.09.012
Saddic LA, West LE, Aslanian A, Yates JR, Rubin SM, Gozani O et al (2010) Methylation of the retinoblastoma tumor suppressor by SMYD2. J Biol Chem 285:37733–37740. https://doi.org/10.1074/jbc.M110.137612
Cho H-S, Hayami S, Toyokawa G, Maejima K, Yamane Y, Suzuki T et al (2012) RB1 methylation by SMYD2 enhances cell cycle progression through an increase of RB1 phosphorylation. Neoplasia 14:476–486
Zhang X, Tanaka K, Yan J, Li J, Peng D, Jiang Y et al (2013) Regulation of estrogen receptor α by histone methyltransferase SMYD2-mediated protein methylation. Proc Natl Acad Sci USA 110:17284–17289. https://doi.org/10.1073/pnas.1307959110
Nakakido M, Deng Z, Suzuki T, Dohmae N, Nakamura Y, Hamamoto R (2015) Dysregulation of AKT pathway by SMYD2-mediated lysine methylation on PTEN. Neoplasia 17:367–373. https://doi.org/10.1016/j.neo.2015.03.002
Piao L, Kang D, Suzuki T, Masuda A, Dohmae N, Nakamura Y et al (2014) The histone methyltransferase SMYD2 methylates PARP1 and promotes poly(ADP-ribosyl)ation activity in cancer cells. Neoplasia 16(257–64):264.e2. https://doi.org/10.1016/j.neo.2014.03.002
Li LX, Fan LX, Zhou JX, Grantham JJ, Calvet JP, Sage J et al (2017) Lysine methyltransferase SMYD2 promotes cyst growth in autosomal dominant polycystic kidney disease. J Clin Invest 127:2751–2764. https://doi.org/10.1172/JCI90921
Jarrell DK, Hassell KN, Crans DC, Lanning S, Brown MA. Characterizing the role of SMYD2 in mammalian embryogenesis—future directions. Vet Sci 2020;7. https://doi.org/10.3390/vetsci7020063
Li T, Fu J, Zeng Z, Cohen D, Li J, Chen Q, et al. TIMER2.0 for analysis of tumor-infiltrating immune cells. Nucleic Acids Res 2020;48:W509–14. https://doi.org/10.1093/nar/gkaa407
Tang Z, Kang B, Li C, Chen T, Zhang Z (2019) GEPIA2: an enhanced web server for large-scale expression profiling and interactive analysis. Nucleic Acids Res 47:W556–W560. https://doi.org/10.1093/nar/gkz430
Chandrashekar DS, Bashel B, Balasubramanya SAH, Creighton CJ, Ponce-Rodriguez I, Chakravarthi BVSK et al (2017) UALCAN: a portal for facilitating tumor subgroup gene expression and survival analyses. Neoplasia 19:649–658. https://doi.org/10.1016/j.neo.2017.05.002
Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA et al (2012) The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov 2:401–404. https://doi.org/10.1158/2159-8290.CD-12-0095
Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal 2013;6:pl1. https://doi.org/10.1126/scisignal.2004088
Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z et al (2016) Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 44:W90–W97. https://doi.org/10.1093/nar/gkw377
Bansal A, Singh TR. Chapter 9 - Epigenome-Wide DNA Methylation and histone modification of Alzheimer’s disease. Wei LK, editor. Computational Epigenetics and Diseases, Academic Press; 2019; 9: 131–48. https://doi.org/10.1016/B978-0-12-814513-5.00009-X
Shukla A, Sehgal M, Singh TR (2015) Hydroxymethylation and its potential implication in DNA repair system: a review and future perspectives. Gene 564:109–118. https://doi.org/10.1016/j.gene.2015.03.075
Kulis M, Esteller M (2010) DNA methylation and cancer. Adv Genet 70:27–56. https://doi.org/10.1016/B978-0-12-380866-0.60002-2
Wajed SA, Laird PW, DeMeester TR (2001) DNA methylation: an alternative pathway to cancer. Ann Surg 234:10–20
Grivennikov SI, Greten FR, Karin M (2010) Immunity, inflammation, and cancer. Cell 140:883–899. https://doi.org/10.1016/j.cell.2010.01.025
Whiteside TL (2008) The tumor microenvironment and its role in promoting tumor growth. Oncogene 27:5904–5912. https://doi.org/10.1038/onc.2008.271
Liu T, Han C, Wang S, Fang P, Ma Z, Xu L et al (2019) Cancer-associated fibroblasts: an emerging target of anti-cancer immunotherapy. J Hematol Oncol 12:86. https://doi.org/10.1186/s13045-019-0770-1
Ping Q, Yan R, Cheng X, Wang W, Zhong Y, Hou Z, et al. Cancer-associated fibroblasts: overview, progress, challenges, and directions. Cancer Gene Ther 2021:1–16. https://doi.org/10.1038/s41417-021-00318-4
Sahai E, Astsaturov I, Cukierman E, DeNardo DG, Egeblad M, Evans RM et al (2020) A framework for advancing our understanding of cancer-associated fibroblasts. Nat Rev Cancer 20:174–186. https://doi.org/10.1038/s41568-019-0238-1
Sohn E (2020) Better cancer screening in resource-poor nations. Nature 579:S17–S19
Shukla A, Singh TR (2021) Structure based inference of functional single nucleotide polymorphism and its role in TGFβ1 allied colorectal cancer (CRC). Int J Bioinform Res Appl 17:80–99. https://doi.org/10.1504/IJBRA.2021.113965
Liu D, Wang X, Shi E, Wang L, Nie M, Li L et al (2021) Comprehensive analysis of the value of SMYD family members in the prognosis and immune infiltration of malignant digestive system tumors. Front Genet 12:699910. https://doi.org/10.3389/fgene.2021.699910
Zhou Z, Sun B, Nie A, Yu D, Bian M (2020) Roles of aminoacyl-tRNA synthetases in cancer. Front Cell Dev Biol 8:599765. https://doi.org/10.3389/fcell.2020.599765
Kim S, You S, Hwang D (2011) Aminoacyl-tRNA synthetases and tumorigenesis: more than housekeeping. Nat Rev Cancer 11:708–718. https://doi.org/10.1038/nrc3124
Seo YR, Kelley MR, Smith ML (2002) Selenomethionine regulation of p53 by a ref1-dependent redox mechanism. PNAS 99:14548–14553. https://doi.org/10.1073/pnas.212319799
Kovacs Z, Jung I, Gurzu S (2019) Arylsulfatases A and B: from normal tissues to malignant tumors. Pathol Res Pract 215:152516. https://doi.org/10.1016/j.prp.2019.152516
Hua SH, Viera M, Yip GW, Bay BH (2023) Theranostic applications of glycosaminoglycans in metastatic renal cell carcinoma. Cancers 15:266. https://doi.org/10.3390/cancers15010266
Gendaszewska-Darmach E, Garstka MA, Błażewska KM (2021) Targeting small GTPases and their prenylation in diabetes mellitus. J Med Chem 64:9677–9710. https://doi.org/10.1021/acs.jmedchem.1c00410
Parker AL, Teo WS, McCarroll JA, Kavallaris M (2017) An emerging role for tubulin isotypes in modulating cancer biology and chemotherapy resistance. Int J Mol Sci 18:1434. https://doi.org/10.3390/ijms18071434
Masoud GN, Li W (2015) HIF-1α pathway: role, regulation and intervention for cancer therapy. Acta Pharm Sin B 5:378–389. https://doi.org/10.1016/j.apsb.2015.05.007
Jin X, Dai L, Ma Y, Wang J, Liu Z (2020) Implications of HIF-1α in the tumorigenesis and progression of pancreatic cancer. Cancer Cell Int 20:273. https://doi.org/10.1186/s12935-020-01370-0
Acknowledgements
The authors thank to Jaypee University of Information for providing the facilities to perform the research study.
Funding
No fund was received.
Author information
Authors and Affiliations
Contributions
TRS conceived the project. AKY collected the data and perform the analysis and major contributor in writing the research. Both authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1:
Supplementary Table 1. Type of SMYD2 gene alterations present in different TCGA tumor samples with their alteration frequency aand alteration count.
Additional file 2:
Supplementary Table 2. Correlation of SMYD2 expression with cancer associated fibroblast and immune cell infiltration in various cancer types using multiple algorithms.
Additional file 3:
Supplementary Table 3. 100 correlated genes with SMYD2 by combining all TCGA tumor expression data.
Additional file 4: Supplementary Table 4
. Significant pathways and functional enrichment analysis list with GO term of SMYD2 related genes.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yadav, A.K., Singh, T.R. Computational approach for assessing the involvement of SMYD2 protein in human cancers using TCGA data. J Genet Eng Biotechnol 21, 122 (2023). https://doi.org/10.1186/s43141-023-00594-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s43141-023-00594-7