In silico comparative structural and functional analysis of arsenite methyltransferase from bacteria, fungi, fishes, birds, and mammals
Journal of Genetic Engineering and Biotechnology volume 21, Article number: 64 (2023)
Arsenic, a ubiquitous toxic metalloid, is a threat to the survival of all living organisms. Bioaccumulation of arsenic interferes with the normal physiological pathway. To overcome arsenic toxicity, organisms have developed arsenite methyltransferase enzyme, which methylates inorganic arsenite to organic arsenic MMA (III) in the presence of S-adenosylmethionine (SAM). Bacteria-derived arsM might be horizontally transported to different domains of life as arsM or as3mt (animal ortholog). A systematic study on the functional diversity of arsenite methyltransferase from various sources will be used in arsenic bioremediation.
Several arsenite methyltransferase protein sequences of bacteria, fungi, fishes, birds, and mammals were retrieved from the UniProt database. In silico physicochemical studies confirmed the acidic, hydrophilic, and thermostable nature of these enzymes. Interkingdom relationships were revealed by performing phylogenetic analysis. Homology modeling was performed by SWISS-MODEL, and that was validated through SAVES-v.6.0. QMEAN values ranged from − 0.93 to − 1.30, ERRAT score (83–96), PROCHECK (88–92%), and other parameters suggested models are statistically significant. MOTIF and PrankWeb discovered several functional motifs and active pockets within the proteins respectively. The STRING database showed protein–protein interaction networks.
All of our in silico studies confirmed the fact that arsenite methyltransferase is a cytosolic stable enzyme with conserved sequences over a wide range of organisms. Thus, because of its stable and ubiquitous nature, arsenite methyltransferase could be employed in arsenic bioremediation.
Arsenic, a ubiquitously found metalloid, has been ranked 20th, 14th, and 12th in the earth’s crust, seawater, and human body respectively based on its occurrence [1, 2]. Rock weathering, erosion, volcanic eruption, extensive mining, use of pesticides, etc. are notable causes of environmental arsenic contamination [1,2,3,4,5]. Different forms of arsenic exposure have led to drastic metabolic changes, even death, thereby affecting more than 300 million people in over 115 countries [3, 4]. Arsenic contaminants found in drinking water, food, or sometimes air cause skin, liver, lung, bladder cancers, and cardiovascular disease, mental disorder, etc. .
The two most common forms of arsenic, i.e., arsenite [As (III)] and arsenate [As (V)], enter through aquaglyceroporins and phosphate channels of bacteria respectively and interfere with their metabolism. These elements produce reactive oxygen species (ROS) in the cell, which leads to DNA damage or mutation and impairs enzymatic function [6, 7]. ATP synthesis gets disrupted due to abrupt changes in mitochondrial membrane potential in eukaryotes. Arsenic toxicity leads to the generation of nitric oxide (NO), superoxide ions (O2−), and hydroxyl radicals (OH), consequently triggering tumor formation .
Nowadays, arsenic-resistant bacteria and fungi are being deployed in bioremediation, due to their ability to perform biosorption, bioaccumulation, biotransformation [8, 9]. Though several living organisms ranging from bacteria to humans can methylate inorganic arsenic during detoxification, higher plants cannot do so . Different methylated forms of arsenic, viz., monomethyl arsinic acids (MMAs), dimethyl arsinic acids (DMAs), trimethyl arsinic acids (TMAs), arsenosugars, arsenolipids, etc. are found in nature. Arsinothricin (AST) is a methylarsenical antibiotic that is used as a bacterial weapon to protect themselves from other strains. The presence of arsenic could be detected in the lipid extracts of fishes . Yang et al.  have reported that food and water act as the mode of entry of arsenic in bird species. They detected the presence of arsenite, arsenate, DMA, MMA, etc. in the feathers and muscles of two birds inhabiting a highly arsenic-contaminated area of China . In human beings, arsenite is at first methylated to MMA and is subsequently converted to DMA, the excreted form. Scientific studies have proved MMA to be more dangerous than DMA . Even recent researches suggest methylated arsenic species to be a potent cause of women’s breast cancer .
arsM system has been developed in ancient bacterial strains to detoxify the harmful effects of arsenic. In the presence of SAM (S-adenosylmethionine), arsenite methyltransferase (encoded by arsM gene) can methylate arsenite. Horizontal gene transfer (HGT) plays a key decisive role in the development of resistance across various domains of living organisms . Orthologs of arsM are observed in fungi and animals as arsM and as3mt, respectively. Arsenite methyltransferase contains three conserved domains, viz., N-terminal domain (that binds with SAM), middle domain (that deals with arsenite), and C-terminal domain (function unknown). Three to 4 conserved cysteine residues are very important for proper enzymatic function for all types of organisms. AS3MT enzyme is found in the liver of human beings .
Arsenic has been an infamous environmental pollutant from the beginning of time. Bioaccumulation and biomagnification of arsenic exhibit undesirable physiological changes in the living organisms, affecting their survival. Thus, bioremediation of this toxic metalloid is of utmost necessity in this predicament. The omnipresent arsenite methyltransferase will play a crucial role at this juncture. Comparative in silico analyses of this enzyme among different domains of life are not well studied to date. This present study will aid in understanding their inter-relationships and physiochemical characteristics (both structural and functional conservation of amino acids, etc.).
Arsenic methyltransferase from various sources was assessed for structural and functional properties to better understand the roles of arsenic bioremediation capability. This in silico study will thus help to develop a cost-effective and efficient method of utilizing arsenite methylarsenite as an arsenic bioremediation agent in future in vivo applications.
Materials and methods
One-hundred fifteen amino acid sequences (i.e., 25 amino acid sequences of bacteria, 25 amino acid sequences of birds, 25 amino acid sequences of mammals, 20 amino acid sequences of fungi, and 20 amino acid sequences of fishes) of arsenite methyltransferase were retrieved in FASTA format from UniProtKB database (https://www.uniprot.org/) on 27th–30th May 2021.
Primary sequence analysis and phylogenetic tree construction
ExPASy ProtParam (https://web.expasy.org/protparam/) was used to determine amino acid sequences, theoretical PI, aliphatic index, instability index, grand average of hydropathicity (GRAVY), etc.  of arsenite methyltransferase. Phylogenetic relationships among organisms were studied using MEGA-X software, and a phylogenetic tree was constructed based on 500 bootstrap values .
Analysis of secondary structure
Hydrogen bonding between amino acids containing amide hydrogen and carbonyl oxygen is responsible for the construction of secondary structures in proteins. α-helix and β-sheets are common secondary structures within proteins. In this study, comparative analysis of secondary structures, viz., α-helix, β-turn, extended loop, and random coil, was done, after selecting five sequences for each organism by SOPMA (https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_sopma.html) web server tool .
Analysis of tertiary structure
SWISS-MODEL workplace (https://swissmodel.expasy.org/) was used to predict the 3D structure of selected enzymes . Total five structures were predicted (one structure per organism), and the most suitable templates were considered for modeling. Constructed models were then validated and reanalyzed by using another web server SAVES v6.0 (https://saves.mbi.ucla.edu/). The models were processed through the ERRAT server , PROCHECK , and VERIFY-3D  of SAVES v6.0 for qualitative analyses. Salt bridge combinations were detected by ESBRI (http://bioinformatica.isa.cnr.it/ESBRI/introduction.html) .
Functional analysis of enzymes
PrankWeb server (https://prankweb.cz/) was used to predict probable ligand binding sites of the enzymes . Sequence-based structural conservation was studied by MEGA . Cofactory v.1.0 (http://www.cbs.dtu.dk/services/Cofactory/) server, a tool for identification of cofactor(s) related with enzymatic function, was implemented . Protein localization within the cells was confirmed by SignalP 5.0 (http://www.cbs.dtu.dk/services/SignalP/) server . Transmembrane helices were predicted using TMHMM server v. 2.0 (http://www.cbs.dtu.dk/services/TMHMM/) . Related motifs were searched of these enzymes through MOTIF tool (https://www.genome.jp/tools/motif). Finally, protein–protein interaction networks were studied by STRING database (https://string-db.org/) .
Amino acid sequences of arsenite methyltransferase enzymes of 25 bacteria, 20 fungi, 20 fishes, 25 birds, and 25 mammals were retrieved from UniProtKB, and their protein accession numbers with respective organisms are documented in Supplementary data 1.
Primary sequence analysis and phylogenetic tree construction
Amino acids are considered as building blocks for any protein. Thus, determination of the number and position (i.e., sequence) of amino acids is crucial factors for any structural and functional properties of a protein. Among 20 amino acids, the highest average concentration was found for alanine in arsenite methyltransferase. Bacteria contained the highest percentage of alanine (> 10%), and mammals showed minimum value (< 7%). The lowest concentration of amino acid goes to tryptophan (< 1%) for all the organisms (Fig. 1a). The average percentages of nonpolar uncharged amino acids were higher than the rest two, whereas polar charged and polar uncharged amino acid concentrations are moderate and least respectively.
However, the total average percentage of polar amino acid concentration (7–8%) is quite higher than nonpolar residues (6–7%), indicating that protein is slightly hydrophilic in nature (Fig. 1b). Average molecular weight of the proteins varies from about 29 to 42 kDa (Fig. 1c) which is quite less or equal to another arsenic detoxification enzyme, arsenite oxidase . Average pI of proteins varies from 4.8 to 5.7 which indicates arsenite methyltransferase is slightly acidic in nature. The highest and lowest pI values were observed in case of a bird species, Rhodinocichla rosea (6.5), and bacteria, Mumia sp. (4.33) (Supplementary data 2) respectively. Among 115 protein sequences, only 6 have instability index (II) more than 40. Report suggested that II value lower than 40 indicates stable proteins. Aliphatic index is the volume occupied by aliphatic side chains of a protein of aliphatic amino acids: alanine, valine, isoleucine, and leucine. Average aliphatic index of the proteins is above 80% confirming their thermostability [15, 29]. Negatively charged amino acids (NCAA) are more in number than positively charged amino acids (PCAA). A negative GRAVY value (Fig. 1d) indicates the hydrophilic nature of the protein .
Enzyme of Mus sp. is showing a separate lineage in the phylogenetic tree (Fig. 2). The arsenite methyltransferase enzymes of Zapornia atra, a bird, also separated itself from other bird lineages. Interestingly, arsenite methyltransferase enzymes of Arthroderma otae and Arthroderma gypseum are belonging to the same clade, while A. gypseum shared the same ancestor with Trichophyton tonsurans, although both the genera belong to Arthrodermataceae. Close inter-relationships could be noticed between the proteins of Homo sapiens and Pan troglodytes. This complex relationship indicates complex evolutionary relationships among different organisms. Phylogenetic trees for individual organisms were constructed which are represented in Supplementary data 3.
Analysis of secondary structures
The average comparative secondary structures such as α-helix, β-turn, extended loop, and random coil were computed and graphically represented in Fig. 3a. α-helix and random coils were found to be most abundant. For fungi, average percentage of occurrence of α-helix is more than 42%, while in mammals it is the lowest, 36%. Among three helical structures (α-helix, β-turn, and 310-helix), α-helix is most stable structure. Proline is an α-helix breaker, and the presence of excess amount of proline may lead to aperiodic protein structure . Excessive random coils indicate that protein showed many conserved regions with evolutionary significance. Random coils sometimes give more flexibility to protein .
Analysis of tertiary structure
All sequences of arsenite methyltransferase enzymes were analyzed through SWISS-MODEL workplace for selecting the best suitable sequences on basis of QMEAN score: bacteria (QMEAN − 4.33 to − 0.93); fungi (QMEAN − 5.54 to − 0.46); fishes (QMEAN − 3.68 to − 0.94); birds (QMEAN − 3.29 to − 0.94); and mammals (QMEAN − 1.34 to 2.90). Among them, the arsenite methyltransferase enzymes of bacterium Clostridium sp. (− 0.93); fungus Armillaria solidipes (− 0.96); fish Oryzias latipes (− 0.94); bird Dasyornis broadbenti (− 0.94); and mammal Felis catus (− 1.34) were taken for further analysis (Fig. 4). ERRAT quality factors ranged from 83 to 96 (Table 1) confirming the high resolutions of the protein structures. Here, in this experiment, proteins of bacteria and fungi displayed acceptable results (Supplementary data 4 and Table 1).
Among five organisms, VERIFY-3D revealed the least average 3D-1D score gained in fungi (82.80%), whereas others showed good scores (Table 1). A total of 90% or more residues occurred in the most favored regions in the Ramachandran plot, thereby indicating good protein model quality of Armillaria and Dasyornis (Table 1). However, other models showed more than 95% residues in allowed region too (Supplementary data 5). Additionally, Ramachandran plots of proteins, Chi1-Chi2 plots, and Ramachandran plots of individual amino acids have been provided in Supplementary data 6 and 7. Salt bridges are the interactions between side chains of a protein-associated positively (Lys, His, and Arg) and negatively charged (Asp and Glu) amino acids between the bond distances of 7 Å. There are several salt bridges present as suggested by ESBRI server  (Fig. 3b). The protein sequence of a bird, Dasyornis sp., had maximum pair of charged amino acids involved in salt bridges formation (i.e., 13 amino acids), while fungus Armillaria sp. had a minimum number of amino acids involved (i.e., 3 amino acids). Most dominating salt bridges are Arg-Asp, whereas only His-Asp salt bridge is present in F. catus.
Functional analysis of the enzyme
PrankWeb results showed that within the enzyme, though several numbers (4–6) of functional pockets are present, the numbers of amino acids involved in pocket formation remain approximately the same (except in the case of Armillaria sp., where 36 amino acids lead to the formation of 5 pockets) (Fig. 5f). Pockets are dominated by charged or polar (serine, threonine, cysteine, etc.), occasionally nonpolar (isoleucine, leucine, proline, or glycine) amino acids. Cysteine, which has a pivotal role in arsenite methylation, was found either in pocket 1 or 2 or in both pockets in case of all enzymes, except that of Armillaria sp. (Fig. 5a–e). Local alignment of five sequences using the MEGA software revealed that there exists much significant sequence similarity among the amino acid sequences of the enzymes. By using MUSCLE algorithm of MEGA and based on 100% conservation sites, many conserved amino acids within the amino acid position of 94–103 were revealed (Supplementary data 8). On the other hand, only a few conserved sequences could be found within positions 167–210 in a scattered manner. Occasional swapping of positively charged amino acids could be observed too. For example, the 200th amino acid of the MUSCLE algorithm alignment, arginine (R) replaced lysine (K) in case of fungi, Armillaria (Supplementary data 8). Interestingly, high sequence conservation was observed among bacteria and fungi, signifying their close evolutionary lineage. Cofactors are chemical compounds or metal ions that are associated with enzymes for proper functioning. Cofactory v.1.0 web server can find out FAD, NAD, or NADH cofactors (if) present in enzyme . However, no cofactors were found related to the enzyme. After translation, a signal peptide present in the N-termini of protein guides the protein to its target location. SignalP 5.0 finds out signal peptide along with its position in the protein, but here, no such signal peptide was discovered (Supplementary data 9). Additionally, TMHMM server did not find any transmembrane domain in it (Supplementary data 10) which also supports its subcellular localization. On the basis of these results, we can consider arsenite methyltransferase to be a cytosolic enzyme. MOTIF search (Supplementary data 11) revealed the existence of several functional domains of which the methyltransferase domain was the most predominant one. This domain of protein interacts with SAM and arsenite, where SAM donates methyl groups to arsenite; as a result, methylated arsenic species are generated . STRING analysis reveals probable overall interactomics of the enzymes with different proteins, enzymes, etc. Animal arsenite methyltransferases (AS3MT) and their interactions with other proteins are well documented in Supplementary data 12.
Arsenic became one of the most threatening hazards since early origin of life [about 4 billion years ago (bya)] on earth, and its concentration drastically increased in the late Archean eon (3–2.5 bya). To fight against arsenic toxicity, early organisms developed ArsM enzyme which promoted defense mechanism for respective microorganisms. Gradually, after the Great Oxidation Event (GOE, 2.45–2.32 bya), it evolved as an arsenic-detoxifying enzyme . Chen et al.  also argued that at least six horizontal gene transfers of arsM gene occurred between different kingdoms of life, resulting in a high diversity of inorganic arsenic-methylating species. Methylation and volatilization are enormously important and satisfactory mechanism adopted by bacteria and fungi for arsenic bioremediation [33,34,35]. Here, the comparison of arsenite methyltransferase from five groups of organisms, i.e., bacteria, fungi, fishes, birds, and mammals, was performed. A fungal species Armillaria solidipes showed close similarity with the bacteria Janibacter sp. (Fig. 2), thereby indicating the probability of horizontal gene transfer between two different kingdoms. Thermostable proteins are very important for industrial as well as in situ bioremediation applications. High aliphatic index (AI) (72–97) confirmed the thermostable nature of the proteins . α-helix, the key regulator of protein stability, is found to occur more frequently in the proteins of thermophilic organisms than those of mesophilic ones . In this study, α-helix occupied more than 35% (average) of the protein structure leading to good thermostability which is quite higher than extended strand and β-turn (Fig. 3a). Other physicochemical parameters, such as pI and instability indexes (Supplementary data 2), are important features for laboratory-based protein isolation and purifications. For homology modeling, in SWISS-MODEL workplace, most suitable models (confirmed by SAVES 6.0 server) were selected for subsequent alignment [viz., PDB ID: 4FR0 (for Clostridium); PDB ID: 4KW7 (for Armillaria) and PDB ID: 5EVJ (for three animals)] (Supplementary data 13). Former two are the proteins of a red alga, Cyanidioschyzon, whereas 5EVJ is identified as CrArsM of Chlamydomonas reinhardtii. Addition or deletion of salt bridges within protein structure may decrease or increase its stability respectively. Carboxyl oxygen of negatively charged amino acids interacts with nitrogen atoms of positively charged amino acids when they are present within a 4.0 Å distance either in the same polypeptide sequence or different . In the present study, lysine and arginine are common positively charged amino acids involved in salt bridge formation. STRING analysis revealed that every enzyme has significantly more interaction probabilities than expected and showed first and second shells of interactions (Supplementary data 12). Bacterial ArsM interacts with arsenate reductase (ArsC), another arsenic-detoxifying enzyme responsible for reduction of arsenate to arsenite , and also some permeases which pump out arsenicals from cells. The fungal protein interacts with calmodulin, chaperones, ArsH, involved in enzymatic transformations of methylated arsenite , etc. Among animals, fish protein (AS3MT) interacts with aquaglyceroporins, sometimes related with arsenite import into the cell. It was also found to be interconnected with mitochondrial energy-producing enzymes by both first and second shells of interactions. Bird AS3MT shows similarities with fish AS3MT; additively, it interacts with some defense-related proteins. Finally, Felis catus enzyme showed active participation in glutathione metabolism. For animals, every AS3MT interacts with mitochondria-associated proteins involved in energy generation. So, there would be a relationship between arsenite biotransformation and energy generation within cells. For animals interactome, one enzyme, putative N-6 adenine-specific DNA methyltransferase 1 (N6AMT1), is common, which transforms monomethylarsonous acid to dimethylarsinic acid . Besides humans, this enzyme is also present in fishes and birds confirming the evolutionary conservation of the enzyme.
Arsenite methyltransferase is a thermostable, hydrophilic, evolutionary conserved enzyme involved in arsenite biotransformation (i.e., methylation). This in silico study sheds some light into the probable role of the amino acids in arsenite methylation. The conserved sequences and domains among different classes of arsenite methyltransferase promote its employability in bioremediation over different ecologies.
Availability of data and materials
All data generated or analyzed during this study are included in this published article [and its supplementary information files].
- Ali. Index:
Entrepreneurship and Small Business Research Institute
- Ex. Co.:
Flavin adenine dinucleotide
Grand average of hydrophobicity
Horizontal gene transfer
Molecular Evolutionary Genetics Analysis
Monomethyl arsinic acid
Nicotinamide adenine dinucleotide
Reduced nicotinamide adenine dinucleotide
Negatively charged amino acids
Positively charged amino acids
Protein Data Bank
Qualitative Model Energy ANalysis
Reactive oxygen species
Trimethyl arsinic acid
Chen QY, Costa M (2021) Arsenic: a global environmental challenge. Annu Rev Pharmacol Toxicol 61:47–63. https://doi.org/10.1146/annurev-pharmtox-030220-013418
Golfinopoulos SK, Varnavas SP, Alexakis DE (2021) The status of arsenic pollution in the Greek and Cyprus environment: an overview. Water 13(2):224. https://doi.org/10.3390/w13020224
Kumar A, Ali M, Kumar R, Kumar M, Sagar P, Pandey RK, Akhouri V, Kumar V, Anand G, Niraj PK, Rani R, Kumar S, Kumar D, Bishwapriya A, Ghosh AK (2021) Arsenic exposure in Indo Gangetic Plains of Bihar causing increased cancer risk. Sci Rep 11(1):1–16. https://doi.org/10.1038/s41598-021-81579-9
Hussain MM, Wang J, Bibi I, Shahid M, Niazi NK, Iqbal J, Mian IA, Shaheen SM, Bashir S, Shah SN, Hina K, Rinklebe J (2021) Arsenic speciation and biotransformation pathways in the aquatic ecosystem: the significance of algae. J Hazard Mater 403:124027. https://doi.org/10.1016/j.jhazmat.2020.124027
Yin G, Xia L, Hou Y, Li Y, Cao D, Liu Y, Chen J, Liu J, Zhang L, Yang Q, Zhang Q, Tang N (2021) Transgenerational male reproductive effect of prenatal arsenic exposure: abnormal spermatogenesis with Igf2/H19 epigenetic alteration in CD1 mouse. Int J Environ Health Res 1–13. https://doi.org/10.1080/09603123.2020.1870668
Jelinkova P, Vesely R, Cihalova K, Hegerova D, Ananbeh HAAA, Richtera L, Smerkova K, Brtnicky M, Kynicky J, Moulick A, Adam V (2018) Effect of arsenic (III and V) on oxidative stress parameters in resistant and susceptible Staphylococcusaureus. Environ Res 166:394–401. https://doi.org/10.1016/j.envres.2018.06.024
Mandal P (2017) An insight of environmental contamination of arsenic on animal health. Emerg Contam 3(1):17–22. https://doi.org/10.1016/j.emcon.2017.01.004
Srivastava PK, Vaish A, Dwivedi S, Chakrabarty D, Singh N, Tripathi RD (2011) Biological removal of arsenic pollution by soil fungi. Sci Total Environ 409(12):2430–2442. https://doi.org/10.1016/j.scitotenv.2011.03.002
Irshad S, Xie Z, Mehmood S, Nawaz A, Ditta A, Mahmood Q (2021) Insights into conventional and recent technologies for arsenic bioremediation: a systematic review. Environ Sci Pollut Res 1–23. https://doi.org/10.1007/s11356-021-12487-8
Tang Z, Lv Y, Chen F, Zhang W, Rosen BP, Zhao FJ (2016) Arsenic methylation in Arabidopsisthaliana expressing an algal arsenite methyltransferase gene increases arsenic phytotoxicity. J Agric Food Chem 64(13):2674–2681. https://doi.org/10.1021/acs.jafc.6b00462
Chen J, Rosen BP (2020) The arsenic methylation cycle: how microbial communities adapted methylarsenicals for use as weapons in the continuing war for dominance. Front Environ Sci. https://doi.org/10.3389/fenvs.2020.00043
Yang F, Xie S, Liu J, Wei C, Zhang H, Chen T, Zhang J (2018) Arsenic concentrations and speciation in wild birds from an abandoned realgar mine in China. Chemosphere 193:777–784. https://doi.org/10.1016/j.chemosphere.2017.11.098
López-Carrillo L, Gamboa-Loira B, Gandolfi AJ, Cebrián ME (2020) Inorganic arsenic methylation capacity and breast cancer by immunohistochemical subtypes in northern Mexican women. Environ Res 184:109361. https://doi.org/10.1016/j.envres.2020.109361
Chen SC, Sun GX, Rosen BP, Zhang SY, Deng Y, Zhu BK, Rensing C, Zhu YG (2017) Recurrent horizontal transfer of arsenite methyltransferase genes facilitated adaptation of life to arsenic. Sci Rep 7(1):1–11. https://doi.org/10.1038/s41598-017-08313-2
Gasteiger E, Hoogland C, Gattiker A, Wilkins MR, Appel RD, Bairoch A (2005) Protein identification and analysis tools on the ExPASy server. The proteomics protocols handbook. pp 571–607. https://doi.org/10.1385/1-59259-890-0:571
Kumar S, Stecher G, Li M, Knyaz C, Tamura K (2018) MEGA X: Molecular Evolutionary Genetics Analysis across computing platforms. Mol Biol Evol 35(6):1547. https://doi.org/10.1093/molbev/msy096
Geourjon C, Deleage G (1995) SOPMA: significant improvements in protein secondary structure prediction by consensus prediction from multiple alignments. Bioinform 11(6):681–684. https://doi.org/10.1093/bioinformatics/11.6.681
Schwede T, Kopp J, Guex N, Peitsch MC (2003) SWISS-MODEL: an automated protein homology-modeling server. Nucleic Acids Res 31(13):3381–3385. https://doi.org/10.1093/nar/gkg520
Colovos C, Yeates TO (1993) Verification of protein structures: patterns of nonbonded atomic interactions. Protein Sci 2(9):1511–1519. https://doi.org/10.1002/pro.5560020916
Laskowski RA, MacArthur MW, Moss DS, Thornton JM (1993) PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Crystallogr 26(2):283–291. https://doi.org/10.1107/S0021889892009944
Lüthy R, Bowie JU, Eisenberg D (1992) Assessment of protein models with three-dimensional profiles. Nature 356(6364):83–85. https://doi.org/10.1038/356083a0
Costantini S, Colonna G, Facchiano AM (2008) ESBRI: a web server for evaluating salt bridges in proteins. Bioinformation 3(3):137. https://doi.org/10.6026/97320630003137
Jendele L, Krivak R, Skoda P, Novotny M, Hoksza D (2019) PrankWeb: a web server for ligand binding site prediction and visualization. Nucleic Acids Res 47(W1):W345–W349. https://doi.org/10.1093/nar/gkz424
Geertz-Hansen HM, Blom N, Feist AM, Brunak S, Petersen TN (2014) Cofactory: sequence-based prediction of cofactor specificity of Rossmann folds. Proteins 82(9):1819–1828. https://doi.org/10.1002/prot.24536
Armenteros JJA, Tsirigos KD, Sønderby CK, Petersen TN, Winther O, Brunak S, Gv Heijne, Nielsen H (2019) SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat Biotechnol 37(4):420–423. https://doi.org/10.1038/s41587-019-0036-z
Krogh A, Larsson B, Von Heijne G, Sonnhammer EL (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305(3):567–580. https://doi.org/10.1006/jmbi.2000.4315
Szklarczyk D, Gable AL, Nastou KC, Lyon D, Kirsch R, Pyysalo S, Doncheva NT, Legeay M, Fang T, Bork P, Jensen LJ, von Mering C (2021) The STRING database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res 49(D1):D605–D612. https://doi.org/10.1093/nar/gkaa1074
Pal S, Sengupta K (2021) In silico analysis of phylogeny, structure, and function of arsenite oxidase from unculturable microbiome of arsenic contaminated soil. J Genet Eng Biotechnol 19(1):1–14. https://doi.org/10.1186/s43141-021-00146-x
Ikai A (1980) Thermostability and aliphatic index of globular proteins. J Biochem 88(6):1895–1898. https://doi.org/10.1093/oxfordjournals.jbchem.a133168
Damodaran S (2008) Amino acids, peptides and proteins. Fennema’s food chemistry. Tailor & Francis group, London
Dutta B, Deska J, Bandopadhyay R, Shamekh S (2021) In silico characterization of bacterial chitinase: illuminating its relationship with archaeal and eukaryotic cousins. J Genet Eng Biotechnol 19(1):1–11. https://doi.org/10.1186/s43141-021-00121-6
Wood TC, Salavagionne OE, Mukherjee B, Wang L, Klumpp AF, Thomae BA, Eckloff BW, Schaid DJ, Weiben ED, Weinshilboum RM (2006) Human arsenic methyltransferase (AS3MT) pharmacogenetics: gene resequencing and functional genomics studies. J Biol Chem 281(11):7364–7373. https://doi.org/10.1074/jbc.M512227200
Sher S, Rehman A (2019) Use of heavy metals resistant bacteria—a strategy for arsenic bioremediation. Appl Microbiol Biotechnol 103(15):6007–6021. https://doi.org/10.1007/s00253-019-09933-6
Huda N, Khanom A, Mizanur Rahman M, Huq A, Rahman M, Banu NA (2021) Biochemical process and functional genes of arsenic accumulation in bioremediation: agricultural soil. Int J Environ Sci Technol 1–20. https://doi.org/10.1007/s13762-021-03655-x
Satyapal GK, Kumar N (2021) Arsenic: source, distribution, toxicity and bioremediation. In: Kumar N (ed) Arsenic toxicity: challenges and solutions, 1st edn. Springer, Singapore. https://doi.org/10.1007/978-981-33-6068-6_6
Kumar S, Tsai CJ, Nussinov R (2000) Factors enhancing protein thermostability. Protein Eng 13(3):179–191. https://doi.org/10.1093/protein/13.3.179
Rahman MS, Hossain MS, Saha SK, Rahman S, Sonne C, Kim KH (2021) Homology modeling and probable active site cavity prediction of uncharacterized arsenate reductase in bacterial spp. Appl Biochem Biotechnol 193(1):1–18. https://doi.org/10.1007/s12010-020-03392-w
Zhang H, Ge Y, He P, Chen X, Carina A, Qiu Y, Aga DS, Ren X (2015) Interactive effects of N6AMT1 and As3MT in arsenic biomethylation. Toxicol Sci 146(2):354–362. https://doi.org/10.1093/toxsci/kfv101
The authors are also thankful to UGC-Center of Advanced Study and DST-FIST, Department of Botany, the University of Burdwan, for pursuing research activities. AK is thankful to DHESTBT (WB-DBT) for financial support [Memo no. 30 (Sanc.)-BT/ST/P/S&T/2G-48/2017]. AL is thankful to the principal of Chandernagore College.
No funding was received for this research work.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
List of organisms with protein accession numbers.
Physiobiochemical analysis of enzymes: ExPASy.
Phylogenetic trees of different organisms.
ERRAT values of proteins.
Ramachandran plots of individual organisms.
Chi1-Chi2 scores of selected enzymes regarding individual amino acids.
Ramachandran plots of individual amino acids of selected enzymes.
Structural conservation among the sequences.
Identification of probable signal sequences within enzymes.
Studies on probable transmembrane helixes within enzymes.
Identification of probable motifs.
Results of STRING analysis.
Alignment of most suitable enzymes with the query sequence.
About this article
Cite this article
Kabiraj, A., Laha, A., Panja, A.S. et al. In silico comparative structural and functional analysis of arsenite methyltransferase from bacteria, fungi, fishes, birds, and mammals. J Genet Eng Biotechnol 21, 64 (2023). https://doi.org/10.1186/s43141-023-00522-9
- Arsenite methyltransferase
- Homology modelling