Insights from the in silico structural, functional and phylogenetic characterization of canine lysyl oxidase protein
Journal of Genetic Engineering and Biotechnology volume 18, Article number: 20 (2020)
Lysyl oxidase is an extracellular regulatory enzyme with an imperative role in interlinking of collagen and elastin by oxidizing lysine residues. Lysyl oxidase has been implicated in incidence of mammary tumors in bitches. Therefore, it becomes significant to study the structural and functional features of this enzyme for a better understanding of its molecular mechanisms.
The detailed computational investigation of the canine lysyl oxidase protein was analyzed in silico with respect to its physicochemical properties, secondary and tertiary structure predictions and functional analysis using standard bioinformatic tools. Lysyl oxidase is a flexible protein with an average molecular weight of around 46 kDa, unstable, hydrophilic, and extracellular (secretory) in nature. Twelve cysteine residues and a disulfide bridge were also found. Secondary structure analysis shows that most of the protein has predominant coiled configuration. A putative copper-binding region signature was predicted. The phylogenetic relationship of canine lysyl oxidase with a vast range of mammalian species indicates that the protein was very well conserved throughout the course of evolution. Top 10 interacting proteins were identified using STRING v10.0 analysis, elastin being the closest interacting protein. Functional analysis by InterproScan predicted protein’s biological role in oxidation-reduction process.
Understanding the structural and functional properties of the protein will facilitate a better understanding of its mechanism of enzyme action. Further, the predicted 3D model will serve as a cornerstone for further understanding towards the tumorigenesis potential of the protein.
Lysyl oxidase, a copper-dependent amine oxidase belongs to the oxido reductase family of enzymes and is secreted in the extracellular space . LOX plays a major role in collagen and elastin crosslinking by acting on the peptidyl lysine and converting it into reactive aldehyde (allysine) which is vital for collagen fibrils stabilization and maintaining the integrity of mature elastin . In a LOX knockout mice model, the physiological significance of lysyl oxidase-mediated crosslinking was illustrated due to severe fragility of the connective tissue of the cardiovascular system leading to the death of the mice either before or shortly after birth .
In canines, lox gene is mapped on chromosome 11. Canine lox contains 7 exons and spans across a genomic region of ~8.16 kb (from 12031440 bp to 12039603 bp) . LOX fragment has three segments that is, signal peptide, propeptide, and mature LOX. By 1990s, five different LOX genes (LOX, LOXL1, LOXL2, LOXL3, and LOXL4) encoding proteins were identified which shared a highly conserved C terminal domain and a diverse N-terminal. Lysyl oxidase enzyme family contains a catalytic domain and a cytokine receptor-like domain at their C termini. Catalytic domain has a copper-binding motif and a unique lysyl-tyrosylquinone (LTQ) cofactor. The coordination of copper into the active site is brought by four histidines of copper-binding motif . The LTQ cofactor is conserved in all LOX like proteins and is essential for the catalytic activity of LOX.
LOX proteins are reported not only in animals but also in archaea, bacteria, and many other eukaryotes revealing a pre-metazoan origin for the LOX family. Based on the present understanding of the mammalian LOX genes, LOX/L1/L5 and LOXL2/L3/L4 are the two LOX superfamilies. LOX/L1/L5 superfamily has been reported in cnidarians and chordates. However, LOXL2/L3/L4 superfamily is present in bilaterian genomes and is reportedly lost in cnidarians. This LOX family is present in protostomes, tunicates, and cephalochordates. Vertebrates are known to have the highest LOX enzymes with all well-known five families (LOX, LOXL1, LOXL2, LOXL3, and LOXL4), one specific family to fishes (LOXL5) and one specific to lampreys (LOXL2/L3/L4). However, no LOX gene was identified either in nematodes, ctenophore, or placozoan .
Lysyl oxidase genes are a family of LOX paralogs suggesting diverse functions due to the different regulation of the LOX family. Novel functions of the LOX family include collagen and elastin crosslinking, tumor progression , histone protein modification , and chemotaxis . LOX activity has been demonstrated in various fibrotic diseases, connective tissue disorders, and hypoxia-induced tumors . Also, over expression of LOX is considered tumor marker for invasiveness in breast cancer, head and neck squamous cell, prostatic and renal cell carcinomas . We have previously reported elevated expression of LOX with incidence of mammary tumors in bitches . Other than cancer, lysyl oxidase activity is also found reduced in nutritional copper deficiency and lathyrism  and in two X-linked recessively inherited disorders, Menkes disease, and occipital horn syndrome (OHS) .
Regardless of its important role, canine lysyl oxidase is not well characterized structurally or functionally. Considering the importance of this protein, the present study is formulated to analyze canine lysyl oxidase gene by assessing its phylogenetic relationships, physicochemical properties, secondary and tertiary structure prediction, motif prediction, and functional analysis.
Retrieval of nucleotides and protein
Canis lupus famiilaris lysyl oxidase nucleotide sequence was retrieved from NCBI (MH330152.1). Basic Local Alignment Search Tool (BLAST) was used to obtain similar sequences in other organisms. Multiple sequence alignment was carried out on all sequences using Molecular Evolutionary Genetic Analysis (MEGA) 6.0 version standalone software . Sequence of lysyl oxidase protein was retrieved from UNIPROT (ID: J9NZK5_CANLF). This sequence was used as an input for Expert Protein Analysis System (ExPASy) which is the proteomic server of Swiss Institute of Bioinformatics (SIB) (https://www.expasy.org/) .
At least 57 organisms were selected (on the basis of highest similarity) to infer the evolutionary relationships with canine sequence as a reference point. The phylogenetic tree was constructed using maximum likelihood method of the Mega 6.0 package. The consistency of the inferred phylogenetic tree was evaluated with bootstrap analysis of 1000 replications.
Primary structural analysis of lysyl oxidase
Primary structural analysis of the lysyl oxidase protein was determined using ProtParam from ExPASy. The biophysical and biochemical properties include molecular weight (Mw), isoelectric point(pI), extinction coefficients (EC-quantitative study of protein-protein and protein-ligand interactions) , instability index (II-stability of proteins) , aliphatic index (AI-relative volume of protein occupied by aliphatic side chains) , grand average hydropathicity (GRAVY-sum of all hydropathicity values of all amino acids divided by number of residues in a sequence) , half-life , and number of positive and negative residues.
Secondary structure characterization
The coding sequence of canine lysyl oxidase gene was translated to protein sequence by using ExPASy translated tool (http://web.expasy.org/translate/). The amino acid sequence was subjected to secondary protein structure prediction by using (http://bioinf.cs.ucl.ac.uk/psipred/). Hydrophilicity plot, antigenic index , and surface probability plot (Emini) were predicted using protean tool—DNASTAR .
Tertiary structure prediction
The tertiary structure prediction of lysyl oxidase protein was modeled through using ab initio approach using online available tool RaptorX (http://raptorx.uchicago.edu/) and Swiss model software (https://swissmodel.expasy.org/). The model, thus obtained, was further validated by Ramachandran’s plot using the RAMPAGE online tool (http://mordred.bioc.cam.ac.uk/~rapper/rampage.php).
CYC_REC tool was used to predict the SS-bonding of cysteine residues in protein sequence . Potential phosphorylation sites of the protein was studied using NetPhos2.0 . Glycosylation sites were predicted using NetNGlyc server (http://www.cbs.dtu.dk/services/NetNGlyc/) that is provided by Centre for Biological Sequence Analysis, Technical University of Denmark (CBS DTU). Location of signal peptide cleavage sites was predicted using Signal P-4.1 . Psite is a protein domain database for functional annotation and description of protein sequences which. Motifs in the lysyl oxidase amino acid sequences were predicted using Psite software . ProtComp 9.0 was used to identify the sub-cellular localization of protein . Inter-ProScan (https://www.ebi.ac.uk/interpro/) functionally characterizes proteins by identifying protein families, domains, and functional sites .
Protein-protein interaction study
STRINGv10.0 web server (http://string-db.org) was used to predict the interaction of lysyl oxidase protein with other closely allied proteins. Canine lysyl oxidase was chosen as the query sequence and a protein-protein interaction network was generated .
Ethics approval and consent to participate
Retrieval of nucleotides and protein
A total of 57 lysyl oxidase sequences from different species were retrieved for phylogenetic analysis. Canine lysyl oxidase protein sequence was retrieved from UNIPROT and used for studying its physicochemical properties, functional analysis, protein interactions, secondary and tertiary structures using various computational tools and servers.
Phylogenetic tree depicts the formation of different clads on the basis of the evolutionary changes between sequences. Higher bootstrap values shows the higher consistency of the given data. The phylogenetic tree was constructed by subjecting 57 nucleotide sequences to maximum likelihood method (MEGA 6.0) with 1000 bootstrapping resampling. The phylogenetic tree showed that sequences belonging to the same order and family formed different clads (Fig. 1). The results showed that the gene came from the common ancestry root but diverged into different clads in the course of evolution. Ruminant lysyl oxidase (cattle, American bison, zebu, and water buffalo) are forming an independent clad and clusters away from the canine lysyl oxidase sequence. Leopard lysyl oxidase sequence was found closest to canine sequence followed by cat. The lysyl oxidase sequence of sea otter, ferret giant panda, pacific walrus, Weddell seal, and Hawaiian monk though, forming an independent clad but are quite similar to canine lysyl oxidase. Among equines, horse and donkey lysyl oxidase sequence clustered very much near to canines and are closely related. Human lysyl oxidase sequence was found to be less similar to the canine sequence. Canine lysyl oxidase sequence showed maximum divergence from camels (Arabian camel, Bactrian camel, and wild Bactrian camel). Chicken also showed divergence away from the canine sequence. To infer the evolutionary history of LOX proteins from eukaryotes, bacteria, and archaea, researchers have surveyed a wide selection of genomes in the past. A pre-metazoan source of this family has been reported so far .
Primary structural analysis of lysyl oxidase
The amino acid composition (Table 1) and physicochemical properties (Table 2) of lysyl oxidase protein were assessed using ExPASy ProtParam server. The protein has 409 amino acids. Alanine is the most abundant amino acid present and proline; tyrosine are the next abundant amino acids present predominantly. The presence of aspartic acid in proteins is vital as it interacts with the solvent which further stabilizes the protein’s 3D structure.
The average molecular weight of the protein was around 46 kDa. The state of a solution where the amino acid produces the identical amount of positive and negative charges and thus, an ultimate zero charge .The isoelectric point of lysyl oxidase was found to be 8.84 which suggests that the given protein sequence seemed mildly alkaline. In isoelectric focusing method, this computed pI will be supportive for developing buffer system for purification. The instability index is computed to 50.34 which classifies the lysyl oxidase protein as unstable. The extinction coefficients was 88295 and the aliphatic index was 56.92. Relative volume of a protein occupied by it aliphatic side chains (alanine, isoleucine, leucine, and valine) is denoted by aliphatic index. The higher the aliphatic index, the higher will be the stability of the protein . The grand average of hydropathy (GRAVY) value indicates the solubility of proteins and was found to be −0.757. The lesser the value is, the more superior the interaction takes place between protein with water . The expected half-life was about 30 h.
Secondary structure characterization
Using the PSIPRED online tool, the secondary protein structure of canine lysyl oxidase gene was determined. Eight percent of total amino acids contributed to helix, 73% to coils, and 18% to strands (Fig. 2). This shows that coil dominated among the secondary structure elements followed by alpha helix. The dominant coiled structural content might be due to the presence of proline amino acid (hydrophobic). Proline has a special property of disrupting structured secondary structure by creating kinks in the polypeptide chains, thus resulting in coiling. Solvent accessibility can further aid in providing useful insights about the sequence and structure relationship. A total of 53% were predicted as buried, 19% medium, and 25% were exposed; about 41% positions were predicted as disordered. Further, 38% of total amino acids were predicted as non-polar, 34% polar, 14% hydrophobic, and 14% contributed to aromatic plus cysteine (Fig. 3). Human and rat lysyl oxidase propeptide has been predicted to contain more than 80% disordered residues . As predicted by DNASTAR, multiple peaks in the antigenic index contributes to a potential antigenic determinant of lysyl oxidase protein (Fig. 4).
Tertiary structure prediction
Homology modeling was carried out to predict the 3-D structure of lysyl oxidase protein, since there is no experimental data available in the protein data bank. Lysyl oxidase 3-D structure model was generated by RaptorX online software and SWISS MODEL that works by selecting the best template for modeling (Fig. 5). Human LOX homolog 2 (5ze3.1.A) was used a modeling template with a significant similarity with the query sequence. The predicted oligo state of the protein model was monomer. Based on QMEAN score and Z score, a good quality model was selected (Fig. 6). The model was further validated by Ramachandran plot which concluded that 94.2% of amino acids were in favored and 2.23% were outliers (Fig. 7). More than 90% residues in favored region are attributes of a good quality model . Similar type of in silico homology modeling has also been reported for human lysyl oxidase protein .
A total of 12 cysteine residues at positions 16, 21, 230, 236, 283, 316, 322, 332, 343, 353, 390, 404 were found in the lysyl oxidase protein using the CYC_REC tool and forms at least 1 disulfide bridge. Cysteine residues are vital for protein’s thermostability while the disulfide bonds are important in folding of protein. Serine , threonine (5 ), and tyrosine  are predicted as potential phosphorylation sites. NetNGlyc online tool predicted two N-glycosylation sites at 96th and 136th position with high confidence. The SignalP 5.0 server predicts the incidence of signal peptides and the position of their cleavage sites. The likelihood of the signal peptide was around 0.9505 and the location of peptide cleavage site between position 21 and 22 was also predicted (Fig. 8). Through Psite software, it was found that the protein sequence has N-myristoylation site the maximum number of times. Also, a putative copper-binding region signature was predicted in the lysyl oxidase protein sequence at position 278-291. Copper atom exists within an octahedral coordination complex along with three histidine residues within the enzyme’s central region . Other motif regions are summarized in (Table 3). ProtComp 9.0 revealed that the sub-cellular localization of the protein was extracellular (secreted). Functional analysis by InterproScan predicted protein’s biological role in oxidation-reduction process. Further, it has a copper ion binding and an oxidoreductase molecular activity.
Protein-protein interaction study
Protein interaction network resolved by STRING web server revealed 10 potential interacting protein associates (Fig. 9) based on various network parameters like text mining, gene fusion, co-occurrence, co-expression, neighborhood, and databases. A node indicates a protein while as a connecting edge represents their interaction. The closest interacting protein having the shortest node was found elastin while the distant interacting protein was lysyl oxidase-like 3 and microfibrillar-associated protein 5. Potential interacting protein associates with canine lysyl oxidase protein are listed in (Fig. 10). Lysyl oxidase propeptide has been associated with interact with elastin, an extracellular protein promoting deposition onto elastic fibers . Also, secreted LOX (proenzyme) is activated by bone morphogenetic protein 1 (BMP-1), releasing the mature catalytic domain and its N-terminal propeptide . Proteins generally function by interacting with other proteins forming protein complexes and networks. Elucidating these complex protein interactions will give important clues as to the function of novel proteins that govern the cell behavior.
Lysyl oxidase is a matrix remodeling enzyme which plays a vital role both inside and outside the cells contributing to cell matrix interactions, extracellular matrix assembly and organization. However, its aberrant expression (either upregulation or downregulation) in various pathological and physiological conditions is still being investigated. LOX is also being studied as a target for cancer metastasis owing to its incidence in various cancers. Thus, understanding the structural and functional properties of the protein will further facilitate a better understanding of its mechanism of enzyme action. Due to the non-availability of the crystal structure, studying the in silico structure-function aspects of the protein appears to be the moonlight in the dark. In this study, a flexible, unstable, hydrophilic, and extracellular protein with a molecular weight of 46 kDa was found. Functional motifs in the protein were also predicted along with a putative copper-binding region. Copper acts as a cofactor and a determinant of enzyme activity in the connective tissues. The predicted 3-D structure might help in shedding light on the biological functions. There is a likely prospect that the concerned gene in humans may have evolutionary relationship with that of canines and may be correlated with the cancer progression. The extracellular function of the canine LOX along with their elevated mRNA and protein expression in canine mammary tumors makes LOX a therapeutic target for diagnosis of mammary tumors. Targeting LOX in canine cancer is an exciting prospect for the development of drugs that could prevent cancer metastasis and progression. Thus, it becomes imperative to use bioinformatic tools to understand the relationship. This will help both veterinarians as well as medical experts in providing basic and concrete information regarding its diagnosis and treatment.
Availability of data and materials
All data generated or analyzed during this study are included in this published article.
Lysyl oxidase protein
- lox :
Lysyl oxidase gene
Occipital horn syndrome
Basic Local Alignment Search Tool
Molecular Evolutionary Genetic
Expert Protein Analysis System
Swiss Institute of Bioinformatics
Unweighted pair group method
- II :
Grand average hydropathicity
Kuivaniemi H, Ala-Kokko L, Kivirikko KI (1986) Secretion of lysyl oxidase by cultured human skin fibroblasts and effects of monensin, nigericin, tunicamycin and colchicine. Biochim Biophys Acta 883:326–334
Reiser K, McCormick RJ, Rucker RB (1992) Enzymatic and non enzymatic crosslinking of collagen and elastin. FASEB J 6:2439–2449
Mäki JM, Räsänen J, Tikkanen H, Sormunen R, Mäkikallio K, Kivirikko KI, Soininen R (2002) Inactivation of the lysyl oxidase gene Lox leads to aortic aneurysms, cardiovascular dysfunction, and perinatal death in mice. Circulation 106:2503–2509
Saleem A, Singh S, Sunil Kumar BV, Arora JS, Choudhary RK (2019) Analysis of lysyl oxidase as a marker for diagnosis of canine mammary tumors. Mol Biol Rep 46:4909–4919
Gacheru SN, Trackman PC, Shah MA, O'Gara CY, Spacciapoli P, Greenaway FT, Kagan HM (1990) Structural and catalytic properties of copper in lysyl oxidase. J Biol Chem 265:19022–19027
Grau BX, Ruiz TI, Rodriguez PF (2015) Origin and evolution of lysyl oxidases. Sci Rep 5:10568
Erler JT, Bennewith KL, Nicolau M, Dornhöfer N, Kong C, Le QT, Chi JT, Jeffrey SS, Giaccia AJ (2006) Lysyl oxidase is essential for hypoxia-induced metastasis. Nature. 440:1222–1226
Giampuzzi M, Oleggini R, Di Donato A (2003) Demonstration of in vitro interaction between tumor suppressor lysyl oxidase and histones H1 and H2: definition of the regions involved. Biochim Biophys Acta 1647:245–251
Lucero HA, Ravid K, Grimsby JL, Rich CB, DiCamillo SJ, Mäki JM, Myllyharju J, Kagan HM (2008) Lysyl oxidase oxidizes cell membrane proteins and enhances the chemotactic response of vascular smooth muscle cells. J Biol Chem 283:24103–24117
Barker HE, Cox TR, Erler JT (2012) The rationale for targeting the LOX family in cancer. Nat Rev Cancer 12:540–552
Siddikuzzaman, Grace VM, Guruvayoorappan C (2011) Lysyl oxidase: a potential target for cancer therapy. Inflammopharmacology 19:117–129
Smith LI, Kagan HM (1998) Lysyl oxidase: properties, regulation and multiple functions in biology. Matrix Biol 16:387–398
Kaler SG (1998) Metabolic and molecular bases of Menkes disease and occipital horn syndrome. Pediatr Dev Pathol 1:85–98
Kumar S, Stecher G, Li M, Knyaz C, Tamura K (2018) MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol 35:1547–1549
Gasteiger E, Hoogland C, Gattiker A, Duvaud S, Wilkins MR, Appel RD, Bairoch A (2005) Protein identification and analysis tools on the ExPASy Server The Proteomics Protocols Handbook. Humana Press, pp 571–607
Gill SC, Hippel PHV (1989) Calculation of protein extinction coefficient from amino acid sequence data. Anal Biochem 182:319–326
Guruprasad K, Reddy BVB, Pandit MW (1990) Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng 4:55–161
Ikai A (1980) Thermostability and aliphatic index of globular proteins. J Biochem 88:1895–1898
Kyte J, Doolittle RF (1982) A simple method for displaying the hydropathic character of a protein. J Mol Biol 157:105–132
Gonda DK, Bachmair A, Wunning I, Tobias JW, Lane WS, Varshavsky A (1989) A Universality and structure of the N-end rule. J Biol Chem 264:16700–16712
Jameson BA, Wolf H (1988) The antigenic index: a novel algorithm for predicting antigenic determinants. Bioinformatics 4:181–186
Burland TG (2000) DNASTAR’s Lasergene sequence analysis software. Methods Mol Biol 132:71–91
CYS_REC: The program for predicting SS-bonding states of cysteines and disulphide bridges in protein sequences. http://www.softberry.com/berry.phtml?topic=cys_rec&group=programs&subgroup=propt
Blom N, Gammeltoft S, Brunak S (1999) Sequence- and structure-based prediction of eukaryotic protein phosphorylation sites. J Mol Biol 294:1351–1362
Peterson TN, Brunak S, Heijne G, Nielsen H (2011) SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods 8:785–786
Solovyev VV, Kolchanov NA (1994) Search for functional sites using consensus. In: Computer analysis of Genetic macromolecules. World Scientific, pp 16–21
ProtComp - Version 9: Program for Identification of sub-cellular localization of Eukaryotic proteins : Animal/Fungi. http://www.softberry.com/berry.phtml?topic=protcompan&group=programs&subgroup=proloc
Jones P, Binns D, Chang HY (2014) InterProScan 5: genome-scale protein function classification. Bioinformatics 30:1236–1240
Szklarczyk D, Morris JH, Cook H, Kuhn M, Wyder S (2015) STRINGv10.0: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res 43:447–452
Botto M, Hawkins PN, Bickerstaff MC (1997) Amyloid deposition is delayed in mice with targeted deletion of the serum amyloid P component gene. Nat Med 3:855–859
Roan NR, Müller JA, Liu H, Chu S, Arnold F, Stürzel CM, Walther P, Dong M, Witkowska HE, Kirchhoff F, Münch J, Greene WC (2011) Peptides released by physiological cleavage of semen coagulum proteins form amyloids that enhance HIV infection. Cell Host Microbe 10:541–550
Vallet S, Miele A, Uciechowska-Kaczmarzyk U (2018) Insights into the structure and dynamics of lysyl oxidase propeptide, a flexible protein with numerous partners. Sci Rep 8:11768
Pramanik K, Ghosh PK, Ray S, Sarkar A, Mitra S, Maiti TK (2017) An in silico structural, functional and phylogenetic analysis with three dimensional protein modeling of alkaline phosphatase enzyme of Pseudomonas aeruginosa. J Genet Eng Biotechnol 15:527–537
Mishra S, Kumar P, Singh S (2017) Structural analysis of protein lysyl oxidase: modeling and simulation study. J Biotech Res 8:9–17
Krebs CJ, Krawetz SA (1993) Lysyl oxidase copper-talon complex : A model. Biochim Biophys Acta 1202:7–12
Thomassin L, Werneck CC, Broekelmann TJ (2005) The pro-regions of lysyl oxidase and lysyl oxidase-like 1 are required for deposition onto elastic fibres. J Biol Chem 280:42848–42855
Borel A, Eichenberger D, Farjanel J (2001) Lysyl oxidase-like protein from bovine aorta. Isolation and maturation to an active form by bone morphogenetic protein-1. J Biol Chem 276:48944–48949
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Saleem, A., Rajput, S. Insights from the in silico structural, functional and phylogenetic characterization of canine lysyl oxidase protein. J Genet Eng Biotechnol 18, 20 (2020). https://doi.org/10.1186/s43141-020-00034-w