Study of lipase producing gene in wheat – an in silico approach

Rani, Shradha; Kumari, Priya; Poddar, Raju; Chattopadhyay, Soham

doi:10.1186/s43141-021-00150-1

Research
Open access
Published: 17 May 2021

Study of lipase producing gene in wheat – an in silico approach

Shradha Rani¹,
Priya Kumari¹,
Raju Poddar¹ &
…
Soham Chattopadhyay ORCID: orcid.org/0000-0002-3797-5333¹

Journal of Genetic Engineering and Biotechnology volume 19, Article number: 73 (2021) Cite this article

1738 Accesses
2 Citations
Metrics details

Abstract

Background

Lipases (EC 3.1.1.3) catalyze the hydrolysis of oil into free fatty acids and glycerol forming the 3rd largest group of commercialized enzymes. Plant lipases grab attention recently because of their specificity, less production and purified cost, and easy availability. In silico approach is the first step to identify different genes coding for lipase in a most common indigenous plant, wheat, to explore the possibility of this plant as an alternative source for commercial lipase production. As the hierarchy organization of genes reflects an ancient process of gene duplication and divergence, many of the theoretical and analytical tools of the phylogenetic systematics can be utilized for comparative genomic studies. Also, in addition to experimental identification and characterization of genes, for computational genomic analysis, Arabidopsis has become a popular strategy to identify crop genes which are economically important, as Arabidopsis genes had been well identified and characterized for lipase. A number of articles had been reported in which genes of wheat have shown strong homology with Arabidopsis. The complete genome sequences of rice and Arabidopsis constitute a valuable resource for comparative genome analysis as they are representatives of the two major evolutionary lineages within the angiosperms. Here, in this in silico approach, Arabidopsis and Oryza sativa serve as models for dicotyledonous and monocotyledonous species, respectively, and the genomic sequence data available was used to identify the lipase genes in wheat.

Results

In this present study, Ensembl Plants database was explored for lipase producing gene present in wheat genome and 21 genes were screened down as they contain specific domain and motif for lipase (GXSXG). According to the evolutionary analysis, it was found that the gene TraesCS5B02G157100, located in 5B chromosome, has 58.35% sequence similarity with the reported lipase gene of Arabidopsis thaliana and gene TraesCS3A02G463500 located in the 3A chromosome has 51.74% sequence similarity with the reported lipase gene of Oryza sativa. Homology modeling was performed using protein sequences coded by aforementioned genes and optimized by molecular dynamic simulations. Further with the help of molecular docking of modeled structures with tributyrin, binding efficiency was checked, and the difference in energies (DE) was −9.83 kcal/mol and −6.67 kcal/mol, respectively.

Conclusions

The present work provides a basic understanding of the gene-encoding lipase in wheat, which could be easily accessible and used as a potent industrial enzyme. The study enlightens another direction which can be used further to explore plant lipases.

Background

Lipases are ubiquitous enzymes, widespread in nature, and it can be of microbial (bacterial, fungal, and yeast), animal, or plant origin. Lipase (EC 3.1.1.3), a member of the lipolytic enzyme family, catalyzes the hydrolysis of the ester bonds of tri-, di-, and monoglycerides into fatty acids and glycerol [6] as described in Fig. 1.

In the active site of the enzyme lipase, the catalytic triad serine, aspartate or glutamate and histidine, is present, in which serine acts as a nucleophile and aspartate or glutamate acts as a catalytic acid residue and forms a hydrogen bond with histidine. Lipases consist of a pentapeptide consensus motif (Gly-X-Ser-X-Gly) [12, 17].

Recently, plant lipases have been the focus of much attention as biocatalysts. It presents advantages over microbial lipases due to specificity, low production cost, availability, and ease of purification [37]. Plant lipases are often present in the reserve tissues of germinating seedlings or in tissues with a large amount of triacylglycerols, where they play an important role in biological reactions such as lipolysis, esterification, and transesterification and thus helping in plant growth and development. In higher plants, triacylglycerols (TAGs) may be in few percentages of total lipids in the leaf tissue but can make up to 60% of the dry weight of oil seeds. Fatty acids can be cleaved off by a lipase and further metabolized in peroxisomes through b-oxidation to yield acetyl-coA [20, 31]. A minor application of the lipase enzyme is that it is used as a diagnostic tool in medicine. Apart from this, it is used in the food, detergent, and pharmaceutical sectors. The variety of lipase applications led to increased research to characterize them and better understand their kinetics and reaction mechanisms and to establish methods for lipase production in homologous and heterologous expression systems. In the worldwide enzyme industry market, the rank of lipases has grown significantly high. It is also believed that in the near future, it will acquire importance as comparable to that of the peptidases, which represent 25 to 40% of industrial enzyme sales [13, 16]. A number of articles have been published, especially concerning the synthesis of seed lipases from barley, linseed, maize, rice, and wheat [3, 4, 28, 30]. Researchers have also studied for optimization of different physicochemical conditions and estimated the lipase activity and purified them.

In India, bread wheat (Triticum aestivum L.) is one of the most widely grown wheat species, occupying 37% of the total cultivated land. Many vitamins, essential amino acids, and proteins are present in wheat germs [14]. Although a number of articles have been reported on the purification and characterization of lipase from wheat, very few reports are available on the lipase gene present in wheat and its biological reaction with substrates [19, 36].

To understand plant lipase in more detail, the major focus of our study is to identify lipase genes from the wheat genome which can be used as a basis for further applied researches. In this present study, we have identified and located genes present in the wheat genome, through in silico study. These identified protein sequences were further modeled and docked to check their biological activity. Further, the discovery of putative lipase gene sequences presents in the wheat using bioinformatics tools has been described.

Methods

Sequence retrieval and analysis

Data available at Ensembl Plants database [5] was mined by using “lipase” as a keyword to search for all the lipase genes present in the annotated Triticum aestivum genome (access date November 10, 2019).

Motif and domain search

By using the Scan Prosite tool [8] found at ExPASy-PROSITE, the motif of lipase genes was analyzed and sequences with the GXSXG lipase motif [38] were selected for further analysis. Domain search was performed for genes having lipase domain using CDD search [23].

Subcellular localization prediction

Subcellular localization prediction for lipase genes was carried out using TargetP 1.1 [11] for all genes having lipase domain.

Phylogenetic analysis

To further confirm the evolutionary relationships, a phylogenetic tree was constructed using sequences containing lipase domain from wheat and one reported lipase sequence from Arabidopsis thaliana and Oryza sativa. The complete genome sequences of Oryza sativa and Arabidopsis constitute a valuable resource for comparative genome analysis as they are representatives of the two major evolutionary lineages within the angiosperms. A number of articles had been reported in which genes of wheat have shown strong homology with Arabidopsis [29]. The phylogenetic tree was constructed using the neighbor-joining (NJ) method in MEGA v.10 [21] under the Jones-Taylor-Thornton amino acid matrix-based model of molecular evolution with uniform rates and pattern.

Multiple sequence alignment

From the phylogenetic tree, sequences neighbor to lipase sequence of Arabidopsis thaliana [24] and Oryza sativa [38] were selected for alignment using Clustal Omega [35] and percent identity was also checked.

Molecular modeling

It was found that protein sequences ensemble gene Id: TraesCS5B02G157100 and TraesCS3A02G463500 have the highest percent identity, and so these sequences were selected for molecular modeling in SWISS-MODEL [39], which is a homology-modeling server. By using the list of 50 templates, 3D models of lipase enzyme were constructed. Once the 3D models of lipase were built, the geometrical aspects of modeled protein structures were evaluated using Qualitative Model Energy Analysis (QMEAN) and models with the highest QMEAN value was selected for further work and the models are referred to as TraesCS5B02G157100 (Ensembl ID UPI0003D5866F) and TraesCS3A02G463500 (Ensembl ID Q8L6B0). Also, the Ramachandran plot for the models was generated for structure validation.

Molecular dynamic simulation

The modeled structures were optimized using GROMACS 5.5 [1], which uses the Steepest Decent algorithm for minimization of the structure. Molecular dynamic (MD) simulation was carried out to understand the conformational behavior, structural details, and stability of protein complexes. Molecular dynamic (MD) simulation consists of an intensive force field calculation for each of the atom in a system, which is followed by an integration step, which advances the dynamical nature and positions of the atoms according to the classical laws of motion. MD simulation was used to unravel the stability analysis of protein complexes [33].

MD simulation of a modeled structure of lipase enzyme from wheat was performed using OPLS-AA force field [18], and all the protocols for the dynamic study were followed. In the TIP3P water model, the simulation was done and solvated in a cubic solvent box with a minimum distance of 1.0 nm. Till force tolerance of 1000 KJ mol⁻¹ nm⁻¹, the minimization of the structure was done. After energy minimization, the protein was equilibrated.

Equilibration is done in two phases. The first phase is NVT ensemble (constant number of particles, Volume, and Temperature), also referred to as “isothermal-isochoric” or “canonical” and it stabilizes the temperature of the system. The second phase is NPT ensemble, wherein the number of particles, pressure, and temperature are all constant, and this ensemble is also called the “isothermal-isobaric” ensemble—also it closely resembles the experimental conditions. Equilibration was done for 300 K temperature for 10 ns. Finally, molecular dynamics simulation was carried out for 50 ns at temperature 300 K. Different molecular dynamics parameters like the root mean square deviation (RMSD), root mean square fluctuation (RMSF), and radius of gyration (R_g) were performed using the GROMACS tool. Origin 6.0 [10] was used for generating the plots.

Principal component analysis

To study the conformational change of proteins induced by inhibitor bindings, the principal component (PC) analysis has been widely used. The principal component analysis is also called the essential dynamics method or quasiharmonic analysis. It is one of the most popular methods as it systematically reduces the dimensionality of a complex system [40]. The principal component analysis (PCA) is mainly used to examine the relationship between different conformers or structures on the basis of their equivalent residues. The resulting principal component, orthogonal eigenvectors describe the axes of the maximal variance of the distribution of structures, and projection of distribution onto the subspace defined by the largest principal components results in a lower dimensional representation of the structural dataset. Also, the percentage of the total mean square displacement or variance of atom positional fluctuations captured in each dimension is represented by their corresponding eigenvalue [15]. In brief, the eigenvector represents the direction of motion of the protein and eigenvalues suggest amplitude of motion [7, 40].

PCA is performed on any high-dimensional dataset, so for the analysis of a protein trajectory, a C-matrix is constructed associated with a selected set of atomic positions. Most of the time, at the residue level, a coarse-grained description of protein motion is made by using the alpha carbon atom, which represents a point for the position of a residue. To get the eigenvectors and eigenvalues, it is mandatory to create the covariance matrix of the C-alpha atom’s fluctuation. The first and last eigenvectors were generated using PyMol tool and presented as a porcupine plot.

The equation which was used for the PCA plot generation is as follows:

$$ {P}_{xy}=\left\langle \left({m}_x(t)-{\left\langle {m}_x\right\rangle}_t\right)\left({m}_y(t)-{\left\langle {m}_y\right\rangle}_t\right)\right\rangle t $$

where, m_x and m_y represent the Cartesian coordinate of the xth atom and yth atom. “t” is the averaged time position of the complete trajectory [2, 21].

Molecular docking

Molecular docking was performed using AutoDock v.4.2 [25]. The optimized structure of the lipase enzyme was retrieved after simulation in pdb format which was utilized for molecular docking. Tributyrin, a triglyceride, is an ester composed of butyric acid, and glycerol was used as a ligand for docking. Tributyrin (CID: 6050) was retrieved from the PubChem Database and was converted into the mol2 format using the Open Babel tool [27]. In AutoDock tool, the protein file as well as the ligand file was converted into PDBQT structure format for the docking process. All the necessary information required for AutoDock was stored in the PDBQT file. To the protein Kollman, charges were added and for protein as well as ligand Gasteiger, partial charges were kept constant during the process. During the whole docking process, phi, psi, and chi angles were treated as rotatable bonds. The docking grid box of dimension 60Å×60Å×60Å was made, covering the entire binding region for TraesCS5B02G157100 and similarly for TraesCS3A02G463500, 60Å×60Å×60Å grid box was made. The genetic algorithm was used for the conformational search strategy which was applied to the ligand as well as protein. The lamarckian genetic algorithm was used to study the free energy changes upon binding.

Presentation and analysis software

UCSF Chimera v.1.13.1 [32] and PyMOL software [9] were used to visualize and analyze ligand–protein interactions. PyMOL was employed for better illustration of ligand–protein interactions for further analysis.

Results and discussion

Sequence retrieval and analysis

Data available at Ensembl Plants database was mined by using “lipase” as a keyword to search for all the lipase genes present in the annotated Triticum aestivum genome, and a list of 133 genes was obtained. Among the 133 genes initially retrieved, 62 genes code for lipoxygenase; therefore, these genes are not selected for further study. Lipoxygenases (LOXs; EC 1.13.11.12) are non-heme iron-containing dioxygenases widely distributed in plants and animals [34]. The remaining 71 genes were selected for further study.

Motif and domain search

The amino acid motif, GXSXG, is commonly found in lipases. A lipase motif search analysis was performed for the proteins encoded by the remaining 71 lipase genes. The lipase motif was not encoded in 37 lipase genes, and these therefore not included for further study. The remaining 34 genes encoded proteins were found to consist of lipase motif in their deduced amino acid sequences. In CDD (Conserved Domain Database) search, it was found that out of 34 genes, 21 genes had a domain for lipase. CDD is a database having annotation for proteins, and it consists of multiple sequence alignment models for domains and full-length proteins.

Subcellular localization prediction

Sub-cellular localization prediction was carried out using TargetP 1.1 for 21 genes (Table 1).

Table 1 Sub-cellular localization prediction of all the lipase genes

Full size table

On the basis of subcellular localization prediction, only one gene, i.e., UPI0008448C8C, is not reported in the secretory pathway. The reliability class ranges from 1 to 5, where 1 denotes the strongest prediction and vice versa. Reliability class is a measure of difference (“diff”) between the highest and the second highest output scores. The lower the value of RC, the safer is the prediction which can be observed from Table 1.

Phylogenetic analysis

From the phylogenetic tree presented in Fig. 2, it can be predicted that wheat has a strong evolutionary relationship with Arabidopsis thaliana and Oryza sativa. Ensembl ID UPI0008425792 (uniport ID A0A1D5YD83), UPI000843C42A (uniport ID A0A1D5YD84), and UPI0003D5866F (uniport number W5FG08) were neighbors to the reported lipase sequence of Arabidopsis thaliana, and uniport number Q8L6B0 is neighbor to the reported lipase sequence of Oryza sativa [17].

Multiple sequence alignment

MSA was done in Clustal Omega (refer to S1 and S2), and percent identity was also checked (Tables 2 and 3). Sequence UPI0003D5866F has 58.35% similarity with the reported lipase sequence of Arabidopsis thaliana (uniport number sp|Q71DJ5). Sequence Q8L6B0 has 51.74% with Oryza sativa gene (uniport number Q2R077). Ensembl ID UPI0003D5866F is coded by Gene ID TraesCS5B02G157100, and Ensembl ID Q8L6B0 is coded by TraesCS3A02G463500.

Table 2 Percent identity matrix (created by Clustal 2.1)

Full size table

Table 3 Percent identity matrix (created by Clustal 2.1)

Full size table

Molecular modeling

For the ensemble gene Id TraesCS5B02G157100 and TraesCS3A02G463500, the molecular modeling was performed. The crystal structure of dog gastric lipase in complex with a phosphonate inhibitor (PDB id: 1k8q) having 30.14% sequence identity with TraesCS5B02G157100 was selected as a template to generate the 3D model (Fig. 3). QMEAN score was −4.37. Also, the modeled structure was validated by predicting the Ramachandran plot which indicated 90.44% is in the favored region (Fig. 3). MolProbity score is a combined protein quality score that gives the idea of crystallographic resolution at which such quality would be expected. In an ideal case, it should be as low as possible. MolProbity score for TraesCS5B02G157100 is 2.33 (Table 4) and for TraesCS3A02G463500 is 2.31 (Table 4) [24, 29].

Table 4 Table showing MolProbity results obtained after Ramachandran analysis

Full size table

Similarly, the crystal structure of Rhizomucor miehei triacylglyceride lipase (PDB id: 3TGL) of 30.95% sequence identity with wheat gene was selected as the template for the TraesCS3A02G463500 sequence extracted from Ensembl database to generate the 3D model (Fig. 4). QMEAN score was −2.18. Also, the modeled structure was validated by predicting Ramachandran plot which indicates 89.89% in favored region (Fig. 4) followed by a detailed analysis listed in Table 5. Thus, the predicted modeled structure can be considered of good quality, and it was used for molecular docking studies with triglycerides as a ligand.

Table 5 Table showing MolProbity results obtained after Ramachandran analysis

Full size table

Molecular dynamic simulation

The stability and analysis of the native protein structure were performed by surrounding them into a cubical box at a temperature of 300 K that was maintained computationally. Various computational analyses were carried out to evaluate the stability of the system. The stability of lipase enzyme from wheat’s gene was evaluated with several time-dependent structural parameters (like RMSD, RMSF, R_g) which was obtained from the 50-ns molecular dynamics simulation [22].

RMSD

The root mean square deviation (RMSD) is used to measure the difference between the backbones of a protein structure, from its initial structural conformation to its final position. From the deviations produced during the course of its simulation, the stability of the protein structure, relative to its conformation can be determined. Smaller deviations indicate a more stable protein structure [33]. Figure 5a, b shows the plot of RMSD vs. time (ns) for TraesCS5B02G157100 and TraesCS3A02G463500, respectively. Fluctuation can be observed with an average value of 0.25 nm, which is due to the flexibility of several loops and α-helix. The RMSD value fluctuates between 0.15 nm and 0.32 nm with an average of 0.23 nm.

RMSF

The root mean square fluctuation (RMSF) plots provide us information about the flexible regions of the protein complexes. In proteins, helical and sheet structures show lower RMS fluctuation as compared to the loop, turns, and coils. The lower RMSF value indicates the well-structured regions, whereas the higher RMSF value indicates loosely organized loop or terminal ends. The RMSF plot for lipase enzyme model coded by TraesCS5B02G157100 (Fig. 6a) and TraesCS3A02G463500 (Fig. 6b) is presented. All of the RMSF values were small and below 1nm, indicating stability of the protein [26]. The RMSF value for the Cα backbone was also calculated for 50 ns simulation in order to evaluate the stability of the structure mainly Cα atoms. The RMSF value for the Cα residue plot for TraesCS5B02G157100 and TraesCS3A02G463500 is presented in Fig. 7a ,b, respectively.

Radius of gyration (R _g)

The radius of gyration (R_g) was determined to understand the level of compaction in the structure of the enzyme. The R_g value is assigned as the mass-weighted RMSD fit of a collection of atoms from their common center of mass [33]. Figure 8 describes the compactness of the structure during a complete simulation. The R_g plot for the lipase enzyme model coded by TraesCS5B02G157100 and TraesCS3A02G463500 is shown in Fig. 8a, 8b, respectively.

Principal component analysis

In this current study, PC analysis was carried out to get the detailed insight into the concerted motions of lipase enzyme based on the equilibrium phase of MD simulations. In order to qualitatively understand the differences in a motional pattern, a porcupine plot was generated by performing the extreme projections of MD trajectories. To visualize the movement of the backbone, the “Mode vectors” present in the PyMol software were used and a porcupine plot was generated (Fig. 9) by aligning the trajectories over the original protein structure. The direction of the arrow (red color) is indicative of the direction of motions, and the length of the arrow reflects the strength of the movements.

Molecular docking

The optimized modeled structure obtained from MD simulation was further used for molecular docking with tributyrin (triglyceride, CID: 6050). Docking results predicted extensive interactions between ligand and catalytic site. Protein–ligand complex structures with the lowest docking binding energy were selected. The binding energy for tributyrin with a modeled structure of a lipase enzyme coded by wheat TraesCS5B02G157100 (catalytic site: 181–185, GHSQG) and TraesCS3A02G463500 (catalytic site: 173–177, GHSMG) were −9.83 kcal/mol and −6.67 kcal/mol, respectively (Table 6). The interaction of the ligand with the enzyme was observed in UCSF Chimera v.1.13.1 which is shown in Fig. 10.

Table 6 Binding affinity of protein with tributyrin

Full size table

The H-bond distance between the ligand and catalytic triad was explored using the Pymol visualization tool. The modeled structure of lipase enzyme coded by TraesCS5B02G157100 is attached with the ligand by hydrogen bond of distance 2.2 Å (Fig. 11), and lipase enzyme coded by TraesCS3A02G463500 has hydrogen bond distance of 2.6 Å with the ligand (Fig. 12).

Conclusion

Data available at Ensembl Plants database was mined to search for all the lipase genes from annotated Triticum aestivum genome and a list of lipase genes was obtained. Among the list of genes retrieved excluding genes code for lipoxygenase, a motif finding and CDD search were performed to search for the GXSXG motif. Sub-cellular localization prediction was carried out, and a total of 21 lipase genes were finally screened down which was found to be present in a secretory pathway. Further evolutionary relationship predicted ensemble genes ID TraesCS5B02G157100 and TraesCS3A02G463500 may have a strong evolutionary relationship with Arabidopsis thaliana and Oryza sativa. Thus, these sequences were modeled and docked with tributyrin, and binding efficiency of −9.83 kcal/mol and −6.67 kcal/mol, respectively, was observed. Several analysis methods were employed for trajectory analysis, including RMSD, RMSF, R_g calculation, interaction energy calculation, and PC analysis. Both the protein sequences are expected as a signal peptide, i.e., they are involved in the secretory pathway. So they can be easily purified and can be further used for research work. This work provides us a basic understanding of the gene encoding lipase in the wheat genome.

Availability of data and materials

The data and materials are original and available.

Abbreviations

EC:: Enzyme commission
QMEAN:: Qualitative Model Energy Analysis
RMSD:: Root mean square deviation
RMSF:: Root mean square fluctuation
MD:: Molecular dynamic
CDD:: Conserved Domain Database

References

Abraham MJ, Murtola T, Schulz R et al (2015) GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1:19–25
Article Google Scholar
Amadei A, Linssen AB, Berendsen HJ (1993) Essential dynamics of proteins. Proteins 17(4):412–425. https://doi.org/10.1002/prot.340170408
Article Google Scholar
Barros M, Fleuri LF, Macedo GA (2010) Seed lipases: sources, applications and properties-a review. Braz J Chem Eng 27(1):15–29. https://doi.org/10.1590/S0104-66322010000100002
Article Google Scholar
Bhardwaj K, Raju A, Rajasekharan R (2001) Identification, purification, and characterization of a thermally stable lipase from rice bran. A new member of the (phospho) lipase family. Plant Physiol 127(4):1728–1738. https://doi.org/10.1104/pp.010604
Article Google Scholar
Bolser D, Staines DM, Pritchard E et al (2016) Ensembl plants: integrating tools for visualizing, mining, and analyzing plant genomics data. Plant Bioinform:115–140. https://doi.org/10.1007/978-1-4939-3167-5_6
Casas-Godoy L, Duquesne S, Bordes F et al (2012) Lipases: an overview. Lipases Phospholipases:3–30. https://doi.org/10.1007/978-1-61779-600-5_1
Chen J, Wang J, Zhu W (2017) Zinc ion-induced conformational changes in new Delphi metallo-β-lactamase 1 probed by molecular dynamics simulations and umbrella sampling. Phys Chem Chem Phys 19(4):3067–3075. https://doi.org/10.1039/C6CP08105C
Article Google Scholar
De Castro E, Sigrist CJ, Gattiker A et al (2006) ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins. Nucleic Acids Res 34(suppl_2):362–365
Article Google Scholar
DeLano WL (2002) Pymol: an open-source molecular graphics tool. CCP4 Newsletter Protein Crystallograph 40(1):82–92
Google Scholar
Deschenes, L.A. and David A. Vanden Bout University of Texas, Austin, 2000. Origin 6.0: Scientific Data Analysis and Graphing Software Origin Lab Corporation (formerly Microcal Software, Inc.). Web site: www.originlab.com. Commercial price: 595.Academicprice: 446.
Emanuelsson O, Brunak S, Von Heijne G et al (2007) Locating proteins in the cell using TargetP, SignalP and related tools. Nat Protocols 2(4):953–971. https://doi.org/10.1038/nprot.2007.131
Article Google Scholar
Eze SO, Chilaka FC, Akunwata CU (2007) Properties of lipase (EC 3.1. 1.3) from different varieties of maize. Anim Res Int 4(2):650–652
Google Scholar
Gerits LR, Pareyt B, Decamps K et al (2014) Lipases and their functionality in the production of wheat-based food systems. Comprehens Rev Food Sci Food Saf 13(5):978–989. https://doi.org/10.1111/1541-4337.12085
Article Google Scholar
Gill BS, Appels R, Botha-Oberholster AM (2004) A workshop report on wheat genome sequencing: International Genome Research on Wheat Consortium. Genetics 168(2):1087–1096. https://doi.org/10.1534/genetics.104.034769
Article Google Scholar
Grant BJ, Rodrigues AP, ElSawy KM, McCammon JA, Caves LS (2006) Bio3d: an R package for the comparative analysis of protein structures. Bioinformatics 22(21):2695–2696. https://doi.org/10.1093/bioinformatics/btl461
Article Google Scholar
Hasan F, Shah AA, Hameed A (2006) Industrial applications of microbial lipases. Enzyme Microbial Technol 39(2):235–251. https://doi.org/10.1016/j.enzmictec.2005.10.016
Article Google Scholar
Jiang Y, Chen R, Dong J et al (2012) Analysis of GDSL lipase (GLIP) family genes in rice (Oryza sativa). Plant Omics 5(4):351
Google Scholar
Jorgensen WL, Maxwell DS, Tirado-Rives J (1996) Development and testing of the OPLS all-atom force field on conformational energetics and properties of organic liquids. J Am Chem Soc 118(45):11225–11236. https://doi.org/10.1021/ja9621760
Article Google Scholar
Kapranchikov VS, Zherebtsov NA, Popova TN (2004) Purification and characterization of lipase from wheat (Triticum aestivum L.) germ. Appl Biochem Microbiol 40(1):84–88. https://doi.org/10.1023/B:ABIM.0000010360.46824.56
Article Google Scholar
Kotera M, Goto S (2016) Metabolic pathway reconstruction strategies for central metabolism and natural product biosynthesis. Biophys Physicobiol 13(0):195–205. https://doi.org/10.2142/biophysico.13.0_195
Article Google Scholar
Kumar S, Stecher G, Li M et al (2018) MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol 35(6):1547–1549. https://doi.org/10.1093/molbev/msy096
Article Google Scholar
Kumari P, Poddar R (2019) A comparative multivariate analysis of nitrilase enzymes: an ensemble based computational approach. Comput Biol Chem 83:107095. https://doi.org/10.1016/j.compbiolchem.2019.107095
Article Google Scholar
Marchler-Bauer, A., Lu, S., Anderson, J.B., et al. 2010. CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res, 39(suppl_1); 225-229
Messaoudi A, Belguith H, Hamida JB (2011) Three-dimensional structure of Arabidopsis thaliana lipase predicted by homology modeling method. Evolut Bioinform 7:7122
Article Google Scholar
Morris GM, Huey R, Lindstrom W et al (2009) AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J Comput Chem 30(16):2785–2791. https://doi.org/10.1002/jcc.21256
Article Google Scholar
Nezafat N, Karimi Z, Eslami M et al (2016) Designing an efficient multi-epitope peptide vaccine against Vibrio cholerae via combined immunoinformatics and protein interaction based approaches. Comput Biol Chem 62:82–95. https://doi.org/10.1016/j.compbiolchem.2016.04.006
Article Google Scholar
O’Boyle NM, Banck M, James CA et al (2011) Open Babel: an open chemical toolbox. J Cheminform 3(1):33. https://doi.org/10.1186/1758-2946-3-33
Article Google Scholar
O'Connor J, Perry HJ, Harwood JL (1992) A comparison of lipase activity in various cereal grains. J Cereal Sci 16(2):153–163. https://doi.org/10.1016/S0733-5210(09)80147-1
Article Google Scholar
Padaria, J.C., Bhatt, D., Biswas, K., Singh, G. and Raipuria, R., 2013. In-silico prediction of an uncharacterized protein generated from heat responsive SSH library in wheat ('Triticum aestivum'L.). Plant Omics, 6(2).
Pérez MM, Gonçalves ECS, Vici AC et al (2019) Fungal lipases: versatile tools for white biotechnology. Recent Adv White Biotechnol Through Fungi:361–404. https://doi.org/10.1007/978-3-030-10480-1_11
Pett LB (1935) Studies on the distribution of enzymes in dormant and germinating wheat seeds: dipeptidase and protease. II. Lipase. Biochem J 29(8):1898
Article Google Scholar
Pettersen EF, Goddard TD, Huang CC et al (2004) UCSF Chimera—a visualization system for exploratory research and analysis. J Comput Chem 25(13):1605–1612. https://doi.org/10.1002/jcc.20084
Article Google Scholar
Shukla R, Shukla H, Sonkar A et al (2018) Structure-based screening and molecular dynamics simulations offer novel natural compounds as potential inhibitors of Mycobacterium tuberculosis isocitrate lyase. J Biomol Struct Dynamics 36(8):2045–2057. https://doi.org/10.1080/07391102.2017.1341337
Article Google Scholar
Siedow JN (1991) Plant lipoxygenase: structure and function. Ann Rev Plant Biol 42(1):145–188. https://doi.org/10.1146/annurev.pp.42.060191.001045
Article Google Scholar
Sievers F, Wilm A, Dineen D et al (2011) Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7(1):539. https://doi.org/10.1038/msb.2011.75
Article Google Scholar
Tavener RJA, Laidman DL (1972) The induction of lipase activity in the germinating wheat grain. Phytochemistry 11(3):989–997
Article Google Scholar
Thomson CA, Delaquis PJ, Mazza G (1999) Detection and measurement of microbial lipase activity: a review. Crit Rev Food Sci Nutr 39(2):165–187. https://doi.org/10.1080/10408399908500492
Article Google Scholar
Tiwari GJ, Chiang MY, De Silva JR et al (2016) Lipase genes expressed in rice bran: LOC_Os11g43510 encodes a novel rice lipase. J Cereal Sci 71:43–52. https://doi.org/10.1016/j.jcs.2016.07.008
Article Google Scholar
Waterhouse A, Bertoni M, Bienert S et al (2018) SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res 46(W1):296–303
Article Google Scholar
Yan F, Liu X, Zhang S, Su J, Zhang Q, Chen J (2018) Molecular dynamics exploration of selectivity of dual inhibitors 5M7, 65X, and 65Z toward fatty acid binding proteins 4 and 5. Int J Mol Sci 19(9):2496
Article Google Scholar

Download references

Acknowledgements

The authors are thankful to the DBT, Sub-Distributed Information Center (BTISnet SubDIC; BT/BI/04/065/04), Govt. of India, and Department of Bio-Engineering, Birla Institute of Technology, Mesra, for the support and for providing essential facilities.

Funding

DBT, Sub-Distributed Information Center (BTISnet SubDIC; BT/BI/04/065/04), Govt. of India

Author information

Authors and Affiliations

Department of Bio-Engineering, Birla Institute of Technology, Mesra, Ranchi, 835215, India
Shradha Rani, Priya Kumari, Raju Poddar & Soham Chattopadhyay

Authors

Shradha Rani
View author publications
You can also search for this author in PubMed Google Scholar
Priya Kumari
View author publications
You can also search for this author in PubMed Google Scholar
Raju Poddar
View author publications
You can also search for this author in PubMed Google Scholar
Soham Chattopadhyay
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

SR and PK performed the experiments, analysis, and manuscript writing. SC involved in the conceptualization and manuscript correction. RP involved in the overall monitoring and analysis of the data. The authors have read and approved the manuscript.

Corresponding author

Correspondence to Soham Chattopadhyay.

Ethics declarations

Ethics approval and consent to participate

Not applicable

Consent for publication

Not applicable.

Competing interests

The authors declare that no competing financial interests exist.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Rani, S., Kumari, P., Poddar, R. et al. Study of lipase producing gene in wheat – an in silico approach. J Genet Eng Biotechnol 19, 73 (2021). https://doi.org/10.1186/s43141-021-00150-1

Download citation

Received: 28 December 2020
Accepted: 18 March 2021
Published: 17 May 2021
DOI: https://doi.org/10.1186/s43141-021-00150-1

Study of lipase producing gene in wheat – an in silico approach

Abstract

Background

Results

Conclusions

Background

Methods

Sequence retrieval and analysis

Motif and domain search

Subcellular localization prediction

Phylogenetic analysis

Multiple sequence alignment

Molecular modeling

Molecular dynamic simulation

Principal component analysis

Molecular docking

Presentation and analysis software

Results and discussion

Sequence retrieval and analysis

Motif and domain search

Subcellular localization prediction

Phylogenetic analysis

Multiple sequence alignment

Molecular modeling

Molecular dynamic simulation

RMSD

RMSF

Radius of gyration (R g)

Principal component analysis

Molecular docking

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Radius of gyration (R _g)