Insights into the genes involved in the ethylene biosynthesis pathway in Arabidopsis thaliana and Oryza sativa

Background: Ethylene is a gaseous plant hormone that plays an essential role in many aspects of the plant life cycle and also regulates plant responses to abiotic and biotic stresses. In this study, we used bioinformatics tools to analyze existing data and compare the identified ethylene biosynthesis genes of Arabidopsis (a dicotyledon) and rice (a monocotyledon).

Results: The Arabidopsis proteins of the ethylene biosynthesis pathway had more potential glycosylation sites than those of rice, and 1-aminocyclopropane-1-carboxylate oxidase proteins had fewer predicted phosphorylation sites than 1-aminocyclopropane-1-carboxylate synthase and S-adenosylmethionine proteins. According to the gene expression patterns, S-adenosylmethionine genes were more involved in the ripening stage in rice, whereas in Arabidopsis, ACS2 and 1-aminocyclopropane-1-carboxylate oxidase genes contributed to seed maturity. Furthermore, miRNA target prediction on the transcript sequences showed that ath-miR843 and osa-miR1858 play a key role in the post-transcriptional regulation of S-adenosylmethionine genes in Arabidopsis and rice, respectively. The cis-motifs found in the promoters of all ethylene biosynthesis genes of A. thaliana are associated with light-induced responses in cotyledon and root genes, the sulfur-responsive element, dehydration, cell cycle phase-independent activation, and salicylic acid. ACS4 was predicted to have strong protein-protein interactions in Arabidopsis, as were SAM2, Os04T0578000, Os01T0192900, and Os03T0727600 in rice.

Conclusion: In the current study, complexes between miRNAs and transcript sequences of ethylene biosynthesis genes in A. thaliana and O. sativa were identified, which could help to explain how gene expression is regulated after transcription. Binding sites of common transcription factors such as MYB, WRKY, and ABRE, which control target genes under abiotic and biotic stresses, were widely distributed in the promoter sites of ethylene biosynthesis genes of A. thaliana. This is the first broad exploration of the ethylene biosynthesis pathway using bioinformatics tools. It shows the capability of in silico studies to integrate existing data and knowledge and to furnish novel insights into the genes underlying the ethylene biosynthesis pathway, which will be helpful for further dissection.


Background
The gas ethylene is known as a signaling molecule that regulates stress responses and various developmental processes in plants [1,2], such as promotion of fruit ripening, petal and leaf abscission, flower senescence, stimulation of root initiation, and inhibition of seedling elongation [3]. Ethylene is also produced in response to environmental stresses [3], including wounding [4], flooding [5], and attack by bacteria, viruses, fungi, nematodes, and insects [6]. Ethylene is a hormone regulated under stress conditions [7], and studies of stress-responsive genes in several ecotypes revealed varying basal expression levels [8]. Increased ethylene production works as a signaling mechanism with pronounced physiological outcomes [1,9,10]. Ethylene is synthesized from methionine, which is first transformed to S-adenosylmethionine; this is then converted by the enzyme 1-aminocyclopropane-1-carboxylate synthase into methylthioadenosine and 1-aminocyclopropane-1-carboxylic acid (ACC), the precursor of ethylene [11]. ACC is oxidized to HCN, CO2, and C2H4 by ACC oxidase (ACO) [12]. ACC can also be diverted from conversion to ethylene by forming the conjugate N-malonyl-ACC [13]. According to Bleecker et al. [11], 1-aminocyclopropane-1-carboxylate synthase (ACS) activity is regulated at the transcriptional and post-transcriptional levels [1,9]. Owing to the influence of ethylene on senescence and ripening, large quantities of vegetables, fruit, and flowers are lost. Therefore, efforts have been made to delay or prevent fruit ripening in a reversible manner. Antisense RNA experiments have shown that ACC synthase activity is the rate-limiting step in ethylene synthesis [14].
The natural diversity of ethylene production suggests that plants can adapt to different environments by fine-tuning ethylene biosynthesis and signaling pathways. Observation of some stress-responsive genes revealed that this adaptation could be associated with altered expression of ACS genes via epigenetic modifications [8,15]. Moreover, ethylene has been shown to influence the transcription and translation of many ripening-related genes [16]. In tomato, at least eight ACS genes have been recognized [17]. The Arabidopsis genome contains 12 putative ACS-like genes. Of these, ACS3 was identified as a pseudogene with a short sequence, and ACS10 and ACS12 encode aminotransferases without ACS catalytic activity [18]. The nine remaining ACS genes encode a group of ACS proteins that can be categorized into three types according to the absence or presence of putative phosphorylation sites in the C-terminal extension [19,20]. Type-1 ACS proteins have a relatively long C-terminal domain that carries highly conserved target sites for a calcium-dependent protein kinase (CDPK) and a mitogen-activated protein kinase (MAPK) [21][22][23][24], whereas type-2 proteins have only the predicted CDPK phosphorylation site. In addition, type-2 ACS proteins contain an exclusive regulatory motif, named target of ethylene overproducer 1 (ETO1), or TOE, that overlaps with the CDPK target site. The TOE motif mediates interaction with the ETO1 E3 ligase and its two paralogs, ETO1-like 1 and 2 (EOL1 and EOL2), and is needed for type-2 ACS degradation [24][25][26][27][28]. Type-3 ACS proteins contain only a short stretch of amino acids in the C-terminal domain and no target sites for MAPK or CDPK [19,24]. Rice was selected as the second plant for this study, as a monocotyledon; it is the main staple cereal and feeds almost half of the world's population. 
Owing to rising worldwide demand from the growing population, an increase of approximately 50% in rice production will be needed [10]. Rice has the smallest genome of the main cereals and rich genetic diversity. Moreover, the whole-genome sequence of rice provides a basis for identifying homologous genes in other crops [29,30]. Its sustainability and productivity are seriously threatened by several biotic and abiotic stresses such as submergence, drought, salinity, and chilling, and ethylene plays a primary role in adapting plants to stress conditions [20,31]. In deepwater rice, OsACO1 has been shown to be involved in internode elongation, and submergence enhances ACO enzyme activity and OsACO1 mRNA levels [32,33]. The expression of the OsACO3 and OsACO2 genes in etiolated rice seedlings was also shown to be differentially controlled by auxin and ethylene [34].
Ethylene is a major hormone that controls many physiological pathways. It has been suggested to be more effective against necrotrophic pathogens (such as B. cinerea) than against biotrophic pathogens. The ethylene-insensitive mutants etr1, ein3, and ein2 display increased susceptibility to B. cinerea [35,36]. In addition, plants that overexpress transcription factors associated with the jasmonic acid and ethylene pathways show enhanced resistance to different necrotrophs [37][38][39]. In Arabidopsis, overexpression of AP2C1, which encodes a Ser/Thr protein phosphatase of type 2C, decreases ethylene production and compromises resistance to the necrotrophic pathogen B. cinerea [40]. On the other hand, the ACSs could be regulated through putative receptors of endogenous signals such as phytohormones, and through intracellular accumulation of second messengers such as calcium [3]. Moreover, application of ACC or ethylene could enhance plant salinity tolerance, mainly by increasing the expression of reactive oxygen species scavengers [41][42][43]. As with ACS, the expression of ACO genes from various species is associated with the rate of ethylene biosynthesis, and the transcript levels of multiple ACO genes are regulated under stress conditions [44,45]. The environmental stress-responsive genes include the ethylene response factor (ERF) gene family; the mRNA levels of various ERFs are controlled by several molecules and hormones produced under different stress conditions [46]. Ethylene plays a biphasic role, inhibiting or stimulating growth depending on the species, the developmental stage of the organ or tissue, and environmental conditions [47,48]. In Arabidopsis, ethylene inhibits hypocotyl elongation in low light or darkness by activating the transcription factor ethylene response factor 1 (ERF1) [49][50][51] and WAVE-DAMPENED 2-LIKE 5 (WDL5) [52]. 
The transcription factor ELONGATED HYPOCOTYL 5 (HY5), which is degraded via the E3 ligase CONSTITUTIVE PHOTOMORPHOGENIC 1 (COP1), is also involved in this process [53]. Regulation of ACS transcript levels appears to be a critical mechanism for controlling changes in plant ethylene production. Nonetheless, recent studies suggest that post-translational modifications such as ubiquitination and phosphorylation are an important mechanism for adjusting the stability of ACS proteins, which in turn regulates ethylene levels in plants [24,54,55].
The wealth of genome sequence information for rice and Arabidopsis provides a valuable resource for studying and dissecting ethylene biosynthesis genes in monocotyledons (rice) and dicotyledons (Arabidopsis). The genes responsible for ethylene biosynthesis in Arabidopsis and rice were carefully retrieved. Given the importance of post-transcriptional and post-translational modifications, studying the phosphorylation and glycosylation of ethylene biosynthesis proteins and the miRNAs that target ethylene biosynthesis genes will be useful. In addition, the cis-regulatory elements in the promoter regions of ethylene biosynthesis genes will give a better understanding of how the expression of these genes is regulated. Moreover, knowledge of cis-acting regulatory elements can help to change gene expression patterns through plant genetic engineering in order to avoid damage from biotic and abiotic stresses. The present study is the first to provide comprehensive information and a broad analysis of ethylene biosynthesis genes using available bioinformatics tools to dissect the promoter regions, mRNA, and protein sequences of these genes in two important model plants, Arabidopsis and rice.

Retrieval of ethylene genes and sequence analysis
The genes involved in the ethylene biosynthesis pathway in Arabidopsis and rice were identified using PlantCyc (https://www.plantcyc.org/). The transcript and polypeptide sequences of all genes involved in ethylene biosynthesis were retrieved from The Arabidopsis Information Resource (TAIR) (https://www.arabidopsis.org/) for Arabidopsis thaliana and from the Rice Genome Annotation Project Database (http://rice.plantbiology.msu.edu/index.shtml) for Oryza sativa [56].

Evolutionary analysis
The full-length amino acid sequences of all predicted SAM, ACS, and ACO proteins of rice and Arabidopsis were aligned using ClustalX. The phylogenetic tree was constructed using the neighbor-joining method in Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/).

3D protein structure prediction and domain analysis
The three-dimensional (3D) protein structures and ligand-binding sites of the SAM, ACS, and ACO proteins were predicted by homology modeling with SWISS-MODEL [58]. The protein sequences of the studied genes were also analyzed with the MOTIF Search program (https://www.genome.jp/tools/motif/) to find conserved motifs and domains.

Gene expression analysis and identification of miRNA targets
Microarray expression data for the genes of interest in Arabidopsis thaliana and Oryza sativa under biotic and abiotic stresses and hormone treatments were obtained from the Genevestigator database [59]. The Affymetrix rice genome array (2836 samples) and the Affymetrix Arabidopsis ATH1 genome array (10615 samples) were selected to study the expression patterns of ethylene biosynthesis genes in rice and Arabidopsis, respectively. The psRNATarget server (http://plantgrn.noble.org/psRNATarget/) was used to identify known miRNAs of Arabidopsis thaliana and Oryza sativa that target the transcript sequences of the selected genes, with a maximum expectation of 3.5 [60].
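
The expectation cutoff described above can be illustrated with a minimal sketch. It assumes the predictions have already been exported as simple records; the record layout, the names `TargetHit` and `filter_hits`, and the example values are hypothetical and do not reproduce the psRNATarget output format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TargetHit:
    """One predicted miRNA-target pair (hypothetical record layout)."""
    mirna: str
    target: str
    expectation: float  # lower values indicate better complementarity
    inhibition: str     # e.g. "Cleavage" or "Translation"


def filter_hits(hits, max_expectation=3.5):
    """Keep hits at or below the expectation cutoff, best-scoring first."""
    kept = [h for h in hits if h.expectation <= max_expectation]
    return sorted(kept, key=lambda h: (h.expectation, h.mirna, h.target))


if __name__ == "__main__":
    # Placeholder names and scores, for illustration only.
    example = [
        TargetHit("miR-A", "gene1", 2.5, "Cleavage"),
        TargetHit("miR-B", "gene2", 4.0, "Cleavage"),
        TargetHit("miR-C", "gene3", 3.5, "Translation"),
    ]
    for hit in filter_hits(example):
        print(hit.mirna, hit.target, hit.expectation, hit.inhibition)
```

A stricter cutoff can be passed through `max_expectation` to see how sensitive the list of predicted targets is to the chosen threshold.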

Prediction of putative cis-elements
To identify probable cis-regulatory elements, the promoter sequences (1500 bp upstream of the transcription start site) of ethylene biosynthesis pathway genes in Arabidopsis and rice were scanned using the PlantPAN 2.0 database (http://plantpan2.itps.ncku.edu.tw/index.html) [61].
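
As an illustration of what "1500 bp upstream of the transcription start site" means on each strand, the following is a minimal sketch. It assumes the chromosome sequence is available as a plain string and that the transcription start site (TSS) is given as a 1-based coordinate; the function names are hypothetical and this is not the procedure used by PlantPAN.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    table = str.maketrans("ACGTNacgtn", "TGCANtgcan")
    return seq.translate(table)[::-1]


def promoter(chrom_seq, tss, strand, length=1500):
    """Return up to `length` bp upstream of a 1-based TSS, in gene orientation.

    On the '+' strand the promoter lies at lower coordinates than the TSS.
    On the '-' strand it lies at higher coordinates and is reverse
    complemented so that it reads 5' to 3' toward the gene.
    The window is truncated at the chromosome ends.
    """
    if strand == "+":
        start = max(0, tss - 1 - length)
        return chrom_seq[start:tss - 1]
    if strand == "-":
        end = min(len(chrom_seq), tss + length)
        return reverse_complement(chrom_seq[tss:end])
    raise ValueError("strand must be '+' or '-'")
```

Returning the minus-strand promoter already reverse complemented means that downstream motif searches can treat every promoter in the same orientation.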

Results
Biochemical characteristics of SAM, ACS, and ACO genes in Arabidopsis thaliana and Oryza sativa
The genes involved in the ethylene biosynthesis pathway in Arabidopsis and rice were identified using the PlantCyc database. According to the ethylene biosynthesis pathway, 26 and 28 enzymes were predicted in A. thaliana and O. sativa, respectively (Fig. 1). Of the 26 genes identified in A. thaliana, 4, 9, and 13 were assigned to methionine adenosyltransferase (SAM), aminocyclopropane-1-carboxylate synthase (ACS), and aminocyclopropane-1-carboxylate oxidase (ACO), respectively; in O. sativa, 6, 6, and 16 predicted genes were assigned to SAM, ACS, and ACO, respectively. In both species, more genes were assigned to ACO than to SAM or ACS (Fig. 1, Table 1).
The proteins ranged in length from 251 to 490 aa in Arabidopsis, where AT3G46500 (an ACO) was the smallest protein and AT4G37770 (an ACS) was the largest (Table 1). In rice, protein length varied from 157 to 544 aa; LOC_Os05g05670 was the smallest protein and LOC_Os10g37899 the largest, both of them ACOs. Furthermore, the longer proteins generally belonged to ACS in both Arabidopsis and rice ( 11.18 (LOC_Os05g35000). Most of the predicted proteins in A. thaliana were stable, except those involved in ACS; in rice, however, most of the ACS and ACO proteins were unstable (Table 1). The aliphatic index ranged from 74.55 (AT1G62380) to 89.75 (AT1G35190) in A. thaliana and from 70.45 (LOC_Os05g35000) to 96.53 (LOC_Os02g57990) in O. sativa. In rice, the lowest and highest aliphatic indices were found in predicted ACO and SAM proteins, respectively (Table 1). The predicted localization of the proteins was diverse and included the chloroplast, Golgi apparatus, cytoplasm, and nucleus (Table 1). Most SAM proteins were localized to the chloroplast in both A. thaliana and O. sativa, except LOC_Os02g57990 (Golgi apparatus) and LOC_Os07g29440 and LOC_Os01g10940 (cytoplasm) in O. sativa (Table 1). The predicted ACS proteins were localized in the chloroplast and cytoplasm. Most of the ACO proteins were associated with the cytoplasm in both A. thaliana and O. sativa; however, LOC_Os10g37899 and LOC_Os08g33020 were located in the chloroplast as well as cytoplasm and nucleus, respectively (Table 1). The results revealed that the genes involved in ethylene biosynthesis are more varied in rice than in Arabidopsis.

Phylogenetic relationship
To investigate the evolutionary relationships among the genes of the ethylene biosynthesis pathway, we constructed a rooted neighbor-joining phylogenetic tree from the amino acid sequences of Arabidopsis and rice (Fig. 2). According to the phylogenetic tree, LOC_Os02g57990, which was predicted to localize to the Golgi apparatus, was more genetically distant than the other rice SAM genes. In addition, two rice SAM proteins predicted to localize to the cytoplasm (LOC_Os07g29440 and LOC_Os01g10940) were highly similar in amino acid sequence (Table 1, Fig. 2). The evolutionary relationships among SAM proteins suggest that rice SAM genes are more variable than Arabidopsis SAM genes (Fig. 2). ACS proteins clustered into three groups, with 8 of the 15 ACSs in the first group. Interestingly, one predicted rice ACS protein (LOC_Os06g03990) was distant from the others (Fig. 2). In the first group, AT3G61510, AT1G01480, AT4G11280 (ACS6), and LOC_Os04g48850 were more distant than other ACS proteins from rice and Arabidopsis, and it is worth noting that these proteins were predicted to localize to the chloroplast (Table 1, Fig. 2). The ACO proteins could likewise be clustered into three groups based on amino acid sequence similarity. The first group contained 15 ACOs, while 13 ACOs clustered into the second. LOC_Os01g61440 was more genetically distant than the other studied ACOs. All Arabidopsis ACO proteins were predicted to localize to the cytoplasm, whereas rice ACO proteins differed in their predicted localization. Phylogenetic analysis also showed that rice ACO proteins were more variable than Arabidopsis ACO proteins (Table 1, Fig. 2).

Protein structure and domain analysis
In this study, the 3D structures of all SAM, ACS, and ACO proteins and their ligand-binding sites were predicted by homology modeling with the SWISS-MODEL database, for predicting protein-protein interactions (Figs. 3, 4, 5). Ligand sites for S-adenosylmethionine, identified with the protein-ligand interaction profiler (PLIP), were observed in all Arabidopsis SAM proteins and in three rice SAM proteins (LOC_Os01g18860, LOC_Os01g22010, and LOC_Os05g04510) (Fig. 3). The ligand site of MES (2-(N-Morpholino)-ethanesulfonic acid) was observed in all predicted ACS proteins except AT1G01480 (Fig. 4). Also, the ligand-binding site of PLP (Pyridoxal-5-Phosphate) was found in the structure of ACO proteins. However, the binding sites of AAD ((2-Aminooxy-Ethyl)-[5-(6-Amino-Purin-9-YL)-3,4-Dihydroxy-Tetrahydro-Furan-2-Ylmethyl]-Methyl-Sulfonium) and 2-Amino-4-(2-Amino-Ethoxy)-Butyric acid were observed only in Arabidopsis ACS proteins (Fig. 5). For most ACO proteins, no ligand-binding site was predicted; however, ion-binding sites (iron, zinc, and nickel ions) were observed in some ACO proteins (Fig. 5). According to the 3D structure and ligand type, AT2G19590 was most similar to LOC_Os09g27750 and LOC_Os09g27820, and AT3G46500 was similar to LOC_Os10g37899 (Fig. 5). Motif analysis was carried out separately for the SAM, ACS, and ACO proteins using the MOTIF Search program (https://www.genome.jp/tools/motif/) (Fig. 6). The location and order of the three motifs in SAM proteins were similar, except in LOC_Os02g57990, LOC_Os07g29440, and LOC_Os01g10940. Motifs of different lengths and locations were observed in LOC_Os02g57990, which was predicted to localize to the Golgi apparatus and was more genetically distant than the other rice SAM genes (Table 1, Figs. 2, 6). The Aminotran_1_2 motif was detected in all of the studied ACS proteins in Arabidopsis and rice, at almost the same position. 
Besides, Beta_elim_lase was identified in LOC_O05g25490, LOC_Os01g09700, At04g08040, At04g26200, and At01g01480 (Fig. 6). Two  Fig. 6).

Gene expression: anatomy, developmental stages, biotic and abiotic stresses, and hormone treatments
The expression patterns of SAM, ACS, and ACO genes in Arabidopsis and rice were evaluated in different tissues, organs, and growth and developmental stages, as well as under biotic and abiotic stresses and hormone treatments, using microarray data available online in the Genevestigator database (Figs. 7, 8, 9). In Arabidopsis, the SAM genes SAM1, SAM2, METK3, and METK4 were highly expressed in the primary cell, seedling, inflorescence, shoot, and root. In rice, LOC_Os01g22010 and LOC_Os05g04510 showed a high level of expression in the studied tissues and organs, while LOC_Os01g18860 and LOC_Os02g57990 were expressed at a medium level (Fig. 7). In Arabidopsis, ACS1, ACS2, ACS4, ACS5, ACS6, ACS7, ACS8, ACS9, and ACS11 showed distinct expression in different tissues and organs. Most of the ACS genes showed medium expression in the studied tissues and organs; the exceptions were ACS5, which was expressed at a low level in all studied tissues and organs, and ACS6, which was highly expressed in the primary cell, shoot, and root (Fig. 7). In rice, LOC_Os06g03990 was highly expressed in the studied tissues and organs, whereas LOC_Os05g10780, LOC_Os03g51740, LOC_Os04g48850, and LOC_Os01g09700 showed a medium level of expression in all studied tissues and organs, except that LOC_Os01g09700 had a low level of expression in the inflorescence (Fig. 7). Most of the Arabidopsis ACO genes were expressed at a medium level. Among them, DIN11 and At03g46500 showed the lowest expression in the inflorescence, and At03g46500 also had a low expression level in the shoot. At01g77330, ACO1, At03g50210, and At01g35190 were highly expressed in the shoot, and At01g77330 was also highly expressed in the primary cell and seedling. Moreover, At03g50210 was highly expressed in the primary cell and inflorescence (Fig. 7). 
Nine rice ACO genes showed varying expression levels: LOC_Os10g37899 and LOC_Os02g53180 had the highest expression, whereas LOC_Os04g55070 and LOC_Os05g35000 showed low expression levels in the studied tissues and organs (Fig. 7). It can be concluded that almost all SAM, ACS, and ACO genes were expressed in the studied tissues and organs, but at different levels. The expression of SAM, ACS, and ACO genes was also investigated at different growth and developmental stages (Fig. 8). The Arabidopsis SAM genes were mostly expressed at the germination stage, while two rice SAM genes (LOC_Os01g18860 and LOC_Os01g22010) were highly expressed at the ripening stage. Furthermore, Arabidopsis ACS2 was more strongly induced than the other ACS genes and showed high expression in seeds (Fig. 8). Among the ACO genes, At01g35190 and At03g50210 were more up-regulated than the others and were highly expressed in seeds. Overall, SAM genes were more involved in the ripening stage in rice, while in Arabidopsis, ACS and ACO genes contributed to maturity (Fig. 8).
The expression of SAM, ACS, and ACO genes under biotic and abiotic stresses and hormone treatments in Arabidopsis and rice was studied using existing microarray data (Fig. 9). In Arabidopsis, the SAM gene At02g36880 showed strongly differential expression under stress conditions. It was up-regulated by the 24-CBL + glucose (dark) hormone treatment at the seedling stage, but down-regulated under heat (seedling), salt (root), and temperature stress, as well as under G. cichoracearum infection (biotic stress). In rice, LOC_Os01g22010 was up-regulated under drought (leaf) and dehydration but down-regulated under drought (tillering) and heat stress; interestingly, LOC_Os02g57990 displayed the opposite pattern under the studied conditions (Fig. 9). Among the Arabidopsis ACS genes, At04g37770 showed varied expression under different stress conditions: it was up-regulated by IAA (seedling), NAA, NAA + FLG22, RALF, and a shift from NPA to NAA, while it was down-regulated under some abiotic stresses. The expression profiles of some ACS genes also showed that LOC_Os01g09700 was up-regulated in particular across the time course of trans-zeatin treatment and under drought stress, but down-regulated under heat stress (Fig. 9). Regarding the ACO genes in Arabidopsis, AT03g49620 and At01g62380 were up-regulated and down-regulated, respectively, under most of the studied conditions, while in rice LOC_Os01g61440 was up-regulated under abiotic stresses including cold, dehydration, drought, and heat (Fig. 9).
It is worth noting that some of the studied genes with similar expression patterns under particular stress had a 9). The similar expression patterns of these genes under specific stresses appear to be associated with similar cis-elements in their promoter regions. This suggests that the transcription of these genes is regulated by the identified transcription factors under the same conditions. Thus, the gene expression analysis under various conditions showed that environmental signals and stresses influence the regulation of the ethylene biosynthesis pathway. These results could help to clarify how the gene networks of the pathway are organized and regulated in various tissues, organs, developmental stages, and stress conditions.

Prediction of miRNA targets
In the present study, microRNA (miRNA) target sites were predicted with the psRNATarget server using published miRNA sequences for Arabidopsis and rice (Table 2). Among the transcript sequences of SAM, ACS, and ACO genes, SAM1 (AT1G02500) from Arabidopsis was targeted by ath-miR843, while osa-miR1858 targeted the transcripts of two rice SAMs (LOC_Os01g22010 and LOC_Os05g04510). Two rice ACOs and one Arabidopsis ACO contained the binding sites of ath-miR3933, osa-miR5809, and osa-miR531, respectively. In all cases, the predicted mode of miRNA inhibition was transcript cleavage. The complexes identified in our study between published miRNAs and transcript sequences of ethylene biosynthesis genes in A. thaliana and O. sativa should be helpful for understanding the regulation of gene expression after transcription.

Cis-regulatory elements in promoter site
Gene expression is largely regulated at the transcription phase, where interactions between cis-regulatory elements in the promoter region of a gene and transcription factors play a crucial role. In other words, cis-regulatory elements (CREs) are non-coding DNA sequences, mainly located upstream of genes, that are recognized by transcription factors controlling gene expression under various conditions. Analyses of the promoter regions of induced genes led to the discovery of cis-acting elements and of the ethylene-responsive element-binding protein (EREBP) family, which interacts with ethylene response factors (ERFs) and DNA [3]. Transcription factors of the ERF family have been shown to be involved in several developmental processes [64][65][66] and in abiotic [67,68] and biotic [69] stress responses. The upstream region (promoter site) of the studied genes was screened to identify the key cis-elements that regulate gene expression under different conditions (Table 3). The AAACAAA sequence, named the anaero1 consensus, was observed in most promoter sites of ethylene biosynthesis genes of A. thaliana and O. sativa. The anaero1 consensus is one of the motifs found in the promoters of anaerobic genes involved in the fermentative pathway [76]. Binding sites of common transcription factors such as MYB, WRKY, and ABRE, which control target genes under abiotic and biotic stresses, were widely distributed in the promoter sites of ethylene biosynthesis genes of A. thaliana. SORLIP1AT, SURECOREATSULTR11, ABRELATERD1, MYBCOREATCYCB1, and LS7ATPR1 were found in the promoter sites of all ethylene biosynthesis genes of A. thaliana. These cis-motifs are associated with light-induced responses in cotyledon and root genes [70], the sulfur-responsive element [71], dehydration [72], cell cycle phase-independent activation, and salicylic acid [73], respectively. 
In addition, binding sites of some key cis-regulatory elements such as BIHD1OS, CGACGOSAMY3, and GARE2OSREP1, which are involved in disease resistance [78], sugar starvation [79], and gibberellin response (gibberellin-responsive element, GARE) [81], respectively, were commonly distributed in the promoter sites of ethylene biosynthesis genes of O. sativa.
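
To make the idea of screening a promoter for a fixed consensus concrete, the sketch below searches both strands of a sequence for an exact motif such as the anaero1 consensus AAACAAA. It is a simplified illustration with hypothetical function names; databases such as PlantPAN also use position weight matrices and degenerate consensus sequences, which this sketch does not handle.

```python
def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_motif(promoter_seq, motif):
    """Return (0-based position, strand) for every exact motif occurrence.

    A '-' hit means that the reverse complement of the motif was found
    on the given strand, i.e. the motif itself lies on the opposite strand.
    Overlapping occurrences are reported.
    """
    seq = promoter_seq.upper()
    fwd = motif.upper()
    rev = reverse_complement(fwd)
    hits = []
    for i in range(len(seq) - len(fwd) + 1):
        window = seq[i:i + len(fwd)]
        if window == fwd:
            hits.append((i, "+"))
        elif window == rev:
            hits.append((i, "-"))
    return hits


if __name__ == "__main__":
    # Toy sequence, not a real promoter.
    print(find_motif("GGAAACAAATTTTGTTTCC", "AAACAAA"))
```

Positions are reported relative to the start of the supplied promoter string, so they can be converted to distances from the transcription start site once the promoter length is known.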

Potential phosphorylation and glycosylation sites
Phosphorylation and glycosylation are prevalent post-translational modifications that can alter the target site and activity of a protein [82]. The potential phosphorylation sites of the studied proteins were predicted based on the presence of serine, threonine, and tyrosine residues (Fig. 10). Phosphorylation is catalyzed by kinases that transfer a phosphoryl group, commonly from ATP but also from ADP, to the hydroxyl group of particular Ser, Tyr, or Thr residues in their target proteins. His can also be phosphorylated, as can both Asp and His in plant two-component signaling [83][84][85][86]. LOC_Os05g05670 (an ACO protein) had the fewest predicted phosphorylation sites, while the highest number (53 sites) was predicted in LOC_Os06g03990 (an ACS protein). In Arabidopsis, the number of predicted phosphorylation sites in SAM, ACS, and ACO proteins ranged from 19 (AT3G49630, an ACO protein) to 48 (AT1G01480, an ACS protein). According to our findings, ACO proteins had fewer predicted phosphorylation sites than ACS and SAM proteins. The idea that phosphorylation of ACS regulates ethylene production is supported by a study showing that a mutation in the C-terminal extension of ACS5 causes the Arabidopsis eto2-1 mutant to overproduce ethylene. The predicted glycosylation sites within the amino acid sequences of SAM, ACS, and ACO proteins are presented in Table 4. All Arabidopsis SAMs showed similar glycosylation patterns, whereas the patterns differed widely among rice SAMs, and 50% of them had no predicted glycosylation site. ACS proteins had the most predicted glycosylation sites; AT2G22810 (ACS4) had four (a hyperglycosylated protein). The rice ACO proteins had the fewest predicted glycosylation sites, and 75% of them had none. Also, 38% of Arabidopsis ACO proteins had no potential glycosylation site.
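
The sequence features behind these predictions can be sketched in a few lines. The example below only lists candidate residues: it counts Ser, Thr, and Tyr, and it locates N-glycosylation sequons of the form Asn-X-Ser/Thr where X is not Pro. It is not a reimplementation of the prediction servers used in this study, which score each candidate in its sequence context; the function names and the toy peptide are hypothetical.

```python
import re


def count_phospho_candidates(protein_seq):
    """Count Ser (S), Thr (T), and Tyr (Y) residues in a protein sequence."""
    seq = protein_seq.upper()
    return {aa: seq.count(aa) for aa in "STY"}


def find_sequons(protein_seq):
    """Return 1-based positions of Asn in N-X-[S/T] sequons (X is not Pro).

    A lookahead is used so that overlapping sequons are all reported.
    """
    seq = protein_seq.upper()
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", seq)]


if __name__ == "__main__":
    toy = "MNGSANPTYNNTS"  # toy peptide, not a real protein
    print(count_phospho_candidates(toy))
    print(find_sequons(toy))
```

Because every sequon and every Ser, Thr, or Tyr is only a candidate, these counts are upper bounds on what a context-aware predictor would report.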

Discussion
Ethylene is one of the simplest well-characterized plant hormones, and its biosynthesis comprises three simple steps beginning from the amino acid methionine in both dicot and monocot plants [13]. First, methionine is transformed to S-adenosylmethionine (SAM), which is then converted to 1-aminocyclopropane-1-carboxylic acid (ACC) by ACC synthases (ACS). Finally, ACC is converted to ethylene by ACC oxidases (ACO) [87]. In this study, 26 and 28 genes involved in the ethylene biosynthesis pathway were predicted in A. thaliana and O. sativa, respectively. The selected SAM, ACS, and ACO genes in Arabidopsis and rice varied in physicochemical properties, including protein length, GRAVY value, aliphatic index, molecular weight, isoelectric point (pI), and instability index. The GRAVY values of ethylene biosynthesis proteins were more varied in Arabidopsis than in rice. The GRAVY value is associated with protein solubility and is calculated as the sum of the hydropathy values of all residues divided by the sequence length [88,89]. According to the predicted GRAVY values, the ACS enzymes of Arabidopsis are more hydrophilic than those of rice. The ACOs of rice showed high variation in physicochemical properties. In addition, the lowest and highest aliphatic indices in rice were observed in predicted ACO and SAM proteins, respectively. The aliphatic index is an important factor in the thermostability of proteins [90].
Research has shown that an Arabidopsis 14-3-3 protein positively regulates ethylene biosynthesis by interacting with ACS proteins and increasing their stability [55]. Proteins with a high aliphatic index may have a longer half-life and could function at high reaction temperatures [91].
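
The two indices discussed above follow simple formulas, which the sketch below reproduces under stated assumptions: GRAVY is taken as the mean Kyte-Doolittle hydropathy over all residues, and the aliphatic index as X(Ala) + 2.9 X(Val) + 3.9 (X(Ile) + X(Leu)), where X is the mole percent of each residue. The function names are hypothetical, only the 20 standard amino acids are handled, and the values in Table 1 were not necessarily computed with this code.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(protein_seq):
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue.

    Negative values indicate a more hydrophilic protein.
    Raises KeyError for non-standard residue codes.
    """
    seq = protein_seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def aliphatic_index(protein_seq):
    """Aliphatic index from the mole percent of Ala, Val, Ile, and Leu."""
    seq = protein_seq.upper()

    def mole_percent(aa):
        return 100.0 * seq.count(aa) / len(seq)

    return (mole_percent("A")
            + 2.9 * mole_percent("V")
            + 3.9 * (mole_percent("I") + mole_percent("L")))
```

Comparing `gravy` values between orthologous proteins gives the kind of relative statement made above (more or less hydrophilic), while `aliphatic_index` reflects the relative volume occupied by aliphatic side chains.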
With the advent and advancement of the genomic era, more and more genome sequences are being released, which paves the way for evolutionary and comprehensive studies of any gene family across various species [82]. Our results, based on the predicted proteins studied, indicated that the proteins of the ethylene biosynthesis pathway showed higher variation in rice than in Arabidopsis. Lee and Yoon [20] reported that the structural similarity and the conserved regulatory motifs found in ACS proteins from both rice and Arabidopsis point to an evolutionarily conserved mechanism underlying the regulation of ethylene biosynthesis in the two species. In addition, different ligand-binding sites were observed in the predicted 3D structures of the ACO and ACS proteins. Elucidating the biochemical and biological roles of proteins by determining their interacting partners can be time-consuming and difficult to carry out with in vivo and/or in vitro approaches; moreover, most recently sequenced proteins have unclear functions and structures. Computational approaches for predicting protein-ligand binding sites offer a practical alternative. It is therefore important to identify these key sites in order to understand protein function [92][93][94]. An MES (2-(N-Morpholino)-ethanesulfonic acid) binding site was observed in all predicted ACS proteins except AT1G01480, whereas binding sites for AAD ((2-Aminooxy-Ethyl)-[5-(6-Amino-Purin-9-YL)-3,4-Dihydroxy-Tetrahydro-Furan-2-Ylmethyl]-Methyl-Sulfonium) and 2-Amino-4-(2-Amino-Ethoxy)-Butyric acid were observed only in Arabidopsis ACS proteins.
Table 4 The predicted N-glycosylation sites in amino sequences of methionine adenosyltransferase (SAM), aminocyclopropane-1-carboxylate synthase (ACS), and aminocyclopropane-1-carboxylate oxidase (ACO) proteins in Arabidopsis and rice using NetNGlyc 1.0 server (http://www.cbs.dtu.dk/services/NetNGlyc/) [62]
The structure of the ligand-binding site can influence protein function, protein evolution, and protein-protein interaction [95]. Ethylene plays a major role in the initiation of senescence and fruit ripening, and it also boosts the transcription and translation of responsive genes involved in fruit softening, cell-wall metabolism, and membrane metabolism by switching on ethylene signal transduction [89,96,97]. The gene expression results demonstrated that SAM, ACS, and ACO genes were differentially induced across plant developmental stages and had different expression patterns in the monocot and the dicot in response to stresses. The ACS2 gene of Arabidopsis was more strongly induced than other ACS genes and showed high expression in seeds. In Arabidopsis, ACS transcripts have been detected in etiolated seedlings, roots, stems, leaves, siliques, and flowers [18,98,99]. Each member of the multigene family is differentially expressed during auxin treatment, wounding, and ripening [100]. For example, the LE-ACS4 and LE-ACS2 genes are expressed during ripening in tomato [101], induced in mature green fruits after treatment with exogenous ethylene [101,102], and strongly induced upon wounding of pericarp tissue [103]. Some ACS genes, including At04g37770, At04g26200, and At04g11280, were up-regulated under salt stress. Lelièvre et al. reported that expression of the ACC synthase gene is controlled by ethylene only during or after chilling treatment, whereas expression of the ACC oxidase gene can be regulated separately by either ethylene or chilling [104]. As already noted, in Arabidopsis, various abiotic stresses often enhance ethylene biosynthesis by increasing the transcription of distinct subsets of ACS genes. Transcript levels of the ACS6 gene rise in response to ozone [105]. ACS2, ACS9, ACS6, and ACS7 are induced during hypoxia [106], but the expression of all the ACS genes decreased under anaerobic conditions in Arabidopsis [98].
Nevertheless, the transcript levels of separate subsets of the ACS genes increase in response to osmotic stress, drought, high temperature, and wounding [98,99]. Gene expression is largely regulated at the transcriptional level, where interactions between cis-regulatory elements in the promoter region and transcription factors play a crucial role. Binding sites associated with important stress-responsive regulators, including ABRE, MYB, and WRKY elements, were widely distributed in the promoter regions of the ethylene biosynthesis genes of A. thaliana. Given their regulatory role, the presence of these elements may explain much of the plant stress response [46,107,108]. In addition, different cis motifs, including sulfur-responsive, dehydration-responsive, and hormone-responsive (salicylic acid, gibberellin, and abscisic acid) elements, were observed upstream of SAM, ACO, and ACS genes. Cis-acting elements are specific binding sites for proteins involved in the initiation and regulation of transcription, which suppress or activate gene transcription in response to changing growth conditions and different environmental stresses [109]. Our results indicated that most SAM and ACO genes were down-regulated in response to abiotic stresses, and various factors, such as the type of cis-regulatory elements, may affect these expression patterns. Collectively, the current study indicates that genes involved in the ethylene biosynthesis pathway play key roles not only in regulating developmental stages such as ripening but also in regulating responses to abiotic and biotic stresses.
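To make the idea of promoter cis-element detection concrete, the sketch below scans a DNA sequence on both strands for short core motifs. It is a simplified illustration, not the motif-discovery procedure used in this study. The two cores used, ACGTG for ABRE-like elements and TTGAC for the W-box bound by WRKY factors, are assumed here as commonly cited minimal cores; real elements have longer, degenerate consensus sequences. The promoter string is a hypothetical toy sequence.

```python
import re

# Assumed minimal core motifs (illustrative only; real elements are degenerate).
CORE_MOTIFS = {
    "ABRE_core": "ACGTG",
    "W_box_core": "TTGAC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return dna.translate(_COMPLEMENT)[::-1]


def scan_promoter(promoter: str, motifs: dict = CORE_MOTIFS) -> list:
    """Return (motif_name, strand, 0-based start) for every motif occurrence.

    The minus strand is searched by looking for the reverse complement of the
    motif in the given sequence; a lookahead pattern keeps overlapping hits.
    Note that a palindromic motif would be reported once per strand.
    """
    seq = promoter.upper()
    hits = []
    for name, motif in motifs.items():
        for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
            for match in re.finditer(f"(?={pattern})", seq):
                hits.append((name, strand, match.start()))
    return sorted(hits, key=lambda hit: hit[2])


# Hypothetical toy promoter fragment, NOT a real upstream region.
toy_promoter = "GGACGTGTTTGTCAATTGACC"
for hit in scan_promoter(toy_promoter):
    print(hit)
```

Because such short cores occur frequently by chance, counts from a scan like this are best interpreted against a background expectation rather than taken as evidence of function.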
The miRNA target prediction on the transcript sequences of SAM, ACS, and ACO genes showed that ath-miR843 and osa-miR1858 play a key role in the post-transcriptional regulation of SAM genes in Arabidopsis and rice, respectively. ath-miR843 is involved in the response to low-oxygen (hypoxia) stress [110], and osa-miR1858 is one of the miRNAs related to rice grain development [111]. In addition, the target site of ath-miR159a was found in the transcript sequences of the ACS genes AT2G22810 and AT4G37770. MIR159a is a key microRNA that targets mRNAs encoding MYB proteins that bind to the regulatory site of the floral meristem identity gene LEAFY [112]; ath-miR159a is also involved in hypoxia stress [110]. The post-translational modification predictions showed that ACS proteins were more phosphorylated and glycosylated. Phosphorylation and glycosylation are prevalent post-translational modifications that can alter the target site and activity of a protein [82]. Phosphorylation, one of the most abundant post-translational modifications, plays a major role in plant metabolism and signal transduction by modifying protein interactions, protein activities, or subcellular location [24,86,113,114,115]. The available evidence suggests that ethylene biosynthesis is regulated by phosphorylation events that probably affect ACS protein turnover. Studies using kinase and phosphatase inhibitors in tomato tissues and suspension cell cultures demonstrated that phosphorylation influences the activity and/or turnover of ACS [116]. Thus, ACS phosphorylation appears to protect the protein from degradation, which in turn may lead ACS to accumulate and ACS activity to increase, accounting for the burst of ethylene production in ripening fruit [117]. Notably, the LeACS2 protein of tomato has been found to be phosphorylated in response to wounding [117].
Glycosylation can alter the stability of a protein [118] and its molecular weight [119]. In summary, the ethylene biosynthesis proteins of Arabidopsis appear to be more glycosylated than those of rice. Some studies highlight the possibility of post-translational regulation of ACS [115,118].
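Candidate N-glycosylation sites can be located by searching for the N-X-S/T sequon, where X is any residue except proline. The sketch below is a plain motif scan under that assumption. It is not the NetNGlyc method, which additionally scores the sequence context of each sequon, so it will typically list more candidates than a dedicated predictor would accept. The function name and toy peptide are hypothetical.

```python
import re

# N-X-[S/T] sequon with X != P; the lookahead allows overlapping sequons.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_n_glyc_sequons(protein_seq: str) -> list:
    """Return 1-based positions of the Asn residue of each N-X-[S/T] sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(protein_seq.upper())]


# Hypothetical toy peptide, NOT one of the proteins analysed in the study.
# N2-G-S and N10-K-T are sequons; N6-P-T is excluded because X is proline.
toy_peptide = "MNGSANPTLNKTQ"
print(find_n_glyc_sequons(toy_peptide))  # [2, 10]
```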

Conclusion
Nowadays, computational analysis plays a substantial role in plant science. Appropriate computational approaches coupled with suitable databases are fundamental for detecting, organizing, and integrating data, and for providing novel insights into the genes involved in important pathways and biological systems. Ethylene is a gaseous hormone that controls various physiological pathways. In this study, the genes involved in ethylene biosynthesis in Arabidopsis and rice were evaluated using available bioinformatics tools. The results revealed that the enzymes involved in ethylene biosynthesis varied in physicochemical characteristics, gene expression patterns, protein structure, post-translational modification, and type of cis-regulatory elements. The genes of the ethylene biosynthesis pathway showed higher variation in rice than in Arabidopsis, which suggests that the SAM, ACS, and ACO genes of dicots such as Arabidopsis are probably derived from those of monocots such as rice. All SAM, ACS, and ACO genes are expressed in the studied tissues and organs, but at different levels. SAM genes are more involved in the rice ripening stage, while in Arabidopsis, ACS and ACO genes contribute to maturity. In addition, the expression of the SAM, ACS, and ACO genes of rice in different tissues and organs showed more variation than that of the Arabidopsis genes. Regarding post-translational modification, the ACO proteins were less phosphorylated than the ACS and SAM proteins, and the ethylene biosynthesis proteins of Arabidopsis appear to be more glycosylated than those of rice, which can affect protein activity or subcellular location. Overall, the current study indicates that genes involved in the ethylene biosynthesis pathway play key roles in controlling responses to abiotic and biotic stresses, and that various factors such as PPIs, the type of cis-regulatory elements, and post-transcriptional/translational modifications could affect their expression.
Our study is the first in silico and review study to broadly assess the SAM, ACS, and ACO genes involved in ethylene biosynthesis, and it provides a broad landscape of computational analysis for further dissection and functional characterization of these genes.