- Open Access
Genetic diversity analysis in a mini core collection of Damask rose (Rosa damascena Mill.) germplasm from Iran using URP and SCoT markers
Journal of Genetic Engineering and Biotechnology volume 19, Article number: 144 (2021)
Rosa damascena Mill is a well-known species of the rose family. It is famous for its essential oil content. The aim of the present study was to assess the genetic diversity and population structure of a mini core collection of the Iranian Damask rose germplasm. This involved the use of universal rice primers (URP) and start codon targeted (SCoT) molecular markers.
Fourteen URP and twelve SCoT primers amplified 268 and 216 loci, with an average of 19.21 and 18.18 polymorphic fragments per primer, respectively. The polymorphic information content for URR and SCoT primers ranged from 0.38 to 0.48 and 0.11 to 0.45, with the resolving power ranging from 8.75 to 13.05 and 9.9 to 14.59, respectively. Clustering was based on neighbor-joining (NJ). The mini core collection contained 40 accessions and was divided into three distinct clusters, centered on both markers and on the combination of data.
Cluster analysis and principal coordinate analysis were consistent with genetic relationships derived by STRUCTURE analysis. The findings showed that patterns of grouping did not correlate with geographical origin. Both molecular markers demonstrated that the accessions were not genetically diverse as expected, thereby highlighting the possibility that gene flow occurred between populations.
As a large genus in the Rosaceae family, Rosa has 200 species and covers more than 18,000 cultivars . The Caucasus, Syria, Morocco, and Andalusia are all home to Rosa damascena, while Iran is usually referred to as a source of diversity in this respect . Accordingly, a great variation of Damask rose landraces is expected to be discovered in this country. In addition to horticultural uses, roses are of economic importance because of the essential oils in their petals . Rosa damascena has particular genotypes and cultivars which are noteworthy for their medicinal properties and oil [4,5,6]. Since genetic variation is available within the genus of Rosa, its breeding is usefully dependent on the systematic characterization of genetic resources and the study of likely mechanisms for hybridization. Morphological markers describe an organism’s phenotypic characteristics and are the first to outline an organism’s measurable characteristics. Each species in the Rosa genus has a wide, overlapping range of morphological variations that are affected by environmental factors. Thus, it would be insufficient to classify species and genotypes on morphological ground only . According to Kiani et al. , the most of Iranian Damask roses are tetraploid; however, some other ploidy levels were observed.
For the classification and recognition of rose genotypes, chemotaxonomic studies are often addressed in a large variety of different phenolic structures and isozyme markers [8,9,10,11]. Nonetheless, there is a limited number of regularly resolvable loci, but this can reduce the efficiency of these markers [9, 12]. The molecular approach is more acceptable because it provides easy access to the genetic material (genome) which makes it much easier to recognize plant relationships . Molecular markers can identify genetic polymorphism at the DNA level and can be used in analyzing genetic variation, genetic distance estimate, parentage determination, marker-assisted selection, and gene localization. Many DNA-based molecular markers are available for the purpose of distinguishing biodiversity among plant populations. However, the selection of DNA markers depends on the type of study. Therefore, it is important to compare the various molecular markers and decide which molecular marker is appropriate for the species under study. New innovations have given rise to new molecular markers that can be used in describing genetic characteristics of plants in the Rosa genus. Several molecular assays have been used in recent years to test the genetic variation of various rose plants [14,15,16,17,18,19,20,21,22]. In theory, these molecular approaches, operations, classes, polymorphic count, function, and time requirements are varied.
In the plant genome, the SCoT marker mechanism relies on the start codon (ATG) which has a short conservation around it . These markers can reproduce well within annealing temperatures  and have great potential as a relatively popular tool. SCoT marker system is a simple, low cost, polymorphic, reproducible, and reliable marker system. SCoT markers are known to be useful in a various studies, such as, cultivar recognition, genetic diversity evaluation, DNA fingerprinting, marker assistant selection and quantitative trait loci mapping [25, 26]. This approach has an important context within genetic studies while its benefits are numerous [27,28,29,30].
Kang et al.  used a polymerase chain reaction (PCR) method using universal rice primers (URP) that provide a powerful tool for investigating the DNA diversity of most eukaryotic and prokaryotic genomes, with potential use in taxonomic and phylogenic research, as well as in population genotypic screening of individuals, both at the inter- and intraspecies level. As a matter of long primers and elevated annealing temperatures, URP-PCR has an advantage over randomly amplified polymorphic DNA (RAPD) and arbitrarily primed polymerase chain reaction (AP-PCR) methods. DNA marker performance is evaluated through factors like the marker index (MI) and the polymorphism information content (PIC). Comparing the ability of marker techniques can assist researchers in selecting the required markers in the amplification of genome fragments, thereby being more effective in using these markers for potential breeding studies .
This study aimed to investigate genetic variation in different Rosa damascena accessions from Iran and to demonstrate the effectiveness of different marker systems.
Plant materials and DNA extraction
In total, 40 Damask rose genotypes were collected from five regions in Iran (Fig. 1, Table 1). Sucker roses were harvested from Iran’s rose oil–producing regions. These areas were divided according to geographical and climatological conditions, and each region consisted of some provinces (Fars, Isfahan, East Azerbaijan, Kerman, Semnan, Gilan, Kermanshah, Lorestan, Hormozgan, Tehran, and Markazi provinces). The geographical details are mentioned in Table 1. Accessions have been collected from the gene bank collection of Barij Essence company in Kashan. Young leaves from each accession were collected for DNA extraction since late March to early June. The CTAB procedure  was used, with slight modifications (changing the amount and content of the extraction buffer, the incubation time, and adding polyethylene glycol), to extract total genomic DNA. Electrophoresis was performed on a 1% agarose gel to evaluate the quality of DNA. High-quality genomic DNA samples were considered to be without broken DNA for amplification.
PCR amplification of different markers
The sequence and annealing temperature of all primers for the analysis are given in Table 2. The genomic DNA of all 40 genotypes was amplified with a set of 12 SCoT primers and 14 URP primer sequences . The amplification was done in a Bio-Rad (T100) thermal cycler. Twenty microliters of PCR reaction mixtures consisted of 6.5 μl ddH2O, 10 μl master mix 2XPCR (ready-to-use PCR master mix 2X; Ampliqon), 2 μl isolated DNA per sample (50 ng/μl), and 1.5 μl per primer (10 pmole/ml). Each PCR cycle ran on initial denaturating at 94 °C for 5 min, 35 denaturation cycles at 94 °C for 45 s, with a primer annealing time of 45 s (Table 2). This procedure was applied for each primer. Primer elongation lasted for 90 s at 72 °C. A final extension cycle ran for 10 min (72 °C). In order to detect polymorphism among accessions, the PCR product was transferred to 1.2% agarose gel wells, and then electrophoresis was performed at 90 volts. The gel was then immersed in ethidium bromide solution for 15 min (10 mg/ml). Using the gel documentation method, the illustration of banding patterns was obtained under UV light (Bio-Rad). SCoT and URP primers were used on the gel for the amplified fragments.
The amplified fragments were scored as absent (0) or present (1) in each sample. Screening the primers involved using several discriminatory criteria, including the number of polymorphic bands (NPB), total amplified bands (TAB), percentage of polymorphism bands (PPB), resolving power (Rp), polymorphism information content (PIC), and marker index (MI). PIC was calculated based on the formula given by Anderson et al. .
Molecular variation analysis (AMOVA) operated via GenAlEx ver. 6.5 to classify genetic diversity . For each sample, GenAlEx ver. 6.5 was used for determining the percentage of polymorphic loci (PPL), effective number of alleles (Ne), and total number of alleles (Na) , Nei’s  gene diversity (H), and Shannon’s information index (I) . Then, Jaccard’s method was used for finding genetic dissimilarities by DARwin ver. 6 software . The neighbor-joining (NJ) method contributed to the construction of the Fan-dendrogram using MEGA ver. 10.1 software . The genetic makeup of populations was analyzed by the Bayesian-based model. This was performed by STRUCTURE (ver. 2.3.4) . It estimated the clusters of population genetics (K) and the ratio of individual assignment out of each population. For each ‘K’ varying from 1 to 10, the analysis was repeated ten times, and the initial burn-in period was set to 100,000 followed by 100,000 Markov Chain Monte Carlo (MCMC) iterations. Finally, the DK was calculated by STRUCTURE HARVESTER, an online program .
URP and SCoT polymorphism
In this analysis, the genetic polymorphism of Rosa damascena was tested using 14 URP and 12 SCoT primers. Table 2 gives a description of the informativeness criteria being calculated for URP and SCoT primers. There were 268 amplified fragments among the 40 accessions. These were amplified by all URP primers and turned out to be entirely polymorphic. The polymorphic band count was from 16 (URP-4) to 20 (URP-1, URP-2, URP-3, URP-8, URP-9, URP-10, URP-15, and URP-17), while averaging at 19.21. There was a variation in PIC values from 0.38 (URP-4) to 0.48 (URP-1) with an average of 0.42. The average value of resolving power (Rp) was 13.05 and URP-1 which displayed the highest value (16.35), although its lowest (8.75) belonged to URP-4. The highest value of MI was measured for URP-1 (9.6), while the lowest value was associated with URP-4 (6.08). In SCoT, 12 primers generated 216 loci, all being polymorphic fragments. The total bands per primer varied between 16 (SCoT-5, SCoT-26) and 20 (SCoT-4, SCoT-8, SCoT-12). The PIC values were between 0.11 (SCoT-21) and 0.45 (SCoT-4, SCoT-12), while having an average of 0.37. The Rp value ranged from 9.9 (SCoT-2) to 14.59 (SCoT-4) for the twelve primers, thereby distinguishing between various genotypes. The lowest and highest values of MI occurred in SCoT-21 (1.9), SCoT-12 and SCoT-4 (9), respectively.
Genetic diversity analysis
Variations among and within the Rosa damascena populations were detected by molecular variance analysis (AMOVA) (Table 3). The results of AMOVA showed a higher molecular variation (%) within populations (URP = 96%, SCoT = 90%, combined data = 93%), compared to the variation among populations (URP = 4%, SCoT = 10%, combined data = 7%). The genetic differentiation coefficient (GST)/ gene flow (Nm) equaled 0.117/3.773, 0.185/2.197, and 0.14/2.86, respectively, for URP, SCoT, and the combined data. Genetic diversity per population varied considerably (Table 4). According to URP data, the highest number of alleles (Na) occurred in region I (1.96) and region II (1.94). In region IV and region II, the highest number of effective alleles (Ne = 1.60 and 1.58) were detected, as well as the highest value of Nei’s gene diversity (He = 0.34) and Shannon’s information (I = 0.51). Thus, the highest polymorphic loci (%) (PPL = 97.39%) occurred among the population of region I. According to the SCoT results, the populations of region I and region II had the largest Na (1.97 and 1.92), Ne (1.62 and 1.61), H (0.36 and 0.35), and I (0.53 and 0.52) values, respectively. Such populations also showed high PPL values (98.15 and 94.44%). In addition, region II appeared in the combined data analysis (SCoT + URP) and displayed the highest values for I (0.52), H (0.35), and Ne (1.60). The highest values of these parameters (PPL = 97.73%, Na = 1.96) occurred in the population of region I.
Genetic distances and groping accessions
The Jaccard distance coefficient pairs of accessions were estimated by binary data from URP and SCoT primers. In URP, the genetic distance of a pairwise pattern varied from 0.180 to 0.872 with an average of 0.659 among different pairs of landraces of Rosa damascena Mill. The highest distance coefficient (0.872) was observed between accessions Minab (16) (region II) and Barzok1 (9) (region I), whereas the lowest distance (0.180) was identified between two accessions of region V, Khoramabad (39) and Borujerd (40). According to SCoT data, however, the estimated Jaccard’s distance coefficient ranged from 0.107 to 0.785, with an average of 0.602. The largest distance (0.785) was recorded between accessions Barzok2 (11) (region I) and Tabriz (34) (region IV), while the smallest distance (0.107) was observed among two accessions from region III, Semnan1 (26) and Semnan2 (27). The pairwise genetic distance coefficient was measured based on combined data and revealed a wide spectrum of 0.153–0.789, while averaging at 0.631 among all 40 accessions. The largest distance (0.785) was recorded between accessions Barzok2 (11) (region I) and Tabriz (34) (region IV), while the smallest distance (0.107) was observed among two accessions from region III, Semnan1 (26) and Semnan2 (27). The pairwise genetic distance coefficient was identified among Mashhad ardehal (2) and Barzok1 (9) from Esfahan province (region I), whereas the least distance belonged to the Lorestan province (region V) (Khoramabad and Borujerd) (data not shown).
To examine genetic relationships between genotypes, cluster analysis was used through the Neighbor-joining (NJ) method for the 40 Rosa damascena accessions. A dendrogram was created using SCoT, URP, and combined data (SCoT + URP). The dendrogram classified all accessions into three major clusters (Fig. 2A–C). According to the URP data, the first cluster (AI) comprised 11 accessions. The subpopulation AI was divided into clades springing from Gilan (4 accessions), Kermanshah (3 accessions), Esfahan, and Kerman (2 accessions). The second cluster (AII) consisted of 15 accessions from Esfahan (6 accessions), Semnan (2 accessions), Lorestan (2 accessions), Kerman (2 accessions), Gilan, Kermanshah, and Fars (1 accession). Fourteen accessions were classified in the third cluster (AIII) and comprised 4 accessions from Esfahan, 2 accessions from Fars, along with all accessions of Hormozgan, East Azerbaijan, Tehran, and Markazi (Fig. 2A). In SCoT, the cluster BI mainly consisted of 20 accessions from Esfahan, kerman (3 accessions) and all accessions of Gilan, Kermanshah, Lorestan, Semnan, Gilan, Kermanshah, Esfahan, Kerman, Lorestan, and Semnan. The second cluster consisted of Esfahan (6 accessions), Kerman, and Tehran (1 accession) regions (BII). The third cluster (BIII) consisted of 3 accessions from Esfahan, 1 accession of Tehran, along with all accessions of Markazi, Hormozgan, Fars, and East Azerbaijan (Fig. 2B). According to the clustering pattern which was gathered by combined data, the 40 accessions were categorized into three groups (Fig. 2C). The first cluster consisted of 5, 4, 3, 2, 2, 2, and 1 accessions from Gilan, Kermanshah, Esfahan, Kerman, Lorestan, Semnan, and Fars regions, respectively (CI). The second cluster could be separated into sub-originating regions, mainly from Markazi (2 accessions), Tehran, and Fars (1 accession) (CII). The third cluster (CIII) consisted of 9 accessions from Esfahan; 2 accessions from East Azerbaijan, Hormozgan, Kerman; and 1 accession from Tehran and Fars.
Mantel correlation test showed a low and statistically nonsignificant correlation (r = 0.49) between distances revealed by SCoT and URP data for all 40 accessions across five collected regions.
Principal coordinate analyses (PCoA)
The principal coordinate analysis ultimately assisted in analyzing and depicting the population structure. According to URP (A), SCOT (B), and the combined data (C), the first three principal coordinates explained 36.69, 37.34, and 33.85% of molecular variations, respectively. The PCoA biplots showed that all accessions displayed a scattered distribution in the plot, although this did not follow their origins (Fig. 3A–C). Indeed, the results of cluster analysis supported these observations (Fig. 2).
Population structure analysis
Bayesian clustering was used for determining the population structure of the 40 accessions. The membership proportions varied from K = 1 to K = 10, and with URP primers, probabilities were most precisely derived at K = 3. Out of the 40 accessions, subgroup 1 included 12 accessions from Kermanshah (4), Esfahan (2), Gilan (4), and Kerman (2), as well as subgroup 2 which comprised all accessions from Markazi, Tehran, Hormozgan, East Azerbaijan, as well as some accessions from Esfahan (5), Fars (2), and Kerman (1) populations. Twelve accessions of Semnan (2), Lorestan (2), Esfahan (5), Kerman, Gilan, and Fars (1) populations were classified into subgroup 3 (Fig. 4A). As shown in Fig. 4B, delta K had the largest ad hoc value at K = 3, confirming that the 40 accessions are better divided into three subgroups using SCoT data. Subgroup 1 (17) comprised accessions from Gilan (5), Kermanshah (4), Esfahan (3), Semnan (2), Kerman (2), and Fars (1) populations. Subgroup 2 (15) comprised accessions from the Esfahan (7), Tehran (2), Tabriz (2), Lorestan (2), and Kerman (1) populations; subgroup 3 (8) had accessions from the Esfahan (2), Hormozgan (2), Markazi (2), Fars (1), and Kerman (1) populations. With 26 polymorphic URP and SCoT primers, four different subgroups were achieved (Fig. 4B). Subgroup 1 comprised all accessions from the East Azerbaijan, Tehran, Hormozgan, and Lorestan; 10 accessions from Esfahan; and 2 accessions of Kerman populations, whereas all accessions from the Markazi and one accession from the Fars were grouped into subgroup 2. All accessions from the Semnan were grouped into subgroup 3. Fourteen accessions from Gilan (5), Kermanshah (4), Esfahan (2), Kerman (2), and Fars (1) populations were allocated to subgroup 4.
It is very difficult to evaluate the genetic diversity of R. damascena if only morphological features were to be available as markers. Meanwhile, technological tools for the identification of biodiversity include rapid, reliable procedures to describe genetic relationships and variation among roses. DNA markers are the most common tools in current research trends on rose genetic diversity [43,44,45,46,47].
In this research, the genetic variation of the 40 Rosa damascena accessions was measured using two marker techniques: URP and SCoT. Our results indicated a significant genetic variation within the populations. We compared the effectiveness of URP and SCoT as new gene-based markers for identifying genetic variation among Rosa damascena. By both markers, the proportion of polymorphism turned out to be 100% (Table 2) which was greater than the polymorphic ratios of bands. Given this polymorphic percentage, these markers can serve as a powerful tool in identifying and discriminating between rose genotypes. Henuka et al.  used RAPD markers and reported 98.54% polymorphism. Korkmaz and Dogan  observed 90.1% and 88.8% polymorphisms among twenty-seven Rosa spp. in Turkey, after using ISSR and RAPD markers, respectively. Panwar et al.  also reported 94% genetic polymorphism with ISSR markers. Carvalho et al.  found 93.7% polymorphism among a selection of rose genotypes based on ISSR markers. Jamali et al.  reported 77% polymorphism. These high percentages of polymorphism reflect the heterozygous nature of the polyploid genome structure of rose species. Agarwal et al.  studied genetic diversity in 29 Indian rose germplasms using SCoT marker. Based on their results, a high level of polymorphism was observed among the genotypes, which was in line with our results. The SSR markers also not only revealed a high level of diversity in R. damascena germplasm in Iran, but also showed a high level of variation in Pakistani genotypes .
URP markers showed higher values of TAB, TPB, Rp, PIC, and MI than SCoT markers in terms of marker informativeness indices. Therefore, the markers showed higher values of these indices and suggested that the Iranian Rosa damascena germplasm has a good degree of genetic diversity. The polymorphic information content (PIC) of a parameter represents the amount of polymorphism of a marker, as this can vary from zero to half. The larger the value, the greater the number of alleles and the higher the frequency of polymorphisms for that position in the study population. In the present study, the relatively high PIC and MI values for the URP primers provided an estimation of the discriminating ability of the URP marker systems . They showed better resolution and differentiation. In general, in the present experiment, small differences were observed between markers in terms of indices. Statistics showed that both SCoT and URP methods have similar performance in the occurrence of genetic polymorphisms among the evaluated populations. Also, high levels of polymorphism showed that markers of both methods are useful in studying genetic variation. They are equally effective in distinguishing between Rosa damascena populations with close kinship ratios.
According to the AMOVA, 96% and 90% of genetic variations were revealed by the URP and SCoT markers, respectively, which were partitioned within populations, suggesting that the observed variation within genotypes was higher than among them (Table 3). Interpopulation differentiation (GST) and gene flow (Nm) variables backed up these findings. As a result, the GST values for URP, SCoT, and combined data were 0.117, 0.185, and 0.148, respectively, revealing that genetic variation among populations is relatively low. The indirect estimate of gene flow (Nm) via GST was 3.77 (URP), 2.19 (SCoT), and 2.86 (combined data). The total number of migrants per generation exceeds two. Here, genetic differences may be partly due to gene flow, as the populations of this species are significantly affected by genetic drift. Also, local populations are different if Nm < 1 . High values of Nm occurred in populations, and thus, gene flow prevented drastic genetic differences among gemmates. Population size and the spread of alleles among various regions can add details to this finding . Kiani et al.  studied genetic relationships among 41 R. damascena accessions from Iran using 31 RAPD. The authors reported that the genetic variation within the collected populations was more than the variation among them. Similar results were achieved in the present study; however, the variation within the populations with both markers was higher than the previous study.
Table 4 shows a list of genotypes of genetic diversity indices. Maximum values of indices in relation to genetic diversity (Ne, Na, I, PPL, and He) were reported for region I and region II populations using SCoT and combined data. In the URP marker system, region I had the highest polymorphism percentage (PPL) and the highest number of alleles (Na), whereas precision of genetic diversity was provided by Shannon’s information index and Genetic Diversity Index for populations of regions II and IV. As divergent populations, regions I and II could be selected according to SCoT and combined data, while regions II and IV could be selected according to the URP data. A larger genetic variability here may reflect the population’s frequent allelic variation, while weather conditions can affect ultimate variation among the populations . Furthermore, this finding suggests that these regions could be a strong source of diversity for potential breeding projects which can benefit from new alleles and candidate genes . Also, the highest genetic distance between accessions were based on all marker systems from regions I, II, and IV, as reported in the results of the genetic distance. Therefore, in inbreeding and hybridization systems, these accessions may be used as parents to achieve maximum heterosis if they have desirable traits. According to Pirseyedi et al. , an extreme degree of genetic diversity was observed among 12 Iranian Damask rose genotypes [45, 57, 58]. In contrast, Agaoglu et al.  and Baydar et al.  studied the genetic diversity of R. damascena in Turkey, via RAPD and AFLP techniques. Genetic uniformity existed among R. damascena cultivars.
In the current study, spatial distribution did not align with genetic relationships, based on the neighbor-joining cluster analysis (Fig. 2). For example, using URP analysis, populations from Iran’s north (Gilan province) and west (Kermanshah province) were grouped together in the same subgroup. Also, with SCoT marker analysis, the populations of Minab (sampled from the south (Hormozgan province)) and Tabriz (sampled from the northwest (East Azerbaijan province)) were classified in the same subgroup. Moreover, the combination of URP and SCoT showed a clustering trend that contradicts the spatial distribution of populations. For example, populations from Esfahan (sampled from the central areas of Iran) and Fars (sampled from the south) were clustered together. Because of the country’s diverse climate and the adaptation of damask rose to adverse environmental conditions, it seems that ecotypes of this plant have been moved and relocated by migrating people across the country, especially on foothills where crops usually do not grow. Pirseyedi et al.  noted genetic affinity between the damask rose of Kashan and Kazeroon districts, despite the long distance between them. Baydar et al.  used AFLP and microsatellite markers and found that R. damascena plants in Turkey can be derived from the same original genotype by vegetative propagation. Rusanov et al.  reported that rose plants of Iran and India may have a common origin. Based on the results, the patterns of grouping did not correlate with geographical origin. Similar results were observed in microsatellite analysis of Damask rose accessions from various regions of Iran . Usually, a larger sample is necessary to determine the relationship between molecular data with geographical distance, whether there is isolation of populations due to barriers in gene flow, or whether different climatic conditions lead to differentiation within the species .
In the current research, the PCO analysis confirmed the results of cluster analysis. Genetic proximities were visually depicted by PCO among populations. In URP and SCoT, genetic difference and geographical distance were not clear-cut. Besides, phenotypic traits have high correlations in some occasions, while the first two components justify more than 90% of the changes. Meanwhile, molecular markers could not justify the higher values of variance of the primary variables by several of the main components. In investigating the genetic diversity using molecular data, the markers should have a uniform and appropriate distribution in the genome so that they can be sampled from the entire genome. As shown in Fig. 3, the genotypes were well distributed throughout the environment which can be due to the great variety between genotypes and the suitability of markers and primers used. The mini core collection covered a large amount of the genome and had differentiated values among genotypes in the environment.
The neighbor-joining cluster analysis was confirmed by the Bayesian clustering algorithm through STRUCTURE analysis in comparing the 40 accessions  (Fig. 4). In the combined data system, however, accessions 27 and 28 (Semnan province) were placed in a separate group (Fig. 4C). Without considering predetermined groups, the Bayesian clustering approach used genetic knowledge to assess the population membership of individuals. Focused on multilocus genotypes, they assign members or parts of their genome to several clusters . Using the online structure harvester software and the Evanno method, the best K and the number of subpopulations (∆K) were identified. In both marker systems, the best level of population classification were K = 3 and in the combined data system K = 4. In this clustering, it was found that the different populations of R. damascena can group into one cluster, such as cluster ‘BI’ in the SCoT marker system, in which seven populations were grouped from five regions and different altitudes. Moreover, our results showed that clustering by both markers and combined markers made similar classifications of the populations of Kermanshah (region V) and Gilan (region IV), thereby assigning them to the same subgroup, while Hormozgan (region II) and East Azerbaijan (region IV) were classified together in a subgroup. If genotypes or cultivars gather into one category from different areas, it may mean that they have the same genetic heritage [63, 64]. This may have been due to human transmission of plants or genetic movement and displacement by natural variables . The genetic evidence provided here, as well as the available literature, means that plant dispersal by humans has played a large role in the development of R. damascena populations throughout Iran. It seems that due to its high tolerance to drought, this crop is one of the most suitable species in arid provinces of the country. Due to a decrease in agricultural water resources and rainfall, its cultivation can replace many agricultural products which have high water requirements. In addition, the ability of this plant to adapt well to different climates and soil conditions in Iran has made farmers inclined to introduce it to other regions in the country. While the genetic origin of these plants is the same in different regions, the obvious difference may be attributed to the climate in which they emerge. Inter and intraspecific variation can be affected by temperature and rainfall .
Roses usually cross-pollinate and are self-incompatible which makes them more genetically diverse between and within populations [67,68,69]. Jurgens et al.  investigated the genetic variability of R. canina in Brandenburg (Germany). Fifty-five genotypes were classified into twelve subgroups. They attributed the high genetic variation to the outcrossing, seed dispersal system and polyploidy within the R. canina populations. The level of genetic variation is affected by breeding system, life cycle, seed dispersal, and geographic distribution which are important factors among populations. Rose species are known to be outcrossing, but there is little evidence on their outcrossing frequencies . Contrary to the results of the current research, a study on Rosa canina L. via ISSR markers suggested that geographical distance is effective in causing allelic gaps among genotypes, and ecological conditions could cause genetic variation in R. canina .
The results of model-based clustering was based on the Bayesian statistical index, assuming that the Ancestry model is Admixture type and the allelic frequency model is of continuous type. While also assuming a range of K = 1 to 10 (the number of populations), many populations in the existing germplasm are not completely separated based on the regions from which these genotypes originated or were collected (Fig. 4). The mixing observed in this germplasm confirms the hypothesis that the studied genotypes are of mixed types. That is, plant i may have inherited parts of the genome from offspring in the K population. In fact, the formation of different subgroups in population structures depends on the frequency of allelic differences between the genotypes that make up the population. Most of the genotypes were not attributed completely to subgroups, thereby indicating that many genotypes have intermediate genetic traits of various subgroups, as a matter of genetic variation in this research.
Crop improvement is influenced by information about the degree and distribution of genetic variation, as well as relationships between breeding materials. The results of the present study revealed a high level of polymorphism in the Iranian R. damascene populations by the two marker systems. The mean values of PIC for URP and SCoT markers were 0.42 and 0.37, respectively, indicating the efficiency of the two markers in detecting polymorphism among the studied samples. Also, the results confirmed the efficiency of combined data in estimating the genetic diversity among the populations. The used marker systems showed a comprehensive pattern of the genetic diversity among the Iranian R. damascene populations, which could provide a future insight into Damask rose breeding programs.
Availability of data and materials
All data generated or analyzed during this study are included in this published article.
Universal rice primer
Start codon targeted
Polymorphism information content
Number of observed alleles
Number of effective alleles
Nei’s gene diversity
Number of polymorphic bands
Principal coordinate analysis
Shannon’s information index
Total amplified bands
Percentage of polymorphism
Analysis of molecular variance
Degree of freedom
Sum of squares
Mean of squares
- Est. Var:
Estimated variance components
Gudin S (2000) Rose genetics and breeding. Plant Breed Rev 17:159–189. https://doi.org/10.1002/9780470650134.ch3
Chevallier A (1996) The Encyclopaedia of Medicinal Plants. Dorling Kindersely, London https://ulud.pw/veka-taxir-cugo-xizy-mype.pdf
Kovats E (1987) Composition of essential oil Part 7 Bulgarian oil of rose (Rosa damascene Mill.) J. Chromatogr 406:185–222. https://doi.org/10.1016/s0021-9673(00)94030-5
Mahmoud N, Piacente S, Pizza C, Bueke A, Khan AI, Hay AJ (1996) The anti-HIV activity and mechanisms of action of pure compounds isolated from Rosa damascene. Biochem Biophys Res Commun 229(1):73–79. https://doi.org/10.1006/bbrc.1996.1759
Andogan BC, Baydar H, Kaya S, Demirci M, Ozbasar D, Mumcu E (2002) Antimicrobial activity and chemical composition of some essential oils. Arch Pharm Res 25:860–864. https://doi.org/10.1007/bf02977005
Ozkan G, Sagdic O, Baydar NG, Baydar H (2004) Antioxidant and anti-bacterial activities of R. damascena flower extracts. Food Sci Technol Int 10(4):277–281. https://doi.org/10.1177/1082013204045882
Lewis WH (1957) Revision of the genus Rosa in Eastern North America: a review. American Rose Annual 42:116–126
Kiani M, Zamani Z, Khalighi A, Fatahi R, Byrne DH (2010) Microsatellite analysis of Iranian Damask rose (Rosa damascena Mill.) germplasm. Plant Breed 129(5):551–557. https://doi.org/10.1111/j.1439-0523.2009.01708.x
Kuhns LJ, Fretz TA (1978) Distinguishing rose cultivars by polyacrylamide gel electrophoresis. Isozyme variation among cultivars. J Am Soc Hortic Sci 103:509–516. https://doi.org/10.1023/A:1003052031309
Lee JS, Kim YR (1982) Genetic studies on natural populations of Rosa multiflora Thunb. by isozyme and multivariate analyses (Korean). Hanguk Wonye Hakhoe Chi 23:141–162 https://agris.fao.org/agris-search/search.do?recordID=XB8235263
Walker CA, Werner DJ (1997) Isozyme and randomly amplified polymorphic DNA (RAPD) analyses of Cherokee rose and its putative hybrids ‘Silver Moon’ and ‘Anemone’. J Am Soc Hortic Sci 122:659–664. https://doi.org/10.21273/JASHS.122.5.659
Kim Y (1994) A study of selected species of Rosa using isozyme polymorphisms. M Sc thesis, Texas A and M University, College Station, Texas. DOI:https://doi.org/10.21273/HORTSCI.34.2.341
Williams JGK, Kubelik AR, Livak KJ, Rafalski JA, Tingey SV (1990) DNA polymorphisms amplified by arbitrary primers are useful as genetic markers. Nucleic Acids Res 18(22):6531–6535. https://doi.org/10.1093/nar/18.22.6531
Mirali N, Aziz R, Nabulsi I (2012) Genetic characterization of Rosa damascene species growing in different regions of Syria and its relationship to the quality of the essential oils. Int J Med Arom Plants 2:41–52
Azeem S, Khan AI, Awan FS, Riaz A, Bahadur S (2012) Genetic diversity of rose germplasm in Pakistan characterized by random amplified polymorphic DNA (RAPD) markers. Afr J Biotechnol 11(47):10650–10654. https://doi.org/10.5897/AJB10.1375
Vukosavljev M, Zhang J, Esselink GDWPC, van’t Westende CP, Visser RGF, Arens P, Smulders MJM (2013) Genetic diversity and differentiation in roses: a garden rose perspective. Sci Hortic (Amsterdam) 162:320–332. https://doi.org/10.1016/j.scienta.2013.08.015
Henuka R, Raju D, Janakiram N (2015) Characterization and analysis of genetic diversity among different species of rose (Rosa species) using morphological and molecular markers. Indian J Agric Sci 85(2):240–245 https://www.researchgate.net/publication/272172373_Characterization_and_analysis_of_genetic_diversity_among_different_species_of_rose_Rosa_species_using_morphological_and_molecular_markers
Panwar S, Singh KP, Sonah H, Deshmukh R, Prasad K, Sharma T (2015) Molecular fingerprinting and assessment of genetic diversity in rose (Rosa × hybrida). Indian J Biotechnol 14:518–524 http://nopr.niscair.res.in/handle/123456789/34002
Rai H, Raju D, Kumar A, Janakiram T, Namita S, Krishnan G, Rana J (2015) Characterization and analysis of genetic diversity among different species of the rose using morphological and molecular markers. Indian J Agr Sci 85(2):240–245 https://www.cabdirect.org/cabdirect/abstract/20153150987
Ogras T, Bastanlar EK, Metin ÖK, Kandemir I, Özcelik H (2017) Assessment of genetic diversity of rose genotypes using ISSR markers. Turk J Botany 41(4):347–355. https://doi.org/10.1007/s12298-017-0446-7
Korkmaz M, Dogan NY (2018) Analysis of genetic relationships between wild roses (Rosa L. Spp.) growing in Turkey. Erwerbs-Obstbau 60(4):305–310. https://doi.org/10.1007/s10341-018-0375-9
Agarwal A, Gupta V, Haq SU, Jatav PK, Kothari S, Kachhwaha S (2019) Assessment of genetic diversity in 29 rose germplasms using SCoT marker. J King Saud Univ Sci 31(4):780–788. https://doi.org/10.1016/j.heliyon.2019.e01346
Collard BC, Mackill DJ (2009) Start codon targeted (SCoT) polymorphism: a simple, novel DNA marker technique for generating gene-targeted markers in plants. Plant Mol Biol Report 27(1):86–93. https://doi.org/10.1007/s11105-008-0060-5
Amom T, Nongdam P (2017) The use of molecular marker methods in plants: a review. Int J Curr Res Rev 9(17):1–7. https://doi.org/10.7324/IJCRR.2017.9171
Gorji AM, Poczai P, Polgar Z, Taller J (2011) Efficiency of arbitrarily amplified dominant markers (SCoT, ISSR and RAPD) for diagnostic fingerprinting in tetraploid potato. Am J Potato Res 88(3):226–237. https://doi.org/10.1007/s12230-011-9187-2
Zhang J, Xie W, Wang Y, Zhao X (2015) Potential of start codon targeted (SCoT) markers to estimate genetic diversity and relationships among Chinese Elymus sibiricus accessions. Molecules 20(4):5987–6001. https://doi.org/10.3390/molecules20045987
Etminan A, Pour-Aboughadareh A, Noori A, Ahmadi-Rad A, Shooshtari L, Mahdavian Z, Yousefiazar-Khanian M (2018) Genetic relationships and diversity among wild Salvia accessions revealed by ISSR and SCoT markers. Biotechnol Biotechnol Equip 32(3):610–617. https://doi.org/10.1080/13102818.2018.1447397
Jalilian H, Zarei A, Erfani-Moghadam J (2018) Phylogeny relationship among commercial and wild pear species based on morphological characteristics and SCoT molecular markers. Sci Hortic 235:323–333. https://doi.org/10.1016/j.scienta.2018.03.020
Saidi A, Daneshvar Z, Hajibarat Z (2018) Comparison of genetic variation of anthurium (Anthurium andraeanum) cultivars using SCoT, CDDP and RAPD markers. Plant Tissue Cult Biotechnol 28(2):171–182. https://doi.org/10.3329/ptcb.v28i2.39676
Amom T, Tikendra L, Apana N, Goutam M, Sonia P, Koijam AS, Nongdam P (2020) Efficiency of RAPD, ISSR, iPBS, SCoT and phytochemical markers in the genetic relationship study of five native and economical important bamboos of North-East India. Phytochemistry 174:112330. https://doi.org/10.1016/j.phytochem.2020.112330
Kang H, Kwon S, Go S (2003) PCR-based specific and sensitive detection of Pectobacterium carotovorum ssp. carotovorum by primers generated from a URP-PCR fingerprinting-derived polymorphic band. Plant Pathol 52(2):127–133. https://doi.org/10.1046/j.1365-3059.2003.00822.x
Powell W, Morgante M, Andre C, Hanafey M, Vogel J, Tingey S, Rafalski A (1996) The comparison of RFLP, RAPD, AFLP and SSR (microsatellite) markers for germplasm analysis. Mol Breed 2(3), 225-238. DOI. https://doi.org/10.1007/BF00564200
Doyle JJ, Doyle JL (1990) Isolation of plant DNA from fresh tissue. Focus 12:13–15
Singh AK, Rana M, Singh S, Kumar S, Kumar R, Singh R (2014) CAAT box-derived polymorphism (CBDP): a novel promoter-targeted molecular marker for plants. J Plant Biochem Biotechnol 23(2):175–183. https://doi.org/10.1007/s13562-013-0199-5
Anderson JA, Churchill G, Autrique J, Tanksley S, Sorrells M (1993) Optimizing parental selection for genetic linkage maps. Genome 36(1):181–186. https://doi.org/10.1139/g93-024
Peakall R, Smouse PE (2006) GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes 6(1):288–295. https://doi.org/10.1111/j.1471-8286.2005.01155.x
Kimura M, Crow JF (1964) The number of alleles that can be maintained in a finite population. Genetics 49(4):725. https://doi.org/10.1093/genetics/49.4.725
Nei M (1978) Estimation of average heterozygosity and genetic distance from a small number of individuals. Genetics 89(3):583–590. https://doi.org/10.1093/genetics/89.3.583
Lewontin RC (1972) The apportionment of human diversity. Evolutionary Biology:381–398. https://doi.org/10.1007/978-1-4684-9063-3_14
Jaccard P (1908) Nouvelles recherches sur la distribution florale. Bull Soc Vaud Sci Nat 44:223–270 (in French). https://doi.org/10.5169/seals-268384
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 28(10):2731–2739. https://doi.org/10.1093/molbev/msr121
Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155(2):945–959. https://doi.org/10.1093/genetics/155.2.945
Rusanov K, Kovacheva N, Vosman B, Zhang L, Rajapakse S, Atanassov A, Atanassov I (2005) Microsatellite analysis of Rosa damascena Mill. accessions reveals genetic similarity between genotypes used for rose oil production and old Damask rose varieties. Theor Appl Genet 111(4):804–809. https://doi.org/10.1007/s00122-005-2066-9
Baydar NG, Baydar H, Debener T (2004) Analysis of genetic relationships among Rosa damascena plants grown in Turkey by using AFLP and microsatellite markers. J Biotechnol 111(3):263–267. https://doi.org/10.1016/j.jbiotec.2004.04.014
Kiani M, Zamani Z, Khalighi A, Fatahi R, Byrne DH (2008) Wide genetic diversity of Rosa damascena Mill. germplasm in Iran as revealed by RAPD analysis. Sci Hortic 115(4):386–392. https://doi.org/10.1016/j.scienta.2007.10.013
Koopman WJ, Wissemann V, De Cock K, Van Huylenbroeck J, De Riek J, Sabatino GJ, Maes B (2008) AFLP markers as a tool to reconstruct complex relationships: a case study in Rosa (Rosaceae). Am J Bot 95(3):353–366. https://doi.org/10.3732/ajb.95.3.353
Yan Z, Denneboom C, Hattendorf A, Dolstra O, Debener T, Stam P, Visser P (2005) Construction of an integrated map of rose with AFLP, SSR, PK, RGA, RFLP, SCAR and morphological markers. Theor Appl Genet 110(4):766–777. https://doi.org/10.1007/s00122-004-1903-6
Carvalho A, Lima-Brito J, Macas B, Guedes-Pinto H (2009) Genetic diversity and variation among botanical Portuguese wheat cultivars revealed by ISSR assays. Biochem Genet 47(3-4):276–294. https://doi.org/10.1007/s10528-009-9227-5
Jamali M, Ghanbari A, Estaji A, Torabi Giglou M, Saidi M (2019) Genetic diversity of dog rose (Rosa canina L.) using ISSR markers. Iran J Genet Plant Breed 8(2):1–8. https://doi.org/10.30479/ijgpb.2020.9955.1222
Farooq A, Kiani M, Khan MA, Riaz A, Khan AA, Anderson N, Byrne DH (2013) Microsatellite analysis of Rosa damascena from Pakistan and Iran. Hortic Environ Biotechnol 54(2):141–147. https://doi.org/10.1007/s13580-013-0042-x
Hajmansoor S, Bihamta MR, Alisoltani A (2013) Genetic diversity among and within Iranian and non-Iranian barely (Hordeum vulgare L.) genotypes using SSR and storage proteins markers. Biochem Syst Ecol 46:7–17. https://doi.org/10.1016/j.bse.2012.08.001
Wright S (1951) The genetical structure of populations. Ann Eugen 15(1):323–354. https://doi.org/10.1111/j.1469-1809.1949.tb02451.x
Dumolin-Lapegue S, Demesure B, Fineschi S, Le Corre V, Petit RJ (1997) Phylogeographic structure of white oaks throughout the European continent. Genetics 146(4):1475–1487. https://doi.org/10.1093/genetics/146.4.1475
Ni J-L, Zhu AG, Wang XF, Xu Y, Sun ZM, Chen JH, Luan MB (2018) Genetic diversity and population structure of ramie (Boehmeria nivea L). Ind Crops Prod 115:340–347. https://doi.org/10.1016/j.indcrop.2018.01.038
Gholamian F, Etminan A, Changizi M, Khaghani S, Gomarian M (2019) Assessment of genetic diversity in Triticum urartu Thumanjan ex Gandilyan accessions using start codon targeted polymorphism (SCoT) and CAAT-box derived polymorphism (CBDP) markers. Biotechnol Biotechnol Equip 33(1):1653–1662. https://doi.org/10.1080/13102818.2019.1691466
Pirseyedi SM, Mardi M, Davazdahemami S, Kermani MJ, Mohammadi SA (2005) Analysis of the genetic diversity of 12 Iranian Damask rose (Rosa damascena Mill.) genotypes using amplified fragment length polymorphism markers. Iran J Biotech 3(4):225–230 https://www.sid.ir/en/journal/ViewPaper.aspx?id=45121
Babaei A, Tabaei-Aghdaei SR, Khosh-Khui M, Omidbaigi R, Naghavi MR, Esselink GD, Smulders MJ (2007) Microsatellite analysis of Damask rose (Rosa damascena Mill.) accessions from various regions in Iran reveals multiple genotypes. BMC Plant Biology 7(1):12. https://doi.org/10.1186/1471-2229-7-12
Mirzaei L, Rahmani F, and Beigmohamadi M (2015) Assessment of genetic variation among Rosa species using ISSR genetic marker. J Biodivers Environ Sci 7(3), 254_260.https://www.researchgate.net/publication/282504457_ASSESMENT_OF_GENETIC_VARIATION_AMONG_ROSA_SPECIES_USING_ISSR_GENETIC_MARKERS
Agaoglu Y, Ergül A, Baydar N (2000) Molecular analysis of genetic diversity oil rose (Rosa damascena Mill.) grown Isparta (Turkey) region. Biotechnol Biotechnol Equip 14(2):16–18. https://doi.org/10.1080/13102818.2000.10819080
Goudarzi F, Hemami MR, Rancilhac L, Malekian M, Fakheran S, Elmer KR, Steinfartz S (2019) Geographic separation and genetic differentiation of populations are not coupled with niche differentiation in threatened Kaiser’s spotted newt (Neurergus kaiseri). Scientific Reports 9(1):1–12. https://doi.org/10.1038/s41598-019-41886-8
Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14(8):2611–2620. https://doi.org/10.1111/j.1365-294X.2005.02553.x
Chen C, Durand E, Forbes F, François O (2007) Bayesian clustering algorithms ascertaining spatial population structure: a new computer program and a comparison study. Mol Ecol Notes 7(5):747–756. https://doi.org/10.1111/j.1471-8286.2007.01769.x
Besnard G, Khadari B, Villemur P, Bervillé A (2000) Cytoplasmic male sterility in the olive (Olea europaea L). Theor Appl Genet 100(7):1018–1024. https://doi.org/10.1007/s001220051383
Sarri V, Baldoni L, Porceddu A, Cultrera N, Contento A, Frediani M, Cionini P (2006) Microsatellite markers are powerful tools for discriminating among olive cultivars and assigning them to geographically defined populations. Genome 49(12):1606–1615. https://doi.org/10.1139/g06-126
Percifield RJ, Hawkins JS, McCoy JA, Widrlechner MP, Wendel JF (2007) Genetic diversity in Hypericum and AFLP markers for species-specific identification of H. perforatum L. Planta Med 73(15):1614. https://doi.org/10.1055/s-2007-993749
Nybom H, Carlson-Nilsson U, Werlemark G, Uggla M (1997) Different levels of morphometric variation in three heterogamous dog rose species (Rosa sect. Caninae, Rosaceae). Plant Syst Evol 204(3-4):207–224. https://doi.org/10.1007/BF00989206
Cole P, and Melton B (1986) Self-and cross-compatibility relationships among genotypes and between ploidy of the rose. J Am Soc Hortic Sci 111(1), 122-125.
Ueda Y, Akimoto S (2001) Cross-and self-compatibility in various species of the genus Rosa. J Hortic Sci Biotechnol 76(4):392–395. https://doi.org/10.1080/14620316.2001.11511382
Charlesworth D (2003) Effects of inbreeding on the genetic diversity of populations. Philos Trans R Soc Lond B Biol Sci 358(1434):1051–1070. https://doi.org/10.1098/rstb.2003.1296
Jürgens R, Ball A, Verster A (2009) Interventions to reduce HIV transmission related to injecting drug use in prison. Lancet Infect Dis 9(1):57–66. https://doi.org/10.1016/S1473-3099(08)70305-0
Jürgens A, Seitz B, Kowarik I (2007) Genetic differentiation of Rosa canina (L.) at regional and continental scales. Plant Syst Evol 269(1-2):39–53. https://doi.org/10.1007/s00606-007-0569-3
The authors would like to acknowledge Dr. Alireza Pour-aboughadareh from Seed and Plant Improvement Institute, Iran.
Ethics approval and consent to participate
This article does not contain any studies with human participants or animals performed.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Genotype classification. Classification of genotypes into different groups according to the figure 2 based on SCoT +URP (C) data
Pairwise genetic distance coefficients. Jaccard distance coefficients based on SCoT and URP data
Mantel correlation. The correlation between SCoT and URP data for all 40 accessions across five regions
About this article
Cite this article
Mostafavi, A.S., Omidi, M., Azizinezhad, R. et al. Genetic diversity analysis in a mini core collection of Damask rose (Rosa damascena Mill.) germplasm from Iran using URP and SCoT markers. J Genet Eng Biotechnol 19, 144 (2021). https://doi.org/10.1186/s43141-021-00247-7
- Rosa damascena Mill
- Genetic variation
- Population structure