Skip to main content

In silico designing of a recombinant multi-epitope antigen for leprosy diagnosis



Leprosy is caused by Mycobacterium leprae and Mycobacterium lepromatosis. Most of the affected population lives in low-income countries and may take up to 10 years to show any clinical signs, which is how physicians diagnose it. However, due to progressive cell damage, early diagnosis is very important. The best way to confirm leprosy is through bacilloscopic, which only confirms the diagnosis and has low accuracy or PCR, that requires specialized operators and is expensive. Since the bacteria are fastidious and do not grow in any culture media, therefore, diagnosing leprosy in the lab is still a challenge. In this concern, a recombinant multi-epitope protein can be a beneficial strategy in the management of the diagnosis, as diverse immunogenic epitopes are precisely selected to detect specific antibodies. Therefore, the purposes of the present study were to select immunogenic epitopes from different relevant proteins, with immunogenic properties, and then to construct a recombinant multi-epitope protein that accuses the presence of the antibodies in the early stages of the disease, making it more than appropriate to be applied as a diagnostic tool.


We selected 22 common proteins from both species and, using bioinformatics tools, predicted B and T cell epitopes. After multiple filtering and analyzing, we ended up with 29 epitopes {MHC-I (total 18) and MHC-II (total 11)} from 10 proteins, which were then merged into one construct. Its secondary and tertiary structures were also predicted and refined to comprise the amino acid residues in the best conformation possible. The multi-epitope protein construct was stable, non-host homologous, non-allergic, non-toxic, and elicit humoral and cellular responses. It has conformational B cell epitopes and potential to elicit IFN-γ, IL-4, and IL-10 secretion.


This novel recombinant multi-epitope protein constructed using the common epitopes from M. leprae and M. lepromatosis has a huge immunological potential, is stable, and can be lyophilized to be used in ELISA plates or even in biosensors, which are user-friendly diagnosis tools, facilitating translation into human sample tests.


Only in 2019 the WHO (World Health Organization) reported 202,226 new cases of Leprosy worldwide, with almost 80% of the cases in just 3 countries: India, Brazil, and Indonesia (114,451, 27,863, and 17,439, respectively) [1, 2]. Leprosy is caused by Mycobacterium leprae and Mycobacterium lepromatosis, which can invade Schwann cells [3] affecting both the dermis and peripheral nerves [4]. The cell invasion causes nerve demyelination through nerve cell communication deregulation [5]. The damage to the myelin causes permanent loss of thermal and tactile sensibility, besides pain sensation [4]. Leprosy may take up to 11 years until any clinical manifestation occurs, but even before that, it is transmissible [6, 7].

The immunological response in leprosy is highly dependent on the host’s genetic background, and it drives its clinical manifestations [6, 7]. The leprosy spectrum ranges from tuberculoid leprosy (TL) to lepromatous leprosy (LL). In TL, the immune response has mainly a cellular profile, with a Th1 response, producing cytokines like interferon gamma (IFN-γ), interleukin (IL)-2, and IL-12. On the other hand, the LL pole has a Th2/Th17 response, with more antibody titers; IL-10, IL-4, and IL-13 secretion;, and a higher bacillary load [8, 9]. Between those poles exist borderline tuberculoid (BT), borderline-borderline (BB), and borderline lepromatous (BL) clinical manifestations, with mixed immunologic characteristics, ranging from the Th1 profile to the Th2/Th17 according to the poles [10]. The borderline presentation is the one that most of the patients fit, and the nerve involvement is more severe, causing higher levels of disability [11].

The leprosy diagnosis is mainly based on clinical and laboratorial evaluations. Due to the progressive cell damage, the early diagnosis is very important; however, most diagnoses are performed when there is already a significant nerve damage [12]. The best way to confirm leprosy is through PCR, which requires specialized operators, is expensive, and is very difficult to be conducted in the field. Another option is bacilloscopic, which is not used as a diagnosis, only as confirmation of the clinical diagnosis, and has low accuracy. Serological tests exist only based on M. leprae and are not sensitive enough, detecting only LL and symptomatic cases, but not PB [13,14,15]. The bacteria are fastidious and do not grow in any culture media; therefore, diagnosing leprosy in the lab is still a challenge [8]. Given the different immunological responses, van Hooij et al. established that the combination of humoral and cellular detection is efficient in diagnosing both MB and PB [16, 17].

To try and stop the transmission of new leprosy cases, the WHO set a few objectives to be fulfilled from 2016 to 2020. One of the objectives was the development of a new diagnosis tool [18]. Since most of the affected population lives in lower-income countries, they do not have access or cannot afford the number of tests necessary for all the population at risk [16, 19], and the serum or whole blood-based assays are not conclusive for all types of leprosy [16]. This situation leaves the population with only the possibility of discovering the disease after clinical manifestations, increasing the transmission [20, 21] and making it difficult to finish the cycle.

Bioinformatics tools are of the utmost utility to assess the immunogenic peptides within a protein, saving time, money, and even diminishing the use of animals, since it provides multiple filters before in vitro and in vivo tests are performed [22,23,24,25]. Given that M. leprae and M. lepromatosis are not yet cultivable in any culture media [8], bioinformatics is the best way to assess its proteins and their immunogenic potential. It also provides the possibility to create recombinant multi-epitope constructs, which can hold several antigenic epitopes, differing from natural proteins or whole-cell preparations [26]. Multi-epitope proteins can be used in enzyme-linked immunosorbent assay (ELISA), lateral flow tests, biosensors, and cellular assays, which require minimal or zero sample preparation. These constructs may also increase the sensitivity and specificity of the detection method, given that it is possible to assess homology relations with other microorganisms and permits the fusion of epitopes from different proteins and different sites of the organism. The multi-epitope constructs enable a higher immunogenic density and diminish the cross-reaction risk of whole bacteria antigen, which is common in leprosy diagnosis [27]. Diseases such as hepatitis B [23], Chagas disease [28], cryptococcosis [25], leishmaniosis [29, 30], tuberculosis [31], and toxocariasis [32] already have great results with recombinant multi-epitope proteins in their diagnosis.

The largest challenge of the multi-epitope diagnosis is to construct a chimeric protein with all the epitopes exposed to interact with the antibodies, as the coiling event predicted can be different in practice [33]. Furthermore, despite the advances in genetic engineering and recombinant expression technologies, some obstacles endure in protein production as toxicity, instability, inability to fit the environment, and errors in expression vector selection, among others [34].

There is an effective therapy against leprosy, the multidrug therapy (MDT) program [35]; however, if the diagnosis is not made early in the disease, the nerve damage is unrecoverable, causing persistent physical damage [36]. Hence, to improve the diagnostic tools for this severe disease, we propose a novel recombinant multi-epitope-based antigen, using bioinformatics tools. To be able to diagnose both strains that cause the disease, 22 common proteins of M. leprae and the recently described M. lepromatosis were selected.

The purposes of the present study were to select deeply immunogenic epitope proteins, with immunogenic properties or certified to be detected in diagnosis tests, and then to construct a recombinant multi-epitope protein that could be applied as a diagnostic tool. Advanced techniques in protein structure design and evaluation were performed to build a stable and safe multi-epitope protein. Also, in silico cloning was applied to arrange codon bias to get an idea about the capacity of the protein to be expressed.


The complete workflow of the methodology used in this study is described in Fig. 1.

Fig. 1
figure 1

AE Graphical representation of the pipeline used in this study to design a universal recombinant multi-epitope antigen for leprosy diagnosis

Data selection

The M. lepromatosis and M. leprae proteins ML0091, ML0405, ML1636, ML2055, ML2331, ML2346, and ML1556 were previously proved to detect leprosy at some level [26]. ML2028, ML2055, ML2380, and ML2531 were tested as immunizers in mice, and they demonstrated reduced bacterial burden [4]. NP_301196.1, NP_301663.1, NP_301805.1, NP_301958.1, NP_302056.1, NP_302185.1, NP_302232.1, NP_302292.1, NP_302342.1, NP_302490.1, and NP_302503.1 were obtained as immunogenic proteins from our previous results, through reverse vaccinology analysis [37]. The sequences of these proteins were retrieved from National Center for Biotechnology Information (NCBI) in FASTA format [38]. The antigenicity of these selected proteins was evaluated by VaxiJen [39]. In total, 22 proteins shared among both strains were used for the next steps.

Prediction of epitopes that binds to MHC I alleles

The epitopes able to bind to MHC I alleles and activate cytotoxic T lymphocytes (CTL) were predicted by two different platforms to improve the confidence of the prediction. The Immune Epitope Database and Analysis Resource (IEDB) contain thousands of high- and low-affinity epitopes used in training to enhance the accuracy of the predictor [40, 41]. Aiming to develop a diagnostic tool to be used in all endemic areas, we selected all 27 alleles with high frequency in the global population. The lengths of our peptides were 9 amino acid residues [42]. Default parameters were chosen for the prediction since they combine artificial neural network (ANN), scoring matrix method (SMM), and combinatorial library. Epitopes with percentile rank smaller than 1% were selected for our study, due to their enhanced probability to be immunogenic. NetCTL-1.2 server can assess binding affinity, antigenic processing, and transportation, integrated into the epitope prediction, using both ANN and SMM to make the predictions [43, 44]. The same alleles used in IEDB were used in NetCTL-1.2.

Prediction of epitopes that binds to MHC II alleles

For epitopes that activate helper T lymphocyte (HTL) (MHC II-binding epitopes), we also used two different predictors, IEDB tool [40] and NetMHCII-2.3 server [45]. The MHC II cleft size can accommodate epitopes from 13 to 25 amino acids; thus, we chose to use a 15-residue length as a standard, since the NetMHCII-2.3 server allows users to use this length, approving the comparison between both programs. In IEDB, we selected only epitopes with percentile rank lower than 3%. For the IC50, which is used to determine the epitopes’ affinity with the MHC, we chose an IC50 < 1000 nM [45]. ANN is also used by the NetMHCII-2.3 server with various epitope databases to increase data training and predict the epitopes [46].

Prediction of B cell epitopes

To predict linear B cell epitopes, we used ABCpred [47, 48] which uses ANN for predictions and LBtope server [49] which uses the support vector machine (SVM)-based models for the prediction. We chose the epitope’s length as 16 due to its better accuracy properties [48, 50, 51].

Filtering and immunogenicity assessment of MHC I epitopes

All the epitopes predicted were filtered through an in-house python script which compare the results from both programs for each epitope (Fig. 1A). After the recognition of epitopes predicted by the two programs, the same script was used to find overlapping epitopes between B cells and MHC II with at least nine sequential amino acid residues. The last time that the script was used was to search for the overlap between class I epitopes predicted as immunogenic by the immunogenicity tool and the remaining class I epitopes. Class I immunogenicity tool [52] uses amino acid properties and their position within the peptide to predict immunogenic properties. Only peptides with a score greater than 0.1 were chosen.

Sequence construction

The epitopes that passed through all those filters were then merged into different constructs with the sequence AAY for MHC I epitopes and GPGPG for MHC II as peptide linker sequences, which help in protein folding [53].

Evaluation of host homology and physical–chemical properties

To evaluate the similarity between the constructed protein with human proteins, and therefore reduce autoimmunity possibilities, a BLASTp was carried out. The whole multi-epitope protein sequence and its individual epitopes were submitted against the UniProtKB Human database.

Molecular mass, theoretical pI, extinction coefficient, aliphatic index, grand average of hydropathicity (GRAVY), estimated half-life for three model organisms (Escherichia coli, yeast, and mammal cells), and the instability index were analyzed through the final construct sequence using ProtParam [54]. Solubility index was also assessed by Protein-Sol [55], which evaluates several properties based on E. coli expression data.

Secondary structure prediction

The secondary structure of the final epitope construct was predicted by RaptorX template-based protein structure modeling server [56] and PSIPRED. PSIPRED predicts the secondary structure and generates the pictures by applying complex ANN and position-specific scoring matrix (PSSM) [57].

Structural modeling, refinement, and properties assessment

To predict the tertiary structure (3D), three different programs were used, and the best 3D structure was chosen based on its structural quality. For the evaluation, PROCHECK was used through SAVES v6.0 [58, 59] to generate the Ramachandran plot. Phyre2 intensive method comprises the multiple alignments of the sequence of interest with homologous sequences using threading and ab initio techniques followed by the secondary structure’s prediction with the PSIPRED. Then, a hidden Markov model (HMM) is determined with the information from these two steps combined. The models with the best scores are used, from a search in an HMM database of known protein structures, to determine the modeling and error correction [60]. Multiple-template threading (MTT) and scoring methods are used in RaptorX to predict the 3D structures and to indicate the quality of models predicted [56]. Finally, I-TASSER uses an interactive method based on the templates according to fragment assembly simulations with further refinement to construct the models [57].

To enhance he local and global quality of the modeled 3D structure, we used GalaxyWeb Server which applies the methods for the refinement of amino acid side chains using light and aggressive relaxation approaches [61].

Antigenicity, IFN-γ, IL-4, and Il-10 inducing potential

The final construct sequence was analyzed for crucial aspects related to the induction of immune responses, toxicity, and allergenicity. We used VaxiJen to assess the antigenic capacity through the automatic cross-covariance method, thus analyzing the physical–chemical properties and predicting the ability to induce immune responses without the need to do alignments [39].

The search for epitopes able to induce IFN-γ production was performed with the IFNepitope predictor, using MHC II epitopes. This predictor uses a SVM hybrid method based on motifs to perform the prediction [62]. IL-4 and IL-10 inductions were also assessed by different predictors (IL-4Pred and IL-10Pred), by the same method [63, 64]. The ProInflam web server was used as well to predict the pro-inflammatory potential of the peptides included in the protein [65].

Conformational B cell epitopes prediction

The ElliPro web-based tool was used to predict conformational B cell epitopes from the refined predicted structure of our multi-epitope protein [66]. These epitopes are generally conformational, which means they are away in linear distance but close in spatial proximity [67].

In silico cloning

To verify the capacity of cloning and expression of the multi-epitope protein in an appropriate expression vector, we performed in silico cloning. Using JCat, we adapted the codon of our peptide according to the E. coli K12 expression system’s codon usage through reverse translation. With the cDNA-optimized sequence, the codon optimization for E. coli k12 was performed, and it returned the Codon Adaptation Index (CAI), which must have a score higher than 0.8, and the GC content rate should be between 30 and 70%. Furthermore, to clone the final optimized gene sequence, we used the pET28a( +) vector obtained from the Addgene website (, with Blpi and BamHI restriction sites. Finally, the optimized sequence was inserted into the pET28a( +) vector using the SnapGene tool [68] to ensure protein expression.


Prediction of B, CTL, and HTL epitopes

All the selected proteins had predicted antigenicity, assessed by VaxiJen analysis, showing their capacity to recognize peptides of immunological relevance (Additional file 1: Table S1). The 21 proteins submitted at ABCpred generated a total of 729 epitopes and 2098 in LBtope. Using the in-house python script, we searched for overlapping epitopes that were predicted by both programs, in order to find common epitopes, and it returned 227 B cell shared epitopes (Additional file 1: Table S2). For cytotoxic T lymphocyte (MHC I), we used IEDB MHC-I binding predictions and NetCTL 1.2 server. The first program predicted 2273 epitopes and the latter 1146, with 992 common epitopes predicted by the two software (Additional file 1: Table S3). IEDB MHC-II binding predictions and NetMHCII 2.3 were the tools used for MHC II epitope prediction, with 637 and 3734 epitopes, respectively. There were 586 common epitopes found (Additional file 1: Table S4).

Epitope’s screening

To find epitopes with the potential to induce both humoral and cellular immune responses, we applied the in-house python script, searching for overlaps between MHC II (637) and B (227) epitopes, with similarity of at least nine sequential residues. At this screening step, we reduced the total number of MHC II and B epitopes to 40 overlapping epitopes (Additional file 1: Table S5). By applying the class I immunogenicity tool, we predicted the 350 most immunogenic epitopes, with scores greater than 0.1 (Additional file 1: Tables S5 and S6), using the previous common MHC I epitopes (992). Those 350 epitopes were, then, overlapped with the 40 ones resulting from humoral and cellular overlap (MHC I with B cell epitopes overlapping), giving a total of 20 epitopes (Additional file 1: Table S5) Fig. 1A.

From the 20 selected MHC I epitopes, we compared the sequence and its percentile rank. MHC I epitopes with only one or two residues of difference were excluded, using the lowest percentile as a parameter. As a result, a total of 29 {MHC-I (total 18) and MHC-II (total 11)} epitopes, from 10 proteins, were selected for the final recombinant multi-epitope construction (Table 1).

Table 1 Final list of the selected epitopes

Multi-epitope sequence construction: structural modeling, refinement, and properties assessment

Different amino acid sequences were constructed with the selected epitopes to evaluate and select the one with the best structure quality. To give our construct flexibility to make its conformational changes, we joined these sequences with different peptide linkers since they assist in protein folding, which is important for the conformational epitopes [67]. For MHC I, we used AAY linkers, and for MHC II, we used GPGPG, forming a 431-amino acid multi-epitope protein; however, we made changes in the positions of epitopes. All sequences were submitted to the structural prediction in I-TASSER, Phyre2, and RaptorX. Afterward, a Ramachandran plot was constructed for all the amino acid sequences.

The best structure quality obtained was the one modeled by RaptorX, with 86.2% of the residues in the most favored regions, 9.1% in the additional allowed regions, 2.1% in the generously allowed regions, and 2.6% in the disallowed regions (Fig. 2A, B). We performed the refinement with GalaxyWeb Server, getting the best result with model 5, with 86.6% of the residues in the most favored regions, 12.5% in the additional allowed regions, 0.6% in the generously allowed regions, and 0.6% in the disallowed regions (Fig. 2C, D).

Fig. 2
figure 2

3D structure and the Ramachandran plot of the recombinant protein structure before and after. A Tertiary structure generated by the Phyre2 server. B Ramachandran plot for 3D structure generated by Phyre2 showed 86.2% of the residues in favored regions, 9.1% in allowed regions, and 2.6% in disallowed regions. C Representing the refined tertiary structure obtained by the GalaxyRefine server. D Ramachandran plot for the 3D structure generated by GalaxyRefine showed 86.6%, 12.5%, and 0.3% of the residues in favored, allowed, and disallowed regions, respectively

Secondary structure prediction

According to PSIPRED and RaptorX, the predicted secondary structure has 431-residue protein, 31% helix, 20% beta-sheet, and 47% loop formation (Additional file 1: Fig. S1).

Host homology and physical–chemical properties

Host homology was performed through NCBI BlastP with Homo sapiens (taxid 9606), and no significant similarity was found.

The protein’s molecular mass is 46,779.35 (46.7 kDa), with a theoretical pI of 9.38, which means its behavior is at basic pH. The protein is considered stable since the instability index is 29.99. The aliphatic index is 91.79, also showing stability in changes in temperature. Positive great average of the hydropathy value (GRAVY) scores mean hydrophobicity, as our result is 0.489.

The solubility analysis performed through Protein-Sol described a score of 0.382, which is lower than the tool threshold of 0.45. The threshold corresponds to E. coli solubility, and scores higher than 0.45 has a higher solubility average.

Antigenicity, cytokine-inducing potential, and conformational B cell epitopes

The application of multi-epitope protein in cell-based in vitro platforms depends on its ability to be antigenic and to induce cytokine production. The designed protein had a predicted antigenicity score of 0.5596 through VaxiJen which means it is probably antigenic. The tool IFNepitope predicted two epitopes as probable IFN-γ inducer, and IL-4Pred and ProInflam predicted five and IL-10Pred four epitopes (Table 2). ElliPro predicted six linear and two conformational epitopes with scores greater than 0.7 (Table 3).

Table 2 Predicted IFN-γ inducer, IL-4Pred, and ProInflam-inducing epitopes
Table 3 The B cell conformational epitopes with a PI score greater than 0.7

In silico cloning

The codon adaptation Jcat software analysis showed that the GC content of the constructed sequence is 56.07%, and the CAI (Codon adaptation index) index is 1.0. Both are within the parameter range, which is important to measure the cloning and expression potential. In order to construct the cloning vector, through the SnapGene tool, restriction site sequences of the enzymes BipI and BamHI were inserted in the expression vector pET28a( +) (Addgene) totalizing 6310 base pairs in the complete clone length (Fig. 3).

Fig. 3
figure 3

In silico cloning. The recombinant multi-epitope DNA sequence cloned into the pET28a( +) (Addgene) expression vector, represented in red color. The insert was added between the BipI and BamHI restriction sites


In fact, there is a successful treatment for leprosy. However, the damage caused by the disease, due to the lack of accurate and early diagnosis, causes irreversible damage which highlights the need for highly sensitive detection tools for rapid diagnosis and epidemiological surveillance of the disease [35]. Also, in addition to being a neglected disease, many studies consider only M. leprae as the focus of their studies. Here, we chose to develop a universal chimeric protein that can identify not only M. leprae, but also M. lepromatosis infection, increasing the chance of diagnosis.

For being fastidious bacteria, M. leprae and M. lepromatosis were never cultivated in axenic media [8], which heightens the difficulty to work with them. That said, bioinformatics is an optimistic alternative, as it enables us to run innumerous tests without the bacteria, only with its genome.

ML0091, ML2380, and ML2346 were first described by Cole et al. in 2001 [69]. The first ones are similar to M. tuberculosis Rv3810 and Rv0455c proteins, but, as it was described, no homology was found in our construct. On the other hand, ML2346 has no known homology [13, 59]. As to its function, ML0091 is a 28-kDa antigen precursor, ML2380 is a possible secreted protein, and ML2346 is a hypothetical protein [69].

Duthie et al. tested ML0091 and ML2346 against the patients’ sera from Goiânia, Brazil, having a positive response in 71% and 29% of the cases, respectively [14]. The protein ML2328 was also used in the construction of LepVax (a subunit vaccine against M. leprae) by Duthie et al. [4].

The other proteins used were predicted by our group [37] in a reverse vaccinology approach; NP_301958.1, NP_302056.1, NP_302292.1, and NP_302503.1 are secreted proteins, and NP_302185.1, NP_301196.1, and NP_302342.1 are putative surface-exposed proteins. All these proteins are components of the core genome from four strains of M. leprae and two strains of M. lepromatosis. Since the definition of the core genome is to be present in all strains analyzed, it makes our recombinant multi-epitope protein a good candidate to diagnose the disease caused by any of them.

Given that diagnosis of leprosy is essential for treatment initiation and the earlier it begins, the better the response [54], many are the attempts to create a diagnostic approach that can detect leprosy in all its spectrum [63] and before any clinical sign [54, 63, 64], since infected individuals can spread the disease even before that [16]. Even though PCR is effective [63], leprosy is endemic in areas of difficult access or poverty, which makes its laboratory diagnosis hard [65]. Here, we propose a chimeric protein that has the potential to detect both humoral and cellular responses, which is established as efficient in diagnosing leprosy [10, 66, 67, 70] even in a not-so-controlled environment and with a lower cost than PCR.

Immunoinformatics is an in silico approach that helps in predicting epitopes that have a greater chance to be immunogenic; nevertheless, it may not be accurate if we take into consideration that proteins undergo unique biological complex processes driven by genetics to be presented as an epitope by a cell [20, 22, 71]. Nonetheless, several multi-epitope constructs are already described as being effective, for example, LID-1, a fusion construct of ML0405 and ML2331 that can diagnose MB leprosy 6 to 8 months prior to the onset of clinical symptoms [14]. LepReact, a delayed-type hypersensitivity skin test, made from LID-1, was able to detect antigen-specific immune responses from M. leprae in guinea pigs and armadillos [72]. As to other diseases, Chagas disease detection can be improved using TcF43 and TcF26, proteins derived from the fusion of selected T. cruzi TR proteins [28]; Yin et al. validated a high-accuracy ELISA assay using a recombinant protein for diagnosis of human brucellosis [27]; Ebrahimi et al. developed a multi-epitope protein with potential epitopes for the diagnosis of human toxocariasis [32].

Most of these tests are directed to humoral immunity, which is one of our goals, since it is less expensive and has a good accuracy in LL and MB leprosy, where antibodies are more present [8,9,10]; however, cellular assays may enable the detection of TT and PB leprosy. Sampaio et al. described IFN-γ secretion upon antigen-specific stimulation as an indicator of progression to the tuberculoid pole and IL-4 or IL-5 as an indicator of progression to the lepromatous pole [73].

T cell interferon-gamma release assays (IGRA) were developed as an alternative for delayed-typed hypersensitivity tests for latent tuberculosis diagnosis, reducing false-positive results [74, 75]. Since we found two IFN-γ, five IL-4, four IL-10 inducing epitopes, and five epitopes that induce pro-inflammatory responses within our protein, these properties point to a recombinant multi-epitope protein that can be used in a cytokine production assay, similar to the aforementioned IGRA, being able to detect different cellular immune profiles associated to different clinical manifestations of leprosy. IL-10 together with IL-4 is known to be associated with LL pole and MB leprosy while IFN-γ associated with other proinflammatory cytokines and characterized the TT pole and PB leprosy [73, 76,77,78].

With results in both the humoral and cellular response, this protein will be able to diagnose leprosy without much difficulty due to the detection of both ends of its spectrum. Other studies using immunoinformatics to construct multi-epitope proteins for diagnosis purposes had good results in silico, but they lacked sensitivity and specificity in ex vivo [79] or had strong specificity and weak sensibility [80].


This novel recombinant multi-epitope protein has a huge immunological potential, is stable, and can be lyophilized to be used in ELISA plates or even in biosensors, which are user-friendly diagnosis tools, facilitating translation into human sample tests.

Availability of data and materials

All the required methods and results to reproduce this article are provided as part of the paper and the supplementary material.



Artificial neural network


Borderline tuberculoid




Borderline lepromatous


Codon Adaptation Index


Cytotoxic T lymphocytes


Enzyme-linked immunosorbent assay




Interferon gamma


Half maximal inhibitory concentration


Immune Epitope Database


Interferon-gamma release assays


Lepromatous leprosy


Multidrug therapy


Major histocompatibility complex


Multiple-template threading








Polymerase chain reaction


Position-specific scoring matrix


Scoring matrix method


Tuberculoid leprosy


T helper cells


World Health Organization


  1. WHO (2019) Number of new leprosy cases. Accessed 9 Aug 2020.

  2. Lazo-Porras M, Prutsky GJ, Barrionuevo P et al (2020) World Health Organization (WHO) antibiotic regimen against other regimens for the treatment of leprosy: a systematic review and meta-analysis. BMC Infect Dis 20(1):1–14.

    Article  Google Scholar 

  3. Rambukkana A (1979) Role of α-dystroglycan as a Schwann cell receptor for Mycobacterium leprae. Science 282:2076–2079.

    Article  Google Scholar 

  4. Duthie MS, Pena MT, Ebenezer GJ et al (2018) LepVax, a defined subunit vaccine that provides effective pre-exposure and post-exposure prophylaxis of M. leprae infection. NPJ Vaccines 3:12.

    Article  Google Scholar 

  5. Rambukkana A (2004) Mycobacterium leprae-induced demyelination: a model for early nerve degeneration. Curr Opin Immunol 16:511–518.

    Article  Google Scholar 

  6. Mi Z, Liu H, Zhang F (2020) Advances in the immunology and genetics of leprosy. Front Immunol 11:567.

    Article  Google Scholar 

  7. Nath I, Saini C, Valluri VL (2015) Immunology of leprosy and diagnostic challenges. Clin Dermatol 33:90–98.

    Article  Google Scholar 

  8. Sharma R, Singh P, McCoy RC et al (2020) Isolation of Mycobacterium lepromatosis and development of molecular diagnostic assays to distinguish Mycobacterium leprae and M. lepromatosis. Clin Infect Dis 71:e262–e269.

    Article  Google Scholar 

  9. de Sousa JR, de Sousa RPM, de Souza Aarão TL et al (2016) In situ expression of M2 macrophage subpopulation in leprosy skin lesions. Acta Trop 157:108–114.

    Article  Google Scholar 

  10. Froes LAR, Trindade MAB, Sotto MN (2020) Immunology of leprosy. Int Rev Immunol 1–21.

  11. Talhari C, Talhari S, Penna GO (2015) Clinical aspects of leprosy. Clin Dermatol 33:26–37.

    Article  Google Scholar 

  12. Franco-Paredes C, Rodriguez-Morales AJ (2016) Unsolved matters in leprosy: a descriptive review and call for further research. Ann Clin Microbiol Antimicrob 15:33.

    Article  Google Scholar 

  13. Roset Bahmanyar E, Smith WC, Brennan P et al (2016) Leprosy diagnostic test development as a prerequisite towards elimination: requirements from the user’s perspective. PLoS Negl Trop Dis 10:e0004331.

    Article  Google Scholar 

  14. Duthie MS, Goto W, Ireton GC et al (2007) Use of protein antigens for early serological diagnosis of leprosy. Clin Vaccine Immunol 14:1400–1408.

    Article  Google Scholar 

  15. Kumar A, Parkash O, Girdhar BK (2014) Analysis of antigens of Mycobacterium leprae by interaction to sera IgG, IgM, and IgA response to improve diagnosis of leprosy. Biomed Res Int 2014:1–10.

    Article  Google Scholar 

  16. van Hooij A, TjonKon Fat EM, van den Eeden SJF et al (2017) Field-friendly serological tests for determination of M leprae-specific antibodies. Sci Rep 7:8868.

    Article  Google Scholar 

  17. van Hooij A, Tjon Kon Fat EM, Batista da Silva M et al (2018) Evaluation of immunodiagnostic tests for leprosy in Brazil China and Ethiopia. Sci Rep 8:17920.

    Article  Google Scholar 

  18. Regional Office for South-East Asia, World Health Organization (2016) Global Leprosy Strategy 2016-2020: Accelerating towards a leprosy-free world. WHO Regional Office for South-East Asia.

  19. Britton WJ, Lockwood DNJ (2004) Leprosy. The Lancet 363:1209–1219.

    Article  Google Scholar 

  20. Sampaio LH, Stefani MM, Oliveira RM et al (2011) Immunologically reactive M. leprae antigens with relevance to diagnosis and vaccine development. BMC Infect Dis 11:26.

    Article  Google Scholar 

  21. Duthie MS, Hay MN, Morales CZ et al (2010) Rational design and evaluation of a multiepitope chimeric fusion protein with the potential for leprosy diagnosis. Clin Vaccine Immunol 17:298–303.

    Article  Google Scholar 

  22. Davies M, Flower D (2007) Harnessing bioinformatics to discover new vaccines. Drug Discov Today 12:389–395.

    Article  Google Scholar 

  23. de Souza MQ, Galdino AS, dos Santos JC et al (2013) A recombinant multiepitope protein for hepatitis B diagnosis. Biomed Res Int 2013:1–7.

    Article  Google Scholar 

  24. Acevedo GR, Juiz NA, Ziblat A et al (2020) In silico guided discovery of novel class I and II Trypanosoma cruzi epitopes recognized by T cells from Chagas’ disease patients. J Immunol 204:1571–1581.

    Article  Google Scholar 

  25. de Serpa Brandão RMS, Faria AR, de Andrade HM et al (2018) Novel recombinant multiepitope proteins for the detection of anti-Cryptococcus antibodies. Future Microbiol 13:429–436.

    Article  Google Scholar 

  26. Oliveira TR, Longhi MT, de Morais ZM et al (2008) Evaluation of leptospiral recombinant antigens MPL17 and MPL21 for serological diagnosis of leptospirosis by enzyme-linked immunosorbent assays. Clin Vaccine Immunol 15:1715–1722.

    Article  Google Scholar 

  27. Yin D, Li L, Song X et al (2016) A novel multi-epitope recombined protein for diagnosis of human brucellosis. BMC Infect Dis 16:219.

    Article  Google Scholar 

  28. Duthie MS, Guderian JA, Vallur AC et al (2016) Multi-epitope proteins for improved serological detection of Trypanosoma cruzi infection and Chagas disease. Diagn Microbiol Infect Dis 84:191–196.

    Article  Google Scholar 

  29. Heidari S, Hajjaran H, Kazemi B et al (2021) Identification of immunodominant proteins of Leishmania infantum by immunoproteomics to evaluate a recombinant multi-epitope designed antigen for serodiagnosis of human visceral leishmaniasis. Exp Parasitol 222:108065.

    Article  Google Scholar 

  30. Jameie F, Dalimi A, Pirestani M, Mohebali M (2020) Detection of leishmania infantum infection in reservoir dogs using a multiepitope recombinant protein (PQ10). Arch Razi Inst 75:327–338.

    Article  Google Scholar 

  31. Gutiérrez-Ortega A, Moreno DA, Ferrari SA et al (2021) High-yield production of major T-cell ESAT6-CFP10 fusion antigen of M. tuberculosis complex employing codon-optimized synthetic gene. Int J Biol Macromol 171:82–88.

    Article  Google Scholar 

  32. Ebrahimi M, Seyyedtabaei SJ, Ranjbar MM et al (2020) Designing and modeling of multi-epitope proteins for diagnosis of Toxocara canis infection. Int J Pept Res Ther 26:1371–1380.

    Article  Google Scholar 

  33. Napoleão-Pêgo P, Carneiro FRG, Durans AM et al (2021) Performance assessment of a multi-epitope chimeric antigen for the serological diagnosis of acute Mayaro fever. Sci Rep 11:15374.

    Article  Google Scholar 

  34. Ahmad I, Nawaz N, Darwesh NM et al (2018) Overcoming challenges for amplified expression of recombinant proteins using Escherichia coli. Protein Expr Purif 144:12–18.

    Article  Google Scholar 

  35. Lockwood DNJ (2019) Chronic aspects of leprosy—neglected but important. Trans R Soc Trop Med Hyg 113:813–817.

    Article  Google Scholar 

  36. Soares BA, Teixeira KN, de Santana JF et al (2021) Epitope mapping from Mycobacterium leprae proteins: convergent data from in silico and in vitro approaches for serodiagnosis of leprosy. Mol Immunol 138:48–57.

    Article  Google Scholar 

  37. Jaiswal AKJ, Tiwari S, Jamal SB et al (2021) Reverse vaccinology and subtractive genomics approaches for identifying common therapeutics against Mycobacterium leprae and Mycobacterium lepromatosis. J Venom Anim Toxins Incl Trop Dis.

    Article  Google Scholar 

  38. (2021) Home - Protein - NCBI. Accessed 20 Jun 2021.

  39. Doytchinova IA, Flower DR (2007) VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinformatics 8:4.

    Article  Google Scholar 

  40. Martini S, Nielsen M, Peters B, Sette A (2020) The Immune Epitope Database and Analysis Resource Program 2003–2018: reflections and outlook. Immunogenetics 72:57–76.

    Article  Google Scholar 

  41. Vita R, Mahajan S, Overton JA et al (2019) The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res 47:D339–D343.

    Article  Google Scholar 

  42. Greenbaum J, Sidney J, Chung J et al (2011) Functional classification of class II human leukocyte antigen (HLA) molecules reveals seven different supertypes and a surprising degree of repertoire sharing across supertypes. Immunogenetics 63:325–335.

    Article  Google Scholar 

  43. Larsen MV, Lundegaard C, Lamberth K et al (2005) An integrative approach to CTL epitope prediction: a combined algorithm integrating MHC class I binding, TAP transport efficiency, and proteasomal cleavage predictions. Eur J Immunol 35:2295–2303.

    Article  Google Scholar 

  44. Larsen M, v, Lundegaard C, Lamberth K, et al (2007) Large-scale validation of methods for cytotoxic T-lymphocyte epitope prediction. BMC Bioinformatics 8:424.

    Article  Google Scholar 

  45. Jensen KK, Andreatta M, Marcatili P et al (2018) Improved methods for predicting peptide binding affinity to MHC class II molecules. Immunology 154:394–406.

    Article  Google Scholar 

  46. Nielsen M, Lund O (2009) NN-align. An artificial neural network-based alignment algorithm for MHC class II peptide binding prediction. BMC Bioinformatics 10:296.

    Article  Google Scholar 

  47. Galanis KA, Nastou KC, Papandreou NC, et al (2019) Linear B-cell epitope prediction: a performance review of currently available methods. bioRxiv 833418.

  48. Chen J, Liu H, Yang J, Chou K-C (2007) Prediction of linear B-cell epitopes using amino acid pair antigenicity scale. Amino Acids 33:423–428.

    Article  Google Scholar 

  49. Singh H, Ansari HR, Raghava GPS (2013) Improved method for linear B-cell epitope prediction using antigen’s primary sequence. PLoS ONE 8:e62216.

    Article  Google Scholar 

  50. Saha S, Raghava GPS (2006) Prediction of continuous B-cell epitopes in an antigen using recurrent neural network. Proteins 65:40–48.

    Article  Google Scholar 

  51. EL-Manzalawy Y, Dobbs D, Honavar V (2008) Predicting flexible length linear B-cell epitopes. In: Computational systems bioinformatics. Published by Imperial College Press and distributed by World Scientific Publishing Co., pp 121–132.

  52. Calis JJA, Maybeno M, Greenbaum JA et al (2013) Properties of MHC class I presented peptides that enhance immunogenicity. PLoS Comput Biol 9:e1003266.

    Article  Google Scholar 

  53. Chauhan V, Rungta T, Goyal K, Singh MP (2019) Designing a multi-epitope based vaccine to combat Kaposi Sarcoma utilizing immunoinformatics approach. Sci Rep 9:2517.

    Article  Google Scholar 

  54. Gasteiger E, Hoogland C, Gattiker A et al (2005) Protein identification and analysis tools on the ExPASy server. The Proteomics Protocols Handbook. Humana Press, Totowa, pp 571–607.

    Chapter  Google Scholar 

  55. Hebditch M, Carballo-Amador MA, Charonis S et al (2017) Protein-Sol: a web tool for predicting protein solubility from sequence. Bioinformatics 33:3098–3100.

    Article  Google Scholar 

  56. Källberg M, Wang H, Wang S et al (2012) Template-based protein structure modeling using the RaptorX web server. Nat Protoc 7:1511–1522.

    Article  Google Scholar 

  57. Yang J, Zhang Y (2015) Protein structure and function prediction using I-TASSER. Curr Protoc in Bioinformatics 52:5.8.1-5.8.15.

    Article  Google Scholar 

  58. Bowie JU, Lüthy R, Eisenberg D (1991) A method to identify protein sequences that fold into a known three-dimensional stucture. Science 253:164–170.

    Article  Google Scholar 

  59. Lüthy R, Bowie JU, Eisenberg D (1992) Assessment of protein models with three-dimensional profiles. Nature 356:83–85.

    Article  Google Scholar 

  60. Kelley LA, Mezulis S, Yates CM et al (2015) The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc 10:845–858.

    Article  Google Scholar 

  61. Ko J, Park H, Heo L, Seok C (2012) GalaxyWEB server for protein structure prediction and refinement. Nucleic Acids Res 40:W294–W297.

    Article  Google Scholar 

  62. Dhanda SK, Vir P, Raghava GPS (2013) Designing of interferon-gamma inducing MHC class-II binders. Biol Direct 8.

  63. Dhanda SK, Gupta S, Vir P, Raghava GPS (2013) Prediction of IL4 inducing peptides. Clin Dev Immunol 2013:1–9.

    Article  Google Scholar 

  64. Nagpal G, Usmani SS, Dhanda SK et al (2017) Computer-aided designing of immunosuppressive peptides based on IL-10 inducing potential. Sci Rep 7:42851.

    Article  Google Scholar 

  65. Gupta S, Madhu MK, Sharma AK, Sharma VK (2016) ProInflam: a webserver for the prediction of proinflammatory antigenicity of peptides and proteins. J Transl Med 14:178.

    Article  Google Scholar 

  66. Ponomarenko J, Bui H-H, Li W et al (2008) ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinformatics 9:514.

    Article  Google Scholar 

  67. Ferdous S, Kelm S, Baker TS et al (2019) B-cell epitopes: discontinuity and conformational analysis. Mol Immunol 114:643–650.

    Article  Google Scholar 

  68. Insightful Science (2020) SnapGene® software.

    Google Scholar 

  69. Cole ST, Eiglmeier K, Parkhill J et al (2001) Massive gene decay in the leprosy bacillus. Nature 409:1007–1011.

    Article  Google Scholar 

  70. Chen X, Zaro JL, Shen W-C (2013) Fusion protein linkers: property, design and functionality. Adv Drug Deliv Rev 65:1357–1369.

    Article  Google Scholar 

  71. Mirzapour A, Seyyed Tabaei SJ, Bandehpour M et al (2020) Designing a recombinant multi-epitope antigen of Echinococcus granulosus to diagnose human cystic echinococcosis. Iran J Parasitol 15:1–10.

    Google Scholar 

  72. Duthie MS, Pena MT, Khandhar AP et al (2020) Development of LepReact, a defined skin test for paucibacillary leprosy and low-level M. leprae infection. Appl Microbiol Biotechnol 104:3971–3979.

    Article  Google Scholar 

  73. Sampaio LH, Sousa ALM, Barcelos MC et al (2012) Evaluation of various cytokines elicited during antigen-specific recall as potential risk indicators for the differential development of leprosy. Eur J Clin Microbiol Infect Dis 31:1443–1451.

    Article  Google Scholar 

  74. Mori T, Sakatani M, Yamagishi F et al (2004) Specific detection of tuberculosis infection. Am J Respir Crit Care Med 170:59–64.

    Article  Google Scholar 

  75. Lalvani A, Pareek M (2010) Interferon gamma release assays: principles and practice. Enferm Infecc Microbiol Clin 28:245–252.

    Article  Google Scholar 

  76. Sadhu S, Mitra DK (2018) Emerging concepts of adaptive immunity in leprosy. Front Immunol 9.

  77. Weiss DI, Do TH, De BJ et al (2016) Adaptive immune response in leprosy. International Text Book of Leprosy.

    Google Scholar 

  78. Fonseca AB de L, Simon M do V, Cazzaniga RA et al (2017) The influence of innate and adaptative immune responses on the differential clinical outcomes of leprosy. Infect Dis Poverty 6:5.

    Article  Google Scholar 

  79. Alibakhshi A, Bandehpour M, Sharifnia Z, Kazemi B (2020) The development and evaluation of a multi-epitope antigen as a serodiagnostic marker of Toxoplasma gondii infection. Adv Clin Exp Med 29:669–675.

    Article  Google Scholar 

  80. Yin D, Bai Q, Li L et al (2021) Study on immunogenicity and antigenicity of a novel brucella multiepitope recombined protein. Biochem Biophys Res Commun 540:37–41.

    Article  Google Scholar 

Download references


We acknowledge the collaboration and assistance of all team members; the program Programa Interunidades de Pós-Graduação em Bioinformática, UFMG; Department of Immunology, Microbiology and Parasitology, Institute of Biological Science and Natural Sciences, UFTM; the Brazilian funding agencies CAPES (Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, Brasil); CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico); FAPEMIG (Fundação de Amparo à Pesquisa de Minas Gerais); and CePRim (Centro de Pesquisa em Rim) from UFTM.


No funding supported this research.

Author information

Authors and Affiliations



MRL, TCVR, and ST conceived the study, designed the protocol, collected and analyzed the initial data, and wrote the paper. ST, SCS, and MVS coordinated and led the entire project. ST, AKJ, SCS, MVS, MRL, VA, and LEAS cross-checked and re-analyzed all the data. HSC, VA, VR, SCS, CJFO, and ST edited the article. The authors read and approved the manuscript.

Corresponding authors

Correspondence to Sandeep Tiwari or Marcos Vinicius da Silva.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

List of Mycobacterium leprae and M. lepromatosis proteins selected for the analysis with their antigenicity value predicted through VaxiJen. Table S2. Number of B cell epitopes. Table S3. The total numbers of MHC I epitopes. Table S4. The total numbers of MHC II epitopes. Table S5. Number of epitopes in MHC II and I filtering. Table S6. The Immunogenicity Scores of MHC I predicted epitopes. Fig. S1. PSIPRED secondary structure prediction.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Lemes, M.R., Rodrigues, T.C.V., Jaiswal, A.K. et al. In silico designing of a recombinant multi-epitope antigen for leprosy diagnosis. J Genet Eng Biotechnol 20, 128 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Chimeric protein
  • Diagnosis
  • Hansen’s disease
  • Immunoinformatics
  • Leprosy