The aim of the study was to evaluate the possibility of predicting potential epitope sequences and location in allergenic proteins from food using EVALLER program by comparison with experimental epitopes summarised in the BIOPEP database of allergenic proteins. Sequences of experimental epitopes from food allergens, present in the BIOPEP database of allergenic proteins were used in the study. Sequences of potential epitopes were found using EVALLER program. The Positive Predictive Value (PPV) has been used as a measure of prediction quality. The potential epitopes fully or partially overlapping with the experimental ones were considered as true positive results whereas these unrelated to the experimental ones as false positive results. The PPV for entire dataset containing 310 potential epitopes was 80.6%. The PPV varied signifi cantly among particular allergen families defi ned according to the AllFam database. Caseins revealed PPV=100% (with one exception), proteins from tropomyosin family and proteins from papain-like cystein protease family -exceeding 50%. The last two families possess also relatively low frequency of epitope occurrence. The predictive potential was poor (less than 50%) for plant allergens from cupin superfamily. Families such as lipocalins from milk and EF-hand family (parvalbumins) revealed high variability within family. The EVALLER program may be used as a tool for the prediction of epitope location although its potential varies considerably among allergen families. High PPV is associated with a high number of known experimental epitopes (such as in caseins) and/or a high degree of sequence conservation within family (caseins, tropomyosins). insertion and deletion computed using the FASTA algorithm [Pearson, 1991;2000], and the Smith-Waterman score [Smith & Waterman, 1981]. The algorithm applied to develop the EVALLER program was compared with other algorithms investigating the allergenic character of the examined proteins [Soeria-Atmadja et al., 2006]. In reference to the offi cial bioinformatics criteria recommended by the WHO [Goodman, 2006], the discussed program produces fewer false-positive results, i.e. cases in which a non-allergenic protein is found to be an allergen [Soeria-Atmadja et al., 2006]. According to a suggestion presented by Björklund et al. [2005], peptides characteristic for allergens could overlap with experimental epitopes determined by mapping, i.e. based on interactions between the peptides corresponding to fragments of protein sequences and the antibodies of persons allergic to the analysed protein [Bohle, 2006;Steckelbroeck et al., 2008]. Comparison of the epitope determination using experimental mapping and prediction using EVALLER is presented in Figure 1.
UnauthenticatedThe aim of the present study was to evaluate the possibility of predicting potential epitope sequences and location in allergenic proteins from food using EVALLER program by comparison with experimental epitopes summarised in the BIOPEP database of allergenic proteins. Table 1. All databases...