“…Depending on this percent value, amino acids are then classified as either buried or exposed on a binary basis, or with discrete exposure levels in case of multiple threshold systems [4]. Several different approaches have been proposed to cope with the solvent accessibility problem: Information Theory [5,6], Bayesian Statistics [7], Probability Profiles [8], Neural Networks [4,9,11,12,13,14,15], Linear Regression [16,17], Support Vector Machines [18,19], Support Vector Regression [20], Look-up Tables [21], meta-methods [22] and many others [23]. However, exploiting sequence similarity to known structures, namely sequence homology, proved to be a substantial improvement strategy for all these methods, both for secondary structure and Solvent Accessibility prediction [9,24].…”