Ten years after the idea of hydrophobic cluster analysis (HCA) was conceived and first published, theoretical and practical experience has shown this unconventional method of protein sequence analysis to be particularly efficient and sensitive, especially with families of sequences sharing low levels of sequence identity. This extreme sensitivity has made it possible to predict the functions of genes whose sequence similarities are hardly if at all detectable by current one-dimensional (1D) methods alone, and offers a new way to explore the enormous amount of data generated by genome sequencing. HCA also provides original tools to understand fundamental features of protein stability and folding. Since the last review of HCA published in 1990 [1], significant improvements have been made and several new facets have been addressed. Here we wish to update and summarize this information.
Autosomal recessive hereditary spastic paraplegia (ARHSP) with thin corpus callosum (TCC) is a common and clinically distinct form of familial spastic paraplegia that is linked to the SPG11 locus on chromosome 15 in most affected families. We analyzed 12 ARHSP-TCC families, refined the SPG11 candidate interval and identified ten mutations in a previously unidentified gene expressed ubiquitously in the nervous system but most prominently in the cerebellum, cerebral cortex, hippocampus and pineal gland. The mutations were either nonsense or insertions and deletions leading to a frameshift, suggesting a loss-of-function mechanism. The identification of the function of the gene will provide insight into the mechanisms leading to the degeneration of the corticospinal tract and other brain structures in this frequent form of ARHSP.
The specificity of a homopyrimidine oligonucleotide binding to a homopurine-homopyrimidine sequence on double-stranded DNA was investigated by both molecular modeling and thermal dissociation experiments. The presence of a single mismatched triplet at the center of the triplex was shown to destabilize the triple helix, leading to a lower melting temperature and a less favorable energy of interaction. A terminal mismatch was less destabilizing than a central mismatch. The extent of destabilization was shown to be dependent on the nature of the mismatch. Both single base-pair substitution and deletion in the duplex DNA target were investigated. When a homopurine stretch was interrupted by one thymine, guanine was the least destabilizing base on the third strand. However, G in the third strand did not discriminate between a C.G and an A.T base pair. If the stretch of purines was interrupted by a cytosine, the presence of pyrimidines (C or T) in the third strand yielded a less destabilizing effect than purines. This study shows that oligonucleotides forming triple helices can discriminate between duplex DNA sequences that differ by one base pair. It provides a basis for the choice of antigene oligonucleotide sequences targeted to selected sequences on duplex DNA.
In silico screening methods based on the 3D structures of the ligands or of the proteins have become an essential tool to facilitate the drug discovery process. To achieve such process, the 3D structures of the small chemical compounds have to be generated. In addition, for ligand-based screening computations or hierarchical structure-based screening projects involving a rigid-body docking step, it is necessary to generate multi-conformer 3D models for each input ligand to increase the efficiency of the search. However, most academic or commercial compound collections are delivered in 1D SMILES (simplified molecular input line entry system) format or in 2D SDF (structure data file), highlighting the need for free 1D/2D to 3D structure generators. Frog is an on-line service aimed at generating 3D conformations for drug-like compounds starting from their 1D or 2D descriptions. Given the atomic constitution of the molecules and connectivity information, Frog can identify the different unambiguous isomers corresponding to each compound, and generate single or multiple low-to-medium energy 3D conformations, using an assembly process that does not presently consider ring flexibility. Tests show that Frog is able to generate bioactive conformations close to those observed in crystallographic complexes. Frog can be accessed at http://bioserv.rpbs.jussieu.fr/Frog.html.
RPBS (Ressource Parisienne en Bioinformatique Structurale) is a resource dedicated primarily to structural bioinformatics. It is the result of a joint effort by several teams to set up an interface that offers original and powerful methods in the field. As an illustration, we focus here on three such methods uniquely available at RPBS: AUTOMAT for sequence databank scanning, YAKUSA for structure databank scanning and WLOOP for homology loop modelling. The RPBS server can be accessed at and the specific services at .
The packing geometry of amino acids in folded proteins is analyzed via a modified Voronoï tessellation method which distinguishes bulk and surface. From a statistical analysis of the Voronoï cells over 40 representative proteins, it appears that the packings are in average similar to random packings of hard spheres encountered in condensed matter physics, with a quite strong fivefold local symmetry. Moreover, the statistics permits one to establish a classification of amino acids in terms of increasing propensity to be buried in agreement with what is known from chemical considerations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.