Summary: We report an update for the MAFFT multiple sequence alignment program to enable parallel calculation of large numbers of sequences. The G-INS-1 option of MAFFT was recently reported to have higher accuracy than other methods for large data, but this method has been impractical for most large-scale analyses, due to the requirement of large computational resources. We introduce a scalable variant, G-large-INS-1, which has equivalent accuracy to G-INS-1 and is applicable to 50 000 or more sequences.
Availability and implementation: This feature is available in MAFFT versions 7.355 or later at https://mafft.cbrc.jp/alignment/software/mpi.html.
Supplementary information: Supplementary data are available at Bioinformatics online.
Mitochondria provide numerous essential functions for cells, and their dysfunction leads to a variety of diseases. Thus, obtaining a complete mitochondrial proteome should be a crucial step toward understanding the roles of mitochondria. Many mitochondrial proteins have been identified experimentally, but a complete list is not yet available. To fill this gap, methods to computationally predict mitochondrial proteins from amino acid sequence have been developed and are widely used, but unfortunately, their accuracy is far from perfect. Here we describe MitoFates, an improved prediction method for cleavable N-terminal mitochondrial targeting signals (presequences) and their cleavage sites. MitoFates introduces novel sequence features including positively charged amphiphilicity, presequence motifs, and position weight matrices modeling the presequence cleavage sites. These features are combined with classical ones such as amino acid composition and physico-chemical properties as input to a standard support vector machine classifier. On independent test data, MitoFates attains better performance than existing predictors both in detecting presequences and in predicting their cleavage sites. We used MitoFates to look for undiscovered mitochondrial proteins among 42,217 human proteins (including isoforms such as alternative splicing or translation initiation variants). MitoFates predicts 1167 genes to have at least one isoform with a presequence. Five hundred and eighty of these genes were not annotated as mitochondrial in either UniProt or Gene Ontology. Interestingly, these include candidate regulators of parkin translocation to damaged mitochondria, and also many genes with known disease mutations, suggesting that careful investigation of MitoFates predictions may be helpful in elucidating the role of mitochondria in health and disease. MitoFates is open source with a convenient web server publicly available.
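To make the idea of a position weight matrix (PWM) over cleavage sites concrete, the following is a minimal Python sketch of generic log-odds PWM scoring. It is not the MitoFates implementation: the function names, the uniform background, the pseudocount, the four-residue window, and the toy training windows and sequence are all assumptions invented for illustration.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pwm(windows, pseudocount=1.0):
    """Build a log-odds PWM from equal-length windows.

    Assumes a uniform background over the 20 amino acids (an
    illustrative simplification, not taken from the paper).
    """
    width = len(windows[0])
    background = 1.0 / len(AMINO_ACIDS)
    total = len(windows) + pseudocount * len(AMINO_ACIDS)
    pwm = []
    for pos in range(width):
        counts = Counter(w[pos] for w in windows)
        column = {
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        }
        pwm.append(column)
    return pwm


def score_window(pwm, window):
    """Sum the per-position log-odds scores for one window."""
    return sum(column[aa] for column, aa in zip(pwm, window))


def best_cleavage_site(pwm, sequence, offset):
    """Slide the PWM along the sequence and return (site, score).

    `offset` is the number of window residues that lie before the
    cleavage point, so `site` is the count of residues in the
    predicted presequence.
    """
    width = len(pwm)
    best = None
    for start in range(len(sequence) - width + 1):
        score = score_window(pwm, sequence[start:start + width])
        if best is None or score > best[1]:
            best = (start + offset, score)
    return best


# Hypothetical training windows: two residues on each side of a
# cleavage point, with arginine two positions before it.
TOY_WINDOWS = ["RAYS", "RGFS", "RSYA", "RAFT"]
TOY_PWM = build_pwm(TOY_WINDOWS)

# Hypothetical query sequence, not a real protein.
TOY_SEQUENCE = "MLALLGGRAYSGGLL"
TOY_SITE, TOY_SCORE = best_cleavage_site(TOY_PWM, TOY_SEQUENCE, offset=2)
```

In a predictor of this kind, a score such as `TOY_SCORE` would be one feature among several passed to the classifier; how MitoFates actually builds and combines its matrices is described in the paper itself rather than in this sketch.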
The kinetochore forms a dynamic interface with microtubules from the mitotic spindle during mitosis. The Ndc80 complex acts as the key microtubule-binding complex at kinetochores. However, it is unclear how the Ndc80 complex associates with the inner kinetochore proteins that assemble upon centromeric chromatin. Here, based on a high-resolution structural analysis, we demonstrate that the N-terminal region of vertebrate CENP-T interacts with the 'RWD' domain in the Spc24/25 portion of the Ndc80 complex. Phosphorylation of CENP-T strengthens a cryptic hydrophobic interaction between CENP-T and Spc25, resulting in a phospho-regulated interaction that occurs without direct recognition of the phosphorylated residue. The Ndc80 complex interacts with both CENP-T and the Mis12 complex, but we find that these interactions are mutually exclusive, supporting a model in which two distinct pathways target the Ndc80 complex to kinetochores. Our results provide a model for how the multiple protein complexes at kinetochores associate in a phospho-regulated manner.