Lucian Ilie scite author profile

Abstract. We give two optimal linear-time algorithms for computing the Longest Previous Factor (LPF) array corresponding to a string w. For any position i in w, LPF [i] gives the length of the longest factor of w starting at position i that occurs previously in w. Several properties and applications of LPF are investigated. They include computing the Lempel-Ziv factorization of a string and detecting all repetitions (runs) in a string in linear time independently of the integer alphabet size.

show abstract

HiTEC: accurate error correction in high-throughput sequencing data

Ilie

2010

View full text Add to dashboard Cite

show abstract

DELPHI: accurate deep ensemble model for protein interaction sites prediction

Golding

Ilie

2020

100

View full text Add to dashboard Cite

Motivation Proteins usually perform their functions by interacting with other proteins, which is why accurately predicting protein-protein interaction (PPI) binding sites is a fundamental problem. Experimental methods are slow and expensive. Therefore, great efforts are being made towards increasing the performance of computational methods. Results We propose DELPHI (DEep Learning Prediction of Highly probable protein Interaction sites), a new sequence-based deep learning suite for PPI binding sites prediction. DELPHI has an ensemble structure which combines a CNN and a RNN component with fine tuning technique. Three novel features, HSP, position information, and ProtVec are used in addition to nine existing ones. We comprehensively compare DELPHI to nine state-of-the-art programs on five datasets, and DELPHI outperforms the competing methods in all metrics even though its training dataset shares the least similarities with the testing datasets. In the most important metrics, AUPRC and MCC, it surpasses the second best programs by as much as 18.5% and 27.7%, resp. We also demonstrated that the improvement is essentially due to using the ensemble model and, especially, the three new features. Using DELPHI it is shown that there is a strong correlation with protein-binding residues (PBRs) and sites with strong evolutionary conservation. In addition DELPHI’s predicted PBR sites closely match known data from Pfam. DELPHI is available as open sourced standalone software and web server. Availability The DELPHI web server can be found at www.csd.uwo.ca/~yli922/index.php, with all datasets and results in this study. The trained models, the DELPHI standalone source code, and the feature computation pipeline are freely available at github.com/lucian-ilie/DELPHI. Supplementary information Supplementary data are available at Bioinformatics online.

show abstract

SPRINT: ultrafast protein-protein interaction prediction of the entire human interactome

Ilie

2017

BMC Bioinformatics

View full text Add to dashboard Cite

BackgroundProteins perform their functions usually by interacting with other proteins. Predicting which proteins interact is a fundamental problem. Experimental methods are slow, expensive, and have a high rate of error. Many computational methods have been proposed among which sequence-based ones are very promising. However, so far no such method is able to predict effectively the entire human interactome: they require too much time or memory.ResultsWe present SPRINT (Scoring PRotein INTeractions), a new sequence-based algorithm and tool for predicting protein-protein interactions. We comprehensively compare SPRINT with state-of-the-art programs on seven most reliable human PPI datasets and show that it is more accurate while running orders of magnitude faster and using very little memory.ConclusionSPRINT is the only sequence-based program that can effectively predict the entire human interactome: it requires between 15 and 100 min, depending on the dataset. Our goal is to transform the very challenging problem of predicting the entire human interactome into a routine task.AvailabilityThe source code of SPRINT is freely available from https://github.com/lucian-ilie/SPRINT/ and the datasets and predicted PPIs from www.csd.uwo.ca/faculty/ilie/SPRINT/.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-017-1871-x) contains supplementary material, which is available to authorized users.

show abstract

RACER: Rapid and accurate correction of errors in reads

Ilie

Molnar

2013

View full text Add to dashboard Cite

show abstract

Follow automata

Ilie

Yü

2003

Information and Computation

100

View full text Add to dashboard Cite

Reducing NFAs by invariant equivalences

Ilie

Yü

2003

Theoretical Computer Science

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Lucian Ilie

SHRiMP2: Sensitive yet Practical Short Read Mapping

Computing Longest Previous Factor in linear time and applications

HiTEC: accurate error correction in high-throughput sequencing data

DELPHI: accurate deep ensemble model for protein interaction sites prediction

SPRINT: ultrafast protein-protein interaction prediction of the entire human interactome

RACER: Rapid and accurate correction of errors in reads

Follow automata

Reducing NFAs by invariant equivalences

Contact Info

Product

Resources

About