Alex Rudniy scite author profile

Alex Rudniy

5 Publications

35 Citation Statements Received

50 Citation Statements Given

Affiliations

University of Scranton, New Jersey Institute of Technology, Educational Testing Service

Publications

Mapping biological entities using the longest approximately common prefix method

2014

Background: The significant growth in the volume of electronic biomedical data in recent decades has pointed to the need for approximate string matching algorithms that can expedite tasks such as named entity recognition, duplicate detection, terminology integration, and spelling correction. The task of source integration in the Unified Medical Language System (UMLS) requires considerable expert effort despite the presence of various computational tools. This problem warrants the search for a new method for approximate string matching and its UMLS-based evaluation.

Results: This paper introduces the Longest Approximately Common Prefix (LACP) method as an algorithm for approximate string matching that runs in linear time. We compare the LACP method for performance, precision and speed to nine other well-known string matching algorithms. As test data, we use two multiple-source samples from the Unified Medical Language System (UMLS) and two SNOMED Clinical Terms-based samples. In addition, we present a spell checker based on the LACP method.

Conclusions: The Longest Approximately Common Prefix method completes its string similarity evaluations in less time than all nine string similarity methods used for comparison. The Longest Approximately Common Prefix outperforms these nine approximate string matching methods in its Maximum F1 measure when evaluated on three out of the four datasets, and in its average precision on two of the four datasets.
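The abstract does not spell out how LACP works, so the following is only a loose illustration of the general idea it names: a prefix-based similarity that tolerates a few mismatches and finishes in a single linear pass. The function name, the `max_mismatches` parameter, and the scoring rule are all assumptions made for illustration and are not taken from the paper.

```python
def approx_common_prefix_similarity(a: str, b: str, max_mismatches: int = 1) -> float:
    """Hypothetical prefix similarity; NOT the published LACP algorithm.

    Scans both strings left to right once, counting positions where the
    characters agree, and stops as soon as the number of disagreeing
    positions exceeds max_mismatches. The score is the number of agreeing
    positions divided by the length of the longer string.
    """
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical

    matched = 0
    mismatches = 0
    for char_a, char_b in zip(a, b):
        if char_a == char_b:
            matched += 1
        else:
            mismatches += 1
            if mismatches > max_mismatches:
                break  # mismatch budget exhausted: the approximate prefix ends here
    return matched / longest


if __name__ == "__main__":
    for left, right in [("insulin", "insulin"), ("receptor", "receptors"), ("abc", "xyz")]:
        print(left, right, approx_common_prefix_similarity(left, right))
```

Because the loop visits each character position at most once, the running time grows linearly with the length of the shorter string, which is the property the abstract highlights.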

Detecting duplicate biological entities using Shortest Path Edit Distance

Rudniy

Song

Geller

2010

IJDMB

Duplicate entity detection in biological data is an important research task. In this paper, we propose a novel, context-sensitive Shortest Path Edit Distance (SPED) that extends and supplements our previous work on Markov Random Field-based Edit Distance (MRFED). SPED transforms the edit distance computation into the calculation of the shortest path between two selected vertices of a graph. We produce several modifications of SPED by applying Levenshtein, arithmetic mean, histogram difference and TFIDF techniques to solve subtasks. We compare SPED performance to other well-known distance algorithms for biological entity matching. The experimental results show that SPED produces competitive outcomes.
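The idea of treating edit distance as a shortest-path problem can be sketched with the classic Levenshtein edit graph. This is not SPED itself: the abstract does not describe SPED's context-sensitive edge weights, so the sketch assumes plain unit costs, and the function name is hypothetical. Each vertex (i, j) means "the first i characters of a have been turned into the first j characters of b", and Dijkstra's algorithm finds the cheapest route from (0, 0) to (len(a), len(b)).

```python
import heapq


def edit_distance_shortest_path(a: str, b: str) -> int:
    """Levenshtein distance computed as a shortest path on the edit graph.

    Illustrative only: unit edit costs are an assumption, not SPED's weights.
    """
    source, target = (0, 0), (len(a), len(b))
    best = {source: 0}
    heap = [(0, source)]

    while heap:
        cost, (i, j) = heapq.heappop(heap)
        if (i, j) == target:
            return cost
        if cost > best.get((i, j), float("inf")):
            continue  # stale heap entry; a cheaper route was already found

        edges = []
        if i < len(a):
            edges.append(((i + 1, j), 1))  # delete a[i]
        if j < len(b):
            edges.append(((i, j + 1), 1))  # insert b[j]
        if i < len(a) and j < len(b):
            # diagonal edge: free when characters match, otherwise a substitution
            edges.append(((i + 1, j + 1), 0 if a[i] == b[j] else 1))

        for node, weight in edges:
            new_cost = cost + weight
            if new_cost < best.get(node, float("inf")):
                best[node] = new_cost
                heapq.heappush(heap, (new_cost, node))

    return best[target]


if __name__ == "__main__":
    print(edit_distance_shortest_path("kitten", "sitting"))
```

Replacing the constant edge weights with values that depend on the surrounding characters is one way such a graph formulation could become context-sensitive, though how SPED actually assigns its weights is not stated here.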

De-Identification of Laboratory Reports in STEM

Rudniy

2018

Detecting Duplicate Biological Entities Using Markov Random Field-Based Edit Distance

Song

Rudniy

2008

Detecting duplicate biological entities using Markov random field-based edit distance

Song

Rudniy

2009

Knowl Inf Syst
