Sandro Luck scite author profile

Sandro Luck

4 Publications

22 Citation Statements Received

96 Citation Statements Given


Publications


Towards BERT-based Automatic ICD Coding: Limitations and Opportunities

Pascual¹, Luck², Wattenhofer³

2021


Automatic ICD coding is the task of assigning codes from the International Classification of Diseases (ICD) to medical notes. These codes describe the state of the patient and have multiple applications, e.g., computer-assisted diagnosis or epidemiological studies. ICD coding is a challenging task due to the complexity and length of medical notes. Unlike the general trend in language processing, no transformer model has been reported to reach high performance on this task. Here, we investigate in detail ICD coding using PubMedBERT, a state-of-the-art transformer model for biomedical language understanding. We find that the difficulty of fine-tuning the model on long pieces of text is the main limitation for BERT-based models on ICD coding. We run extensive experiments and show that despite the gap with current state-of-the-art, pretrained transformers can reach competitive performance using relatively small portions of text. We point at better methods to aggregate information from long texts as the main need for improving BERT-based ICD coding.


Towards BERT-based Automatic ICD Coding: Limitations and Opportunities

Pascual¹, Luck², Wattenhofer³

2021

Preprint


Loss Aversion in Recommender Systems: Utilizing Negative User Preference to Improve Recommendation Quality

Paudel¹, Luck², Bernstein³

2018

Preprint


Medley2K: A Dataset of Medley Transitions

Faber¹, Luck², Pascual³, et al.

2020

Preprint


The automatic generation of medleys, i.e., musical pieces formed by different songs concatenated via smooth transitions, is not well studied in the current literature. To facilitate research on this topic, we make available a dataset called Medley2K that consists of 2,000 medleys and 7,712 labeled transitions. Our dataset features a rich variety of song transitions across different music genres. We provide a detailed description of this dataset and validate it by training a state-of-the-art generative model in the task of generating transitions between songs.



