The use of knowledge graphs as a data source for machine learning methods to solve complex problems in the life sciences has grown rapidly in recent years. Our Biological Insights Knowledge Graph (BIKG) combines data relevant to drug development from public as well as internal sources to provide insights for a range of tasks, from identifying new targets to repurposing existing drugs. Besides the requirements common to organisational knowledge graphs, such as capturing the domain precisely and allowing users to search and query the data, the focus on handling multiple use cases and supporting use-case-specific machine learning models presents additional challenges: the data models must be streamlined for the performance of downstream tasks; graph content must be easily customisable for different use cases; and different projections of the graph content are required to support a range of consumption modes. In this paper we describe the main design choices in the implementation of BIKG and discuss the different aspects of its life cycle, from graph construction to exploitation.
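As a rough illustration of what "different projections of the graph content" can mean in practice, the sketch below restricts a toy heterogeneous graph to a use-case-specific edge list. The node types, relation names and the use of networkx are illustrative assumptions for this sketch, not the actual BIKG schema or tooling.

```python
# Hypothetical sketch: projecting a heterogeneous knowledge graph onto a
# use-case-specific subgraph for a downstream model. Node/edge labels
# ("gene", "compound", "inhibits", ...) are placeholders, not BIKG's schema.
import networkx as nx

# A small heterogeneous graph: nodes carry a type, edges a relation.
kg = nx.MultiDiGraph()
kg.add_node("EGFR", ntype="gene")
kg.add_node("gefitinib", ntype="compound")
kg.add_node("NSCLC", ntype="disease")
kg.add_edge("gefitinib", "EGFR", relation="inhibits")
kg.add_edge("EGFR", "NSCLC", relation="associated_with")

def project(graph, node_types, relations):
    """Return the subgraph restricted to the given node types and relations."""
    sub = nx.MultiDiGraph()
    for u, v, data in graph.edges(data=True):
        if (graph.nodes[u]["ntype"] in node_types
                and graph.nodes[v]["ntype"] in node_types
                and data["relation"] in relations):
            sub.add_edge(u, v, **data)
    return sub

# A compound-target projection for a target-identification use case.
ct = project(kg, {"gene", "compound"}, {"inhibits"})
print(list(ct.edges(data=True)))
# [('gefitinib', 'EGFR', {'relation': 'inhibits'})]
```

Different consumption modes (query interfaces, embedding pipelines, link-prediction models) would each receive their own projection of this kind rather than the full graph.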
Background: Masked language modelling approaches have improved benchmark performance across many general- and biomedical-domain natural language processing tasks, including biomedical relationship extraction (RE). However, the recent surge in both the number of novel architectures and the volume of training data they use raises the question of whether domain-specific pretrained models are still necessary. Additionally, recent work has proposed novel classification heads for RE tasks that further improve performance. Here, we perform ablations over several pretrained models and classification heads to untangle the perceived benefits of each. Methods: We use a range of string preprocessing strategies, combined with the Bidirectional Encoder Representations from Transformers (BERT), BioBERT and RoBERTa architectures, to perform ablations over three RE datasets covering drug-drug interactions, chemical-protein interactions and general-domain relationship extraction. Across all architectures and datasets, we compare the RBERT classification head with a simple linear classification layer. Results: We observe a moderate performance benefit from the BioBERT pretrained model over the BERT base cased model, although there appears to be little difference between BioBERT and RoBERTa-large. In addition, we observe a substantial benefit from the RBERT head on the general-domain RE dataset, but this is not consistently reflected in the biomedical RE datasets. Finally, we find that randomising the token order of the training data does not cause catastrophic performance degradation on our selected tasks. Conclusions: A recent general-domain pretrained model performs approximately as well as a biomedical-specific one, suggesting that domain-specific models may be of limited use given the tendency of recent pretraining regimes to incorporate ever broader sets of data. In addition, we suggest that care must be taken in RE model training to prevent models fitting to non-syntactic features of the datasets.
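For concreteness, the sketch below shows what the "simple linear classification layer" baseline can look like on top of a pretrained encoder, assuming the Hugging Face transformers library. The checkpoint name, the "@DRUG$" entity-marker string and the label count are illustrative assumptions rather than the exact experimental setup used in the ablations.

```python
# Minimal sketch of a linear relation-classification head on a pretrained
# encoder. Checkpoint, markers and label count are illustrative only.
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "bert-base-cased"  # swap for a BioBERT or RoBERTa checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
encoder = AutoModel.from_pretrained(MODEL_NAME)

class LinearREHead(nn.Module):
    """Relation classifier: [CLS] token representation -> linear layer."""
    def __init__(self, encoder, num_labels):
        super().__init__()
        self.encoder = encoder
        self.classifier = nn.Linear(encoder.config.hidden_size, num_labels)

    def forward(self, input_ids, attention_mask):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        cls = out.last_hidden_state[:, 0]  # representation of [CLS]
        return self.classifier(cls)

model = LinearREHead(encoder, num_labels=5)  # e.g. 5 interaction classes
batch = tokenizer(
    ["@DRUG$ increases the serum concentration of @DRUG$."],
    return_tensors="pt", padding=True, truncation=True,
)
logits = model(batch["input_ids"], batch["attention_mask"])
print(logits.shape)  # torch.Size([1, 5])
```

The RBERT head differs from this baseline mainly in that it additionally pools the hidden states over each marked entity span and concatenates them with the [CLS] representation before classification.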
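Similarly, the token-order randomisation ablation can be approximated by shuffling whitespace tokens before encoding, as in this minimal sketch (the example sentence and seed are illustrative):

```python
# Sketch of the token-order randomisation ablation: shuffle the tokens of
# each training sentence before encoding, keeping the label unchanged.
import random

def shuffle_tokens(sentence, seed=None):
    """Return the sentence with its whitespace tokens in random order."""
    rng = random.Random(seed)
    tokens = sentence.split()
    rng.shuffle(tokens)
    return " ".join(tokens)

print(shuffle_tokens("@DRUG$ increases the serum concentration of @DRUG$.", seed=0))
```

If a model trained on such shuffled inputs still performs well, it is plausibly relying on non-syntactic features (such as entity markers and lexical co-occurrence) rather than on sentence structure.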
scite is a Brooklyn-based organization that helps researchers discover and understand research articles through Smart Citations: citations that show the context in which an article is cited and indicate whether it provides supporting or contrasting evidence. scite is used by students and researchers around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.