2021
DOI: 10.1007/s00521-021-06614-2
|View full text |Cite
|
Sign up to set email alerts
|

Using weak supervision to generate training datasets from social media data: a proof of concept to identify drug mentions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 24 publications
0
5
0
Order By: Relevance
“…Leveraging weak supervision has demonstrated potential in the realm of social media mining [3,28,29]. A dataset acquired without manual curation using a weak labelling heuristic detailed in the respective section is termed as 'bronze-standard dataset'.…”
Section: Bronze Standard Datasetmentioning
confidence: 99%
“…Leveraging weak supervision has demonstrated potential in the realm of social media mining [3,28,29]. A dataset acquired without manual curation using a weak labelling heuristic detailed in the respective section is termed as 'bronze-standard dataset'.…”
Section: Bronze Standard Datasetmentioning
confidence: 99%
“…We retrained the models adding the biocreative validation dataset and finally obtained the predictions on the test data. We filtered all the positive predictions and extracted the spans of the medication term using a medication dictionary ( 47 ). The SMMT_NER utility from the Social Media Mining Toolkit ( 48 ) was utilized for identifying the spans of the medication.…”
Section: Systemsmentioning
confidence: 99%
“…Obtaining a considerable amount of high-quality label data costs a lot. Therefore, the strong dependence on labelled data hinders the application of the deep learning model, which is the bottleneck of supervised learning [24]. In natural language processing, it is difficult to obtain high-quality labelled texts.…”
Section: Related Workmentioning
confidence: 99%