2021 IEEE 24th International Conference on Computer Supported Cooperative Work in Design (CSCWD) 2021
DOI: 10.1109/cscwd49262.2021.9437769
|View full text |Cite
|
Sign up to set email alerts
|

AuthCrowd: Author Name Disambiguation and Entity Matching using Crowdsourcing

Abstract: Despite decades of research and development in named entity resolution, dealing with name ambiguity is still a challenging issue for many bibliometric-enhanced information retrieval (IR) tasks. As new bibliographic datasets are created as a result of the upward growth of publication records worldwide, more problems arise when considering the effects of errors resulting from missing data fields, duplicate entities, misspellings, extra characters, etc. As these concerns tend to be of large-scale, both the genera… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 17 publications
0
0
0
Order By: Relevance
“…For instance, Ferreira et al introduced "AuthCrowd," a crowdsourcing system designed to tackle author name disambiguation and entity matching by decomposing tasks for crowd workers. Experimental results on a real-world dataset of publicly available papers published in peer-reviewed venues demonstrate the potential of this approach to improve author name disambiguation [4].…”
Section: Introductionmentioning
confidence: 93%
“…For instance, Ferreira et al introduced "AuthCrowd," a crowdsourcing system designed to tackle author name disambiguation and entity matching by decomposing tasks for crowd workers. Experimental results on a real-world dataset of publicly available papers published in peer-reviewed venues demonstrate the potential of this approach to improve author name disambiguation [4].…”
Section: Introductionmentioning
confidence: 93%
“…The core of the system lies in a Bi-LSTM classification model, a type of recurrent neural network (RNN) adept at handling sequential data like text.This model analyzes the pre-processed text, identifying linguistic features and stylistic choices often associated with deceptive content. By continuously learning from user-provided feedback on test sample labels [10], the system refines its classification ability, becoming progressively adept at distinguishing between truthful and deceptive statements. This user-centric approach fosters a collaborative environment, empowering users to contribute to a more reliable and trustworthy digital information landscape.…”
Section: Proposed Modelmentioning
confidence: 99%