2022 IEEE International Conference on Big Data (Big Data) 2022
DOI: 10.1109/bigdata55660.2022.10020918

Patentopia: A multi-stage patent extraction platform with disambiguation for certain semantic challenges

Abstract: Bibliographic name disambiguation is a major semantic challenge, and it is critical to social science studies of important intellectual assets. Here we contribute to innovation research in several ways. We show a significant synonym problem in author names and discuss how a pre-processing heuristic step that standardizes name variants helps. Homonyms generated by Chinese names, however, are particularly difficult to resolve and manifest in an associated location list. Here we identify a new phenomenon of "onomastic profus…
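
The abstract mentions a pre-processing heuristic that standardizes name variants but does not describe it. As a rough illustration only, the sketch below shows one common way such a step can collapse synonyms (spelling variants of the same name) into a single comparison key. The function name `standardize_name` and every rule in it (accent folding, "Last, First" reordering, punctuation stripping) are assumptions made for this example, not the method used in Patentopia.

```python
import re
import unicodedata


def standardize_name(raw: str) -> str:
    """Fold one name variant into a comparison key (hypothetical helper)."""
    # Decompose accented characters, then drop the combining marks.
    decomposed = unicodedata.normalize("NFKD", raw)
    folded = "".join(ch for ch in decomposed if not unicodedata.combining(ch))

    # Assumption: a comma means "Last, First"; reorder to "First Last".
    if "," in folded:
        last, _, first = folded.partition(",")
        folded = f"{first} {last}"

    # Lowercase, turn punctuation into spaces, collapse repeated whitespace.
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", folded.lower())
    return " ".join(cleaned.split())


variants = ["Müller, J.", "J. Muller", "  J   MULLER "]
keys = {standardize_name(v) for v in variants}
```

All three variants collapse to one key, which is the synonym case. A key of this kind does nothing for homonyms: two different inventors who share the same name still map to the same key, so that problem needs additional evidence such as the location list the abstract refers to.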

Cited by 5 publications (4 citation statements)
References 35 publications
“…We see eponymy in only one percent of our sample (estimated thirty observations). However, a significant clustering problem tied to A&P Technology was identified [15]; it was incorrectly associated with several firms, including G&H Technology, M&A Technology, and R&H Technology. It is not clear if these letters are chosen eponymously.…”
Section: Discussion and Future Research (mentioning)
confidence: 99%
“…This equilibration is the origin of the previously observed onomastic profusion. It generates a new set of homonyms that can confound clustering or manual (supervised) disambiguation processes [15], similarly to the prevalence of certain names among Chinese authors [10].…”
Section: Names and Marketing Strategy (mentioning)
confidence: 99%
“…PatentsView is an open data platform that makes USPTO patent information available. The site's API (Application Programming Interface) tool allows direct access to this information through languages such as Python and R. These research instruments support the application of methodologies established in patentometrics (Comins and Leydesdorff, 2018; Wang et al., 2019; Bianchi, Galaso, and Palomeque, 2020; Toole, Jones, and Madhavan, 2021) and, moreover, make it possible to formulate new methods and processes for investigating inventions (Belz et al., 2022; Binette et al., 2023; Lampe, 2023).…”
Section: Introduction (unclassified)