Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining 2020
DOI: 10.1145/3394486.3406468

Multi-modal Information Extraction from Text, Semi-structured, and Tabular Data on the Web

Cited by 10 publications
(6 citation statements)
References 6 publications
“…The current limitations and open challenges in the TU research were discussed in the recent surveys (Roldán et al, 2020; Zhang and Balog, 2020; Bonfitto et al, 2021) and tutorials (Burdick et al, 2020; Dong et al, 2020; Pujara et al, 2021). This overview continues the discussion by pointing out how the existing statements of the TU tasks ignore the complexity and diversity of real‐world tables.…”
Section: Current Limitations and Future Directions (mentioning)
confidence: 99%
“…The early proposals were reviewed by Zanibbi et al (2004), Embley et al (2006), and e Silva et al (2006). Many new advancements have been considered in the recent surveys (Zhang and Balog, 2020; Roldán et al, 2020; Bonfitto et al, 2021) and tutorials (Burdick et al, 2020; Dong et al, 2020; Pujara et al, 2021).…”
Section: Introduction (mentioning)
confidence: 99%
“…It is not surprising then that they are considered one of the most significant information sources in the current Web [4]. The information in typical HTML tables can be leveraged in many application domains [2, 4–8], e.g., knowledge management, information retrieval, web mining, summarisation, knowledge base construction, or question answering. Researchers commonly refer to the tables that are used to display information as relational tables, in contrast to nonrelational tables that are used to position other elements on the screen.…”
Section: Introduction (mentioning)
confidence: 99%
“…The article also uses entity recognition or data tags to identify useful coordinates for the proposed dataset, as shown in Figure 1. Although studies address Web data extraction for different problems, the difficulties are similar, such as identifying data relevant to the domain under investigation [Dong et al 2020]. The article proposes an alternative for creating a dataset through enrichment with georeferenced data files.…”
Section: Introduction (unclassified)