Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries 2022
DOI: 10.1145/3529372.3533295
|View full text |Cite
|
Sign up to set email alerts
|

Vision and natural language for metadata extraction from scientific PDF documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…Beyond transfer learning, models that move beyond translating object detection to document layout analysis tasks are those that include the text data as training features [12,14,20]. Often these models are "multi-modal" in that they draw from the fields of machine learning methods for image classification and segmentation and the processing of text with natural language processing or similar techniques [31].…”
Section: Unless the Answer Is Better Models?mentioning
confidence: 99%
“…Beyond transfer learning, models that move beyond translating object detection to document layout analysis tasks are those that include the text data as training features [12,14,20]. Often these models are "multi-modal" in that they draw from the fields of machine learning methods for image classification and segmentation and the processing of text with natural language processing or similar techniques [31].…”
Section: Unless the Answer Is Better Models?mentioning
confidence: 99%
“…For instance, in the case of document corpora, Natural Language Processing (NLP) techniques can be employed to extract titles and descriptions. Specifically, automatic metadata extraction techniques such as those in [5,28] can be utilized to extract metadata from each document, such as Publication Date, Author, Language, etc. This metadata can then be used to derive the metadata for the entire collection, such as Publication Range, Authors, Languages, etc.…”
Section: Automatic Metadata Extractionmentioning
confidence: 99%
“…This transformative capability has culminated in the creation of intelligent chatbots capable of learning from human interactions and providing responses that exhibit an exceptional level of subtlety [3]. More specifically, a critical aspect of LLMs is their ability to extract information from complex sources such as technical manuals, establishing them as advanced knowledge dissemination tools [12,13].…”
Section: Introductionmentioning
confidence: 99%