2023
DOI: 10.1587/transinf.2022edp7083
|View full text |Cite
|
Sign up to set email alerts
|

Auxiliary Loss for BERT-Based Paragraph Segmentation

Abstract: Paragraph segmentation is a text segmentation task. Iikura et al. achieved excellent results on paragraph segmentation by introducing focal loss to Bidirectional Encoder Representations from Transformers.In this study, we investigated paragraph segmentation on Daily News and Novel datasets. Based on the approach proposed by Iikura et al., we used auxiliary loss to train the model to improve paragraph segmentation performance. Consequently, the average F1-score obtained by the approach of Iikura et al. was 0.67… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 28 publications
0
1
0
Order By: Relevance
“…The input in BERT includes two special tokens, [CLS] and [SEP]: [CLS] signifies the beginning of the input text, while [SEP] marks the junction between two input segments. Considering the study on the impact of BERT input format on text segmentation results in [ 31 ], we add [SEP] tokens between each dialogue turns before inputting into BERT. As a result, the input of BERT is changed to the following format: …”
Section: Proposed Frameworkmentioning
confidence: 99%
“…The input in BERT includes two special tokens, [CLS] and [SEP]: [CLS] signifies the beginning of the input text, while [SEP] marks the junction between two input segments. Considering the study on the impact of BERT input format on text segmentation results in [ 31 ], we add [SEP] tokens between each dialogue turns before inputting into BERT. As a result, the input of BERT is changed to the following format: …”
Section: Proposed Frameworkmentioning
confidence: 99%