2022
DOI: 10.3390/fi15010015

BART-IT: An Efficient Sequence-to-Sequence Model for Italian Text Summarization

Abstract: The emergence of attention-based architectures has led to significant improvements in the performance of neural sequence-to-sequence models for text summarization. Although these models have proved to be effective in summarizing English-written documents, their portability to other languages is limited thus leaving plenty of room for improvement. In this paper, we present BART-IT, a sequence-to-sequence model, based on the BART architecture that is specifically tailored to the Italian language. The model is pr…
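As an illustration of how a BART-style Italian checkpoint such as BART-IT can be applied to abstractive summarization, the following is a minimal sketch using the Hugging Face transformers library. The checkpoint identifier morenolq/bart-it is an assumption (substitute the identifier actually released by the authors), and the generation hyperparameters are illustrative rather than the paper's settings.

```python
# Minimal sketch: abstractive summarization with a BART-style Italian checkpoint.
# MODEL_ID is an assumed Hub identifier; replace it with the official BART-IT
# checkpoint name if it differs.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

MODEL_ID = "morenolq/bart-it"  # assumed identifier

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

document = "Testo dell'articolo italiano da riassumere ..."  # placeholder input

# Encode the document, truncating to the model's maximum input length.
inputs = tokenizer(document, return_tensors="pt", truncation=True, max_length=1024)

# Generate the summary with beam search; hyperparameters are illustrative only.
summary_ids = model.generate(
    **inputs,
    num_beams=4,
    max_length=128,
    early_stopping=True,
)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```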

Cited by 21 publications (8 citation statements)
References 36 publications (57 reference statements)
“…In summary, text summarization has developed significantly, starting from the use of important features such as frequency, word count, and word similarity [19,33]. The abstractive approach can remove words that are considered unimportant, while the extractive approach builds summaries from distinct phrases in the input data [34]. With the Transformer-based Text-to-Text Transfer Transformer (T5) model, many text summaries are produced with an abstractive approach, as done by Patwardhan et al. [35], Cheng and Yu [36], and Mars [37], which proceeds through several stages such as tokenization, formation of input-output data, pre-training and fine-tuning, encoder-decoder transformation, and text generation.…”
Section: Related Work
confidence: 99%
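The T5-based pipeline described in the statement above (tokenization, input-output formation, pre-training and fine-tuning, encoder-decoder transformation, and text generation) can be sketched at inference time as follows. This is a minimal illustration using the generic public t5-small checkpoint, not the fine-tuned models of the cited works; the input text and hyperparameters are placeholders.

```python
# Minimal sketch of the T5 summarization steps mentioned above
# (tokenization -> encoder-decoder model -> text generation).
# "t5-small" is used only as a generic public checkpoint for illustration.
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

text = "Long input document to be summarized ..."  # placeholder input

# T5 frames every task as text-to-text; summarization uses a task prefix.
inputs = tokenizer("summarize: " + text, return_tensors="pt",
                   truncation=True, max_length=512)

# Decoding step: the decoder generates the summary token by token.
ids = model.generate(**inputs, num_beams=4, max_length=64)
print(tokenizer.decode(ids[0], skip_special_tokens=True))
```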
“…deep transformers that apply hyperparameter combinations of the T5 model. Another study by La Quatra and Cagliero [34] used the BERT model to produce text summaries. Text summarization faces many challenges, one of which is the structure of the text data; currently, the available text data are unstructured or semi-structured.…”
Section: Related Work
confidence: 99%
“…In this study, we combine the models used for summarization, namely the Transformer-based T5 model and the convolutional Seq2Seq model. In several related studies, such as that conducted by Fendji (Fendji et al., 2021), the T5 model was applied to summarize French Wikipedia documents; the T5 model has several advantages in generating text, so many researchers use it for summarization, as did Chouikhi (Chouikhi & Alsuhaibani, 2022), Jung (Jung et al., 2022), and Quatra (La Quatra & Cagliero, 2022). Many studies on abstractive summarization face challenges such as unstructured text, which must be transformed with text preprocessing techniques, as done by Widyassari (Widyassari et al., 2022) and Christian (Christian et al., 2016), who use text preprocessing for document summarization.…”
Section: Literature Review
confidence: 99%
“…To fully address these issues, it is therefore crucial to enhance the model architecture specifically for Chinese. Inspired by the mBART model's summary generation and the BART-IT [9] model's success in Italian summarization, our team has optimized the mBART model for summarizing Chinese short news texts (fewer than 500 Chinese characters), aiming to develop a robust model for this specific domain.…”
Section: Introduction
confidence: 99%
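As a rough illustration of the kind of mBART adaptation mentioned in the statement above, the sketch below shows how a multilingual mBART checkpoint can be pointed at Chinese input with the appropriate language codes. The facebook/mbart-large-50 checkpoint is used purely as a generic example and is not the cited authors' fine-tuned model; the input text and generation settings are placeholders.

```python
# Minimal sketch, assuming a generic multilingual mBART checkpoint;
# the cited work fine-tunes its own Chinese summarization model, which
# is not reproduced here.
from transformers import MBart50TokenizerFast, MBartForConditionalGeneration

tokenizer = MBart50TokenizerFast.from_pretrained(
    "facebook/mbart-large-50", src_lang="zh_CN", tgt_lang="zh_CN"
)
model = MBartForConditionalGeneration.from_pretrained("facebook/mbart-large-50")

news = "此处为一篇不超过500字的中文短新闻……"  # placeholder short-news input

inputs = tokenizer(news, return_tensors="pt", truncation=True, max_length=512)

# Force the decoder to start with the Chinese language token so the
# summary is generated in the same language as the source.
summary_ids = model.generate(
    **inputs,
    forced_bos_token_id=tokenizer.lang_code_to_id["zh_CN"],
    num_beams=4,
    max_length=64,
)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```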