2022
DOI: 10.4018/ijghpc.301579
|View full text |Cite
|
Sign up to set email alerts
|

A Big Data Pipeline and Machine Learning for Uniform Semantic Representation of Data and Documents From IT Systems of the Italian Ministry of Justice

Abstract: In this paper a Big Data Pipeline is presented, taking in consideration both structured and unstructured data made available by the Italian Ministry of Justice, regarding their Telematic Civil Process. Indeed, the complexity and volume of the data provided by the Ministry requires the application of Big Data analysis techniques, in concert with Machine and Deep Learning frameworks, to be correctly analysed and to obtain meaningful information that could support the Ministry itself in better managing Civil Proc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 12 publications
(1 citation statement)
references
References 17 publications
0
1
0
Order By: Relevance
“…Indeed, the reduced number of available samples in the original dataset, together with its complexity deriving from the considerable number of features describing each sample, required the application of different data preparation techniques, in order to avoid or at least reduce overfitting and imbalance problems. Similar issues have been addressed in other works, related to different dominions, such as in Di Martino et al (2021). In particular, since predicting the progression of the MS disease through a single indicator, namely the Expanded disability Status Scale (EDSS) described in Sect.…”
Section: Introductionmentioning
confidence: 73%
“…Indeed, the reduced number of available samples in the original dataset, together with its complexity deriving from the considerable number of features describing each sample, required the application of different data preparation techniques, in order to avoid or at least reduce overfitting and imbalance problems. Similar issues have been addressed in other works, related to different dominions, such as in Di Martino et al (2021). In particular, since predicting the progression of the MS disease through a single indicator, namely the Expanded disability Status Scale (EDSS) described in Sect.…”
Section: Introductionmentioning
confidence: 73%