2024
DOI: 10.3390/math12040506
|View full text |Cite
|
Sign up to set email alerts
|

Finite State Automata on Multi-Word Units for Efficient Text-Mining

Alberto Postiglione

Abstract: Text mining is crucial for analyzing unstructured and semi-structured textual documents. This paper introduces a fast and precise text mining method based on a finite automaton to extract knowledge domains. Unlike simple words, multi-word units (such as credit card) are emphasized for their efficiency in identifying specific semantic areas due to their predominantly monosemic nature, their limited number and their distinctiveness. The method focuses on identifying multi-word units within terminological ontolog… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 76 publications
0
1
0
Order By: Relevance
“…To efficiently analyze log files emitted by industrial machines and predict potential future malfunctions, we utilize AUTOMETA, a text mining system introduced in prior works ( [65,66]). AUTOMETA employs finite automata and a coherent formalization of natural language, focusing on the universal concept of units of meaning.…”
Section: Scientific Foundations Of the Researchmentioning
confidence: 99%
“…To efficiently analyze log files emitted by industrial machines and predict potential future malfunctions, we utilize AUTOMETA, a text mining system introduced in prior works ( [65,66]). AUTOMETA employs finite automata and a coherent formalization of natural language, focusing on the universal concept of units of meaning.…”
Section: Scientific Foundations Of the Researchmentioning
confidence: 99%