2021
DOI: 10.1109/access.2021.3118093

Inferring Multilingual Domain-Specific Word Embeddings From Large Document Corpora

Acknowledgments: The research leading to these results has been partly funded by the SmartData@PoliTO center for Big Data and Machine Learning technologies. Computational resources were provided by HPC@POLITO, a project of Academic Computing within the Department of Control and Computer Engineering at the Politecnico di Torino (http://www.hpc.polito.it).

Cited by 5 publications (3 citation statements); references 34 publications.
“…Various methods for sentence selection in both open and closed domains are discussed in Table 1, taking into account the number of categories considered and the objective of constructing the benchmark dataset. [29] (open domain): builds three benchmarks by considering rating numbers in AMAZON, extracting sentence pairs from Japanese datasets, and randomly picking from Wikipedia, yielding a benchmark in the Japanese language that accounts for cultural/social parameters, without using translation methods, in order to evaluate NLU ability in the general domain. [66] (closed domain): in three domains (medicine, technology, and finance), they pick the sentences in Wikipedia based on the defined categories…”
Section: State of the Art (mentioning)
confidence: 99%
“…• Recall: recall assesses the model's capacity to identify all relevant instances and is calculated as the ratio of true positives to the sum of true positives and false negatives. • F1-score: the F1-score, a measure that balances precision and recall, calculated as the harmonic mean of the two values, is used as the evaluation metric in much research related to benchmark performance comparison [66,68,78]…”
(mentioning)
confidence: 99%
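The recall and F1-score definitions quoted in the excerpt above can be sketched directly from their stated formulas. This is a minimal illustration in terms of true-positive, false-positive, and false-negative counts; the function names and example counts are illustrative, not taken from the cited papers.

```python
def recall(tp: int, fn: int) -> float:
    # Recall = TP / (TP + FN): share of relevant instances the model identifies.
    return tp / (tp + fn)

def precision(tp: int, fp: int) -> float:
    # Precision = TP / (TP + FP): share of predicted positives that are correct.
    return tp / (tp + fp)

def f1_score(tp: int, fp: int, fn: int) -> float:
    # F1 = harmonic mean of precision and recall, balancing the two.
    p = precision(tp, fp)
    r = recall(tp, fn)
    return 2 * p * r / (p + r)

# Hypothetical counts: 8 true positives, 2 false positives, 2 false negatives.
print(recall(8, 2))       # 0.8
print(f1_score(8, 2, 2))  # 0.8
```

Because precision and recall are equal in this example (both 0.8), their harmonic mean coincides with them; in general, F1 is pulled toward the lower of the two values.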
“…Nonetheless, researchers who tackle language-oriented tasks (e.g., Natural Language Processing) have also started to explore, adapt and even propose multilingual methods or offer multilanguage support, e.g., [Cagliero and Quatra 2021, Guarasci et al 2022, Krótkiewicz et al 2016, Pessutto et al 2020]. Regarding the Portuguese language, Morais et al [2020] classify a set of current news from Brazilian news portals into fake, satirical, objective and legitimate news.…”
Section: Related Work (mentioning)
confidence: 99%