Presently, a very large number of public and private data sets are available from local governments. In most cases, they are not semantically interoperable and a huge human effort would be needed to create integrated ontologies and knowledge base for smart city. Smart City ontology is not yet standardized, and a lot of research work is needed to identify models that can easily support the data reconciliation, the management of the complexity, to allow the data reasoning. In this paper, a system for data ingestion and reconciliation of smart cities related aspects as road graph, services available on the roads, traffic sensors etc., is proposed. The system allows managing a big data volume of data coming from a variety of sources considering both static and dynamic data. These data are mapped to a smart-city ontology, called KM4City (Knowledge Model for City), and stored into an RDF-Store where they are available for applications via SPARQL queries to provide new services to the users via specific applications of public administration and enterprises. The paper presents the process adopted to produce the ontology and the big data architecture for the knowledge base feeding on the basis of open and private data, and the mechanisms adopted for the data verification, reconciliation and validation. Some examples about the possible usage of the coherent big data knowledge base produced are also offered and are accessible from the RDF-store and related services. The article also presented the work performed about reconciliation algorithms and their comparative assessment and selection.
The aim of this paper is to describe the work done to exploit the LBC database for the purpose of translation analysis as a resource to edit the bilingual lexical sections of our dictionaries of cultural heritage (in nine languages). This database, made up of nine corresponding corpora, contains texts whose subject is cultural heritage, ranging from technical texts on art history to books on art appreciation, such as tour guides, and travel books highlighting Italian art and culture. We will illustrate the different questions with the SketchEngine LBC French corpus, made up at the moment of 3,000,000 words. Our particular interest here is in research that not only orients lexical choices for translators but that also precedes the selection of bilingual quotations (from our Italian/French parallel corpus) and that we rely on for editing an optional element of the file called "translation notes." We will rely on this as much for works on "universals of translation" already described by Baker (Corpus linguistics and translation studies. Implications and applications. In Baker M et al (eds) Text and technology. Benjamins, Amsterdam/Philadelphia, pp 233-250 (1993)) as for studies aimed at improving translation quality assessment (TQA). We will show how a targeted consultation of different corpora and subcorpora that the database allows us to distinguish ("natural language" vs "translation," "technical texts" vs "popularization texts" or "literary texts") can help us identify approximations or translation errors, so as to build quality comparative lexicographical information.Riccardo Billero wrote parts 1 and 2 of this article: Parts 3, 4, and 5 were written by Annick Farina. The IT support for the project (the database and the statistical results) was produced by Riccardo Billero.
As a result of the wealth of artistic expression that Italy has produced over the centuries, the Italian cultural heritage lexicon has become a crucial object of interest for scholars of varied disciplines. However, while monolingual art dictionaries are currently available, there are no multilingual tools that offer the same level of quality and comprehensiveness. The present work moves in that direction. In particular, the aim of this paper is to describe the work done to date in designing and implementing the LBC database, a resource for constructing a multilingual art dictionary in nine languages (Chinese, French, English, German, Italian, Portuguese, Russian, Spanish and Turkish). This database, made up of nine corresponding corpora, will contain texts whose subject is cultural heritage, ranging from technical texts on art history to books on art appreciation, such as tour guides, and, lastly, travel books highlighting Italian art and culture. Below is a summary of the decisions taken during the work process and the challenges that lie ahead for its future development.
Nowadays, lexicographical studies require an interaction with Digital Humanities. This volume presents the genesis and structure of the LBC database, a digital work support tool developed by the Multilanguage Cultural Heritage Lexicon Research Unit under the aegis of the University of Florence, which allows to carry out text research in six different digital corpora (French, English, Italian, Russian, Spanish, German). The authors illustrate the specificities of each corpus in terms of the chosen sources and propose lexicographical and translational uses.
Il lessico relativo al patrimonio culturale italiano è oramai divenuto un ambito di interesse per molte discipline; a partire da tale considerazione, il progetto Lessico multilingue dei Beni Culturali (LBC) si pone l'obiettivo di analizzare e studiare la terminologia del mondo dell'arte in varie lingue, focalizzandosi in particolare sulla Toscana. Nell'ambito di tale lavoro di ricerca sono state realizzate varie banche dati contenenti testi letterari e tecnici con informazioni di rilievo sui beni culturali toscani al fine di diffonderne la conoscenza, mediante la pubblicazione online di corpora nelle lingue coinvolte nel progetto. Tra i risultati del progetto è da notare il corpus "Español LBC", il quale dispone ad oggi di oltre un milione di parole, l'origine delle quali fa sì che il corpus sia caratterizzato da una alta frequenza di terminologia relativa all'arte, costituendo per questo un punto di riferimento per coloro che vogliono occuparsi di lessico spagnolo nell'ambito dei beni culturali. In particolare, lo scopo di questo paper è quello di descrivere il lavoro svolto nell'ambito della progettazione ed implementazione del corpus, e di analizzarne l'utilizzo per lo studio della terminologia spagnola dell'arte. Nell'ambito della terminologia spagnola dell'arte, come esempio delle possibilità di uso del corpus, verranno mostrate alcune tra le più significative unità pluriverbali, assenti nel dizionario utilizzato come riferimento.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.