Proceedings of the Second ACM International Conference on Web Search and Data Mining 2009
DOI: 10.1145/1498759.1498813

Information arbitrage across multi-lingual Wikipedia

Abstract: The rapid globalization of Wikipedia is generating a parallel, multi-lingual corpus of unprecedented scale. Pages for the same topic in many different languages emerge both as a result of manual translation and independent development. Unfortunately, these pages may appear at different times and vary in size, scope, and quality. Furthermore, differential growth rates cause the conceptual mapping between articles in different languages to be both complex and dynamic. These disparities provide the opportunity for a …

Cited by 78 publications (50 citation statements)
References 8 publications
“…Finally, the learned patterns are applied to text of pages for which a new infobox has been predicted, to assign values to infobox attributes. A recent paper by Adar et al (2009) (which only came to our attention at the time of writing) starts from the same observation as we do. It presents a system for completing infoboxes for English, German, French, and Spanish Wikipedia, which is based on learning a mapping between infoboxes and attributes in multiple languages.…”
Section: Previous Work (supporting)
confidence: 62%
“…Thus, McDonough (2012), through an online survey of volunteer Wikipedia translators, investigated how they perceive translation carried out in a crowdsourcing environment, concluding that their main motivation is to make information available to other people. Taking a different approach, focused on exploiting the possibilities of Wikipedia as a multilingual corpus, with a view to, for example, creating lexicographic material, there are also numerous initiatives (Adafre and de Rijke, 2006; Potthast et al, 2008; Jones et al, 2008; Schönhofen et al, 2008; Adar et al, 2009; Nguyen et al, 2009; Yen et al, 2012; Oliver and Climent, 2012). These proposals highlight the perception of Wikipedia as a cultural phenomenon and product, as well as the varied possibilities its study offers in the social sciences and humanities.…”
Section: Wikipedia as a Cultural System (unclassified)
“…In addition to this, there has been the development of technical solutions such as WikiTranslate [30] or CLWE [24] to aid the translation process, supporting individuals to cross-collaborate and discuss their translation work flow [24]. Moreover, in response to the call for improved content and consistency within translated Wikipedia articles, automated techniques have also been developed for improving content of translated Wikipedia articles [31].…”
Section: Related Work (mentioning)
confidence: 99%