Abstract:Standard-Nutzungsbedingungen:Die Dokumente auf EconStor dürfen zu eigenen wissenschaftlichen Zwecken und zum Privatgebrauch gespeichert und kopiert werden.Sie dürfen die Dokumente nicht für öffentliche oder kommerzielle Zwecke vervielfältigen, öffentlich ausstellen, öffentlich zugänglich machen, vertreiben oder anderweitig nutzen.Sofern die Verfasser die Dokumente unter Open-Content-Lizenzen (insbesondere CC-Lizenzen) zur Verfügung gestellt haben sollten, gelten abweichend von diesen Nutzungsbedingungen die in… Show more
“…Ginsberg et al [18] only selected 45 significant keywords from 50 million. The method of exhaustion employed by Ginsberg et al [18] is computationally expensive and not easily reproducible by researchers with limited resources [27]. In some cases, researchers have solely relied on keywords recommended by Google [23], [24], [26].…”
Section: Abstractelection and Filteringmentioning
confidence: 99%
“…In the same year as the Ginsberg's publication [18], several studies investigated the usefulness of Google searches for forecasting unemployment in various countries [21±25]. Several papers also used search query data to predict consumption [26], [27], house pricing and sales [28], and travel and consumer confidence [27]. Though studies using web search query data have achieved good results in empirical practice, the field is still young and rapidly developing, with room for discussion and improvement.…”
Several approaches have been proposed for near real-time detection and prediction of the spread of influenza. These include search query data for influenza-related terms, which has been explored as a tool for augmenting traditional surveillance methods. In this paper, we present a method that uses Internet search query data from Baidu to model and monitor influenza activity in China. The objectives of the study are to present a comprehensive technique for: (i) keyword selection, (ii) keyword filtering, (iii) index composition and (iv) modeling and detection of influenza activity in China. Sequential time-series for the selected composite keyword index is significantly correlated with Chinese influenza case data. In addition, one-month ahead prediction of influenza cases for the first eight months of 2012 has a mean absolute percent error less than 11%. To our knowledge, this is the first study on the use of search query data from Baidu in conjunction with this approach for estimation of influenza activity in China.
“…Ginsberg et al [18] only selected 45 significant keywords from 50 million. The method of exhaustion employed by Ginsberg et al [18] is computationally expensive and not easily reproducible by researchers with limited resources [27]. In some cases, researchers have solely relied on keywords recommended by Google [23], [24], [26].…”
Section: Abstractelection and Filteringmentioning
confidence: 99%
“…In the same year as the Ginsberg's publication [18], several studies investigated the usefulness of Google searches for forecasting unemployment in various countries [21±25]. Several papers also used search query data to predict consumption [26], [27], house pricing and sales [28], and travel and consumer confidence [27]. Though studies using web search query data have achieved good results in empirical practice, the field is still young and rapidly developing, with room for discussion and improvement.…”
Several approaches have been proposed for near real-time detection and prediction of the spread of influenza. These include search query data for influenza-related terms, which has been explored as a tool for augmenting traditional surveillance methods. In this paper, we present a method that uses Internet search query data from Baidu to model and monitor influenza activity in China. The objectives of the study are to present a comprehensive technique for: (i) keyword selection, (ii) keyword filtering, (iii) index composition and (iv) modeling and detection of influenza activity in China. Sequential time-series for the selected composite keyword index is significantly correlated with Chinese influenza case data. In addition, one-month ahead prediction of influenza cases for the first eight months of 2012 has a mean absolute percent error less than 11%. To our knowledge, this is the first study on the use of search query data from Baidu in conjunction with this approach for estimation of influenza activity in China.
“…Wu and Brynjolfsson (2013) used real estate related search series to predict housing prices and sales, and demonstrated Google search data predicted future business activities accurately. Many other researches (Askitas and Zimmermann 2009;Kholodilin et al 2009;Goel et al 2010;McLaren 2011;Guzman 2011;Schmidt 2011, 2012;Zhu et al 2012;Chen and Chen 2010) used the search data to forecast many economic indicators, like unemployment rate, inflation expectations, private consumption, consumer behavior, home sales and business cycle. For the forecasting of financial markets, Da et al (2011) constructed a new index to measure investor's attention using search frequency of Google, and forecasted the stock prices in the next weeks.…”
Section: Google Search Data For Economic Forecastmentioning
“…The data has been used for consumption research (Kholodilin et al, 2010;Vosen, 2009, 2010), housing prices (Kulkarni et al, 2009), unemployment (Askitas andZimmermann, 2009;D'Amuri and Marcucci, 2010), as well as finance (Sims, 2010) and policy (Bersier, 2010). It has also been used to enhance the performance of more traditional forecasting models as in Kholodilin et al (2009). Our own exercise on short-term forecasting of German unemployment (Askitas and Zimmermann, 2009) served us well in predicting a quiet labor market at a time when most experts thought it was highly unlikely.…”
Purpose -The purpose of this paper is to investigate the extent to which the authors can use internet search data in order to capture the impact of the 2008 Financial and Economic Crisis on well-being. Design/methodology/approach -The authors look at the G8 countries with a special focus on USA and Germany and investigate whether internet searches reflect the "malaise" caused by the crisis. The authors focus on searches that contain the word "symptoms" and are thought to proxy self-diagnosis and those that contain "side effects" and are thought to proxy treatment. Findings -The authors find that "malaise" searches spike in a fashion coincident with the crisis and its contagion timeline across the G8 countries. The authors show that results based on search recover previously known stylized facts from the economics of health, well-being and the business cycle. Research limitations/implications -Internet penetration is high across the G8 countries. The authors nonetheless cannot get a good handle on the part of the population, which is not online. Moreover the authors cannot get a good grip on all confounding factors. More research would be necessary with access to search microdata. Originality/value -The authors propose global proxies for diagnosis and treatment based on the "search buzz" for symptoms and side effects. The authors can thus capture trends on a global scale. This approach will become increasingly important.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.