BACKGROUND
The increased use of digital data in health research calls for inter- and transdisciplinary collaborations as this data is associated with methodological complexities. This often entails merging the linear deductive approach of health science with the explorative iterative approach of data science. Yet, it is questioned how established health research practices and paradigms should adapt to effectively utilize emerging digital data sources and analytical methods.
OBJECTIVE
The present study systematically examined differences and similarities between health sciences and data science using their approaches towards defining research question as an illustrative example.
METHODS
To this end, we conducted a literature search and organized three expert workshops with researchers at the University of Zurich.
RESULTS
We first developed a glossary to establish a shared understanding of common terminologies and concepts. Subsequently, we delineated the established research workflow for research question formulation, emphasizing "what" and "how," while summarizing the necessary tools for facilitating the process. Finally, we proposed clusters of adaptations to this workflow to integrate essential data science practices, offering recommendations for a pragmatic approach to research question formulation.
CONCLUSIONS
Our findings aim to foster a collective understanding for the systematic adaptation of data science methods in utilizing digital data in health research.