Abstract. Recently, deep learning (DL) has emerged as a revolutionary and
versatile tool transforming industry applications and generating new and
improved capabilities for scientific discovery and model building. The
adoption of DL in hydrology has so far been gradual, but the field is now
ripe for breakthroughs. This paper suggests that DL-based methods can open up a
complementary avenue toward knowledge discovery in hydrologic sciences. In
the new avenue, machine-learning algorithms present competing hypotheses that
are consistent with data. Interrogative methods are then invoked to interpret
DL models for scientists to further evaluate. However, hydrology presents
many challenges for DL methods, such as data limitations, heterogeneity
and co-evolution, and the general inexperience of the hydrologic field with
DL. The roadmap toward DL-powered scientific advances will require the
coordinated effort from a large community involving scientists and citizens.
Integrating process-based models with DL models will help alleviate data
limitations. The sharing of data and baseline models will improve the
efficiency of the community as a whole. Open competitions could serve as the
organizing events to greatly propel growth and nurture data science education
in hydrology, which demands a grassroots collaboration. The area of
hydrologic DL presents numerous research opportunities that could, in turn,
stimulate advances in machine learning as well.
Known challenges for petascale machines are that (1) the costs of I/O for high performance applications can be substantial, especially for output tasks like checkpointing, and (2) noise from I/O actions can inject undesirable delays into the runtimes of such codes on individual compute nodes. This paper introduces the flexible 'DataStager' framework for data staging and alternative services within that jointly address (1) and (2). Data staging services moving output data from compute nodes to staging or I/O nodes prior to storage are used to reduce I/O overheads on applications' total processing times, and explicit management of data staging offers reduced perturbation when extracting output data from a petascale machine's compute partition. Experimental evaluations of DataStager on the Cray XT machine at Oak Ridge National Laboratory establish both the necessity of intelligent data staging and the high performance of our approach, using the GTC fusion modeling code and benchmarks running on 1000+ processors.
Spatial-temporal forecasting has attracted tremendous attention in a wide range of applications, and traffic flow prediction is a canonical and typical example. The complex and long-range spatialtemporal correlations of traffic flow bring it to a most intractable challenge. Existing works typically utilize shallow graph convolution networks (GNNs) and temporal extracting modules to model spatial and temporal dependencies respectively. However, the representation ability of such models is limited due to: (1) shallow GNNs are incapable to capture long-range spatial correlations, (2) only spatial connections are considered and a mass of semantic connections are ignored, which are of great importance for a comprehensive understanding of traffic networks. To this end, we propose Spatial-Temporal Graph Ordinary Differential Equation Networks (STGODE). 1 . Specifically, we capture spatial-temporal dynamics through a tensor-based ordinary differential equation (ODE), as a result, deeper networks can be constructed and spatial-temporal features are utilized synchronously. To understand the network more comprehensively, semantical adjacency matrix is considered in our model, and a well-design temporal dialated convolution structure is used to capture long term temporal dependencies. We evaluate our model on multiple real-world traffic datasets and superior performance is achieved over state-of-the-art baselines.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.