“…Databases such as FLUXNET (http://www.fluxdata.org) contain over 1000 site-years of data, representing more than 8 million hours of flux measurements (Stoy et al., 2013); this database is expected to double in size with a new release of data being processed as this article is being written. Operating at the terabyte to petabyte scale therefore requires coordinated use of distributed data exchange systems to submit, ingest, store, vet, track version and provenance, manipulate, distribute, and discover data (Agarwal et al., 2010; Papale et al., 2012).…”