2012
DOI: 10.1145/2076450.2076468
|View full text |Cite
|
Sign up to set email alerts
|

Software as a service for data scientists

Abstract: F e B R uA RY 2 0 1 2 | Vo L. 5 5 | N o. 2 | c oM M u n i c aT i o n s o f T he ac M 81in performing them for terabyte or larger datasets (increasingly common across scientific disciplines) are quite different from those that applied when data volumes were measured in kilobytes. The result is a computational crisis in many laboratories and a growing need for far more powerful data-management tools, yet the typical researcher lacks the resources and expertise to operate these tools.The answer may be to deliver … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
107
0

Year Published

2012
2012
2015
2015

Publication Types

Select...
8
1

Relationship

2
7

Authors

Journals

citations
Cited by 201 publications
(107 citation statements)
references
References 19 publications
0
107
0
Order By: Relevance
“…This packet loss was not being reported by the router's internal error monitoring, and was only noticed using the owamp active packet loss monitoring tool, which is part of the perfSONAR Toolkit. 4 Because TCP interprets the loss as network congestion, it reacts by rapidly reducing the overall sending rate. The sending rate then slowly recovers due to the dynamic behavior of the control algorithms.…”
Section: Tcp Performancementioning
confidence: 99%
“…This packet loss was not being reported by the router's internal error monitoring, and was only noticed using the owamp active packet loss monitoring tool, which is part of the perfSONAR Toolkit. 4 Because TCP interprets the loss as network congestion, it reacts by rapidly reducing the overall sending rate. The sending rate then slowly recovers due to the dynamic behavior of the control algorithms.…”
Section: Tcp Performancementioning
confidence: 99%
“…The SDSS collaboration has been making increasing use of Globus Online (Foster 2011;Allen et al 2012; https://www .globus.org), a general purpose bulk data transfer system that grew out of the GridFTP infrastructure. This system provides reliable bulk transfer between "endpoints" which are GridFTP servers that have advertised themselves to Globus Online.…”
Section: Future Developmentmentioning
confidence: 99%
“…Pipelining speeds up transfers containing a large number of files, which are sent back-toback without waiting for the complete command's response for the previous transfer; this can reduce latency. As a default, pipelining (pp) is set to 4 in the previous version of GridFTP, while in Globus Online [7], pipelining is set to 5, 10 or 20, depending on the file size.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Related studies have sought to find optimal values of pipelining, concurrency and parallelism, such as Globus Online [7], which sets pipelining (pp) level to 5, 10 and 20, depending on file size. Globus Online sets parallelism (p) level to 2, 4 and 8.…”
Section: Background and Related Workmentioning
confidence: 99%