Abstract.Clustering with constraints is an active area of machine learning and data mining research. Previous empirical work has convincingly shown that adding constraints to clustering improves performance, with respect to the true data labels. However, in most of these experiments, results are averaged over different randomly chosen constraint sets, thereby masking interesting properties of individual sets. We demonstrate that constraint sets vary significantly in how useful they are for constrained clustering; some constraint sets can actually decrease algorithm performance. We create two quantitative measures, informativeness and coherence, that can be used to identify useful constraint sets. We show that these measures can also help explain differences in performance for four particular constrained clustering algorithms.
Recent discoveries of dispersed, non-periodic impulsive radio signals with single-dish radio telescopes have sparked significant interest in exploring the relatively uncharted space of fast transient radio signals. Here we describe V-FASTR, an experiment to perform a blind search for fast transient radio signals using the Very Long Baseline Array (VLBA). The experiment runs entirely in a commensal mode, alongside normal VLBA observations and operations. It is made possible by the features and flexibility of the DiFX software correlator that is used to process VLBA data. Using the VLBA for this type of experiment offers significant advantages over single-dish experiments, including a larger field of view, the ability to easily distinguish local radio-frequency interference from real signals, and the possibility to localize detected events on the sky to milliarcsecond accuracy. We describe our software pipeline, which accepts short integration (∼ ms) spectrometer data from each antenna in real time during correlation and performs an incoherent dedispersion separately for each antenna, over a range of trial dispersion measures. The dedispersed data are processed by a sophisticated detector and candidate events are recorded. At the end of the correlation, small snippets of the raw data at the time of the events are stored for further analysis. We present the results of our event detection pipeline from some test observations of the pulsars B0329+54 and B0531+21 (the Crab pulsar).
We are developing a purely commensal survey experiment for fast (<5 s) transient radio sources. Short-timescale transients are associated with the most energetic and brightest single events in the Universe. Our objective is to cover the enormous volume of transients parameter space made available by ASKAP, with an unprecedented combination of sensitivity and field of view. Fast timescale transients open new vistas on the physics of high brightness temperature emission, extreme states of matter and the physics of strong gravitational fields. In addition, the detection of extragalactic objects affords us an entirely new and extremely sensitive probe on the huge reservoir of baryons present in the IGM. We outline here our approach to the considerable challenge involved in detecting fast transients, particularly the development of hardware fast enough to dedisperse and search the ASKAP data stream at or near real-time rates. Through CRAFT, ASKAP will provide the testbed of many of the key technologies and survey modes proposed for high time resolution science with the SKA.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.