Assessing the validity and reliability of self-report data on contraception use in the MObile Technology for Improved Family Planning (MOTIF) randomised controlled trial

Smoothing splines provide flexible nonparametric regression estimators. However, the high computational cost of smoothing splines for large datasets has hindered their wide application. In this article, we develop a new method, named adaptive basis sampling, for efficient computation of smoothing splines in super-large samples. Except for the univariate case where the Reinsch algorithm is applicable, a smoothing spline for a regression problem with sample size n can be expressed as a linear combination of n basis functions and its computational complexity is generally O(n 3). We achieve a more scalable computation in the multivariate case by evaluating the smoothing spline using a smaller set of basis functions, obtained by an adaptive sampling scheme that uses values of the response variable. Our asymptotic analysis shows that smoothing splines computed via adaptive basis sampling converge to the true function at the same rate as full basis smoothing splines. Using simulation studies and a large-scale deep earth core-mantle boundary imaging study, we show that the proposed method outperforms a sampling method that does not use the values of response variables.

show abstract

Optimal algorithms for crawling a hidden database in the web

Sheng

Zhang

Tao

et al. 2012

Proc. VLDB Endow.

View full text Add to dashboard Cite

A hidden database refers to a dataset that an organization makes accessible on the web by allowing users to issue queries through a search interface. In other words, data acquisition from such a source is not by following static hyper-links. Instead, data are obtained by querying the interface, and reading the result page dynamically generated. This, with other facts such as the interface may answer a query only partially, has prevented hidden databases from being crawled effectively by existing search engines.This paper remedies the problem by giving algorithms to extract all the tuples from a hidden database. Our algorithms are provably efficient, namely, they accomplish the task by performing only a small number of queries, even in the worst case. We also establish theoretical results indicating that these algorithms are asymptotically optimal -i.e., it is impossible to improve their efficiency by more than a constant factor. The derivation of our upper and lower bound results reveals significant insight into the characteristics of the underlying problem. Extensive experiments confirm the proposed techniques work very well on all the real datasets examined.

show abstract

Deep Extreme Learning Machine and Its Application in EEG Classification

Ding

Zhang

Xiaoli

et al. 2015

Mathematical Problems in Engineering

111

View full text Add to dashboard Cite

Recently, deep learning has aroused wide interest in machine learning fields. Deep learning is a multilayer perceptron artificial neural network algorithm. Deep learning has the advantage of approximating the complicated function and alleviating the optimization difficulty associated with deep models. Multilayer extreme learning machine (MLELM) is a learning algorithm of an artificial neural network which takes advantages of deep learning and extreme learning machine. Not only does MLELM approximate the complicated function but it also does not need to iterate during the training process. We combining with MLELM and extreme learning machine with kernel (KELM) put forward deep extreme learning machine (DELM) and apply it to EEG classification in this paper. This paper focuses on the application of DELM in the classification of the visual feedback experiment, using MATLAB and the second brain-computer interface (BCI) competition datasets. By simulating and analyzing the results of the experiments, effectiveness of the application of DELM in EEG classification is confirmed.

show abstract

Graded rough set model based on two universes and its properties

Liu

Miao

Zhang

2012

Knowledge-Based Systems

View full text Add to dashboard Cite

Information Dissemination Analysis of Different Media towards the Application for Disaster Pre-Warning

Zhang

Huang

et al. 2014

PLoS ONE

View full text Add to dashboard Cite

Knowing the information dissemination mechanisms of different media and having an efficient information dissemination plan for disaster pre-warning plays a very important role in reducing losses and ensuring the safety of human beings. In this paper we established models of information dissemination for six typical information media, including short message service (SMS), microblogs, news portals, cell phones, television, and oral communication. Then, the information dissemination capability of each medium concerning individuals of different ages, genders, and residential areas was simulated, and the dissemination characteristics were studied. Finally, radar graphs were used to illustrate comprehensive assessments of the six media; these graphs show directly the information dissemination characteristics of all media. The models and the results are essential for improving the efficiency of information dissemination for the purpose of disaster pre-warning and for formulating emergency plans which help to reduce the possibility of injuries, deaths and other losses in a disaster.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Nan Zhang

Unbiased estimation of size and other aggregates over hidden web databases

Transmission Line Boundary Protection Using Wavelet Transform and Neural Network

An overview on Restricted Boltzmann Machines

Efficient computation of smoothing splines via adaptive basis sampling

Optimal algorithms for crawling a hidden database in the web

Deep Extreme Learning Machine and Its Application in EEG Classification

Graded rough set model based on two universes and its properties

Information Dissemination Analysis of Different Media towards the Application for Disaster Pre-Warning

Contact Info

Product

Resources

About