This paper designs novel nonparametric Bellman mappings in reproducing
kernel Hilbert spaces (RKHSs) for reinforcement learning (RL). The
proposed mappings benefit from the rich approximating properties of
RKHSs, make no assumptions on the statistics of the data owing to their
nonparametric nature, require no knowledge of the transition
probabilities of Markov decision processes, and may operate without any
training data.
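For reference, a textbook form of the Bellman mapping that such designs approximate is
\[
(T^{\pi} Q)(s, a) = r(s, a) + \gamma\, \mathbb{E}_{s' \sim P(\cdot \mid s, a)}\big[ Q\big(s', \pi(s')\big) \big],
\]
where $Q$ denotes the state-action value function, $r$ the one-step reward, $\gamma \in [0, 1)$ the discount factor, $P$ the (unknown) transition kernel of the Markov decision process, and $\pi$ the policy; the proposed nonparametric designs sidestep the explicit expectation over $P$.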
Moreover, they allow for on-the-fly sampling via the design of
trajectory samples, re-use past test data via experience replay, effect
dimensionality reduction via random Fourier features, and enable
computationally lightweight operations suitable for efficient online or
time-adaptive learning.
The paper also offers a variational framework to
design the free parameters of the proposed Bellman mappings, and shows
that appropriate choices of those parameters yield several popular
Bellman-mapping designs. As an application, the proposed mappings are
employed to offer a novel solution to the problem of countering outliers
in adaptive filtering. More specifically, with no prior information on
the statistics of the outliers and no training data, a policy-iteration
algorithm is introduced to select online, per time instance, the
“optimal” coefficient p in the least-mean-p-power-error method.
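To fix ideas, the following sketch shows the standard least-mean-p-power (LMP) stochastic-gradient recursion that the selected coefficient p enters; the paper's policy-iteration rule for choosing p per time instance is not reproduced here, so a fixed p serves as a stand-in.

```python
import numpy as np

def lmp_step(w, x, d, mu, p):
    """One LMP update minimizing E|e|^p by stochastic gradient;
    p = 2 recovers the plain LMS filter."""
    e = d - w @ x                                    # a-priori error
    w_new = w + mu * p * np.abs(e) ** (p - 1) * np.sign(e) * x
    return w_new, e

# Toy run: p would be chosen online by the policy-iteration scheme;
# a fixed p = 1.5 is used here purely for illustration.
rng = np.random.default_rng(1)
w_true = rng.standard_normal(4)
w = np.zeros(4)
for n in range(1000):
    x = rng.standard_normal(4)
    d = w_true @ x + 0.01 * rng.standard_normal()    # noisy desired signal
    w, _ = lmp_step(w, x, d, mu=0.01, p=1.5)
```

Smaller values of p de-emphasize large errors, which is why adapting p online is useful when outliers of unknown statistics contaminate the data.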
Numerical tests on synthetic data showcase, in most cases, the superior
performance of the proposed solution over several RL and non-RL schemes.