ArXiv

Characterizing and Correcting Effective Target Shift in Online Learning

Authors
Ziyan Li, Naoki Hiratani
Categories
stat.ML, cs.LG
arXiv
https://arxiv.org/abs/2605.07886v1
PDF
https://arxiv.org/pdf/2605.07886v1

Brief

Online kernel regression: Li and Hiratani (2026) derive a closed-form expression showing online kernel regression is equivalent to offline kernel regression with shifted, inaccurate target outputs. They give a closed-form and an iterative target-correction that provably recovers the offline predictor. Experiments on CIFAR-10 and CORe50 show online SGD with corrected targets outperforms using true targets in continual learning.

Why it matters

The authors derive a closed-form expression proving online kernel regression is equivalent to offline kernel regression with systematically shifted (inaccurate) target outputs, and they show that compensating for this effective target shift can provably recover the offline predictor.

Key details

  • They provide both a closed-form target-correction and an iterative sequential form; empirically, online SGD with iteratively corrected targets outperforms learning with the true targets on CIFAR-10 and CORe50 in continual-learning settings (paper by Ziyan Li and Naoki Hiratani, arXiv:2605.07886v1, published 2026-05-08; 22 pages, 6 figures).
Source evidence

Abstract

Online learning from a stream of data is a defining feature of intelligence, yet modern machine learning systems often struggle in this setting, especially under distributional shift. To understand its basic properties, we study the relationship between online and offline learning in the context of kernel regression. We derive a closed-form expression for the function learned by online kernel regression, revealing that online kernel regression is equivalent to offline regression with shifted, inaccurate target outputs. Conversely, we show that by compensating for this effective shift in the teaching signal through target correction, online kernel-based learning can provably learn the same predictor as its offline counterpart. We derive both a closed-form expression for this target correction and an iterative form that can be applied sequentially. Applying this framework to image classification tasks on CIFAR-10 and CORe50, we show that online stochastic gradient descent with iteratively corrected targets outperforms learning with the true targets in continual learning settings. This work therefore provides a basic framework for analyzing and improving online learning in non-stationary environments.

Comment: 22 pages; 6 figures