Assume a set of ordered input-output pairs
, where
are
-dimensional input vectors and
are scalar outputs. Data pairs are made available on a one-at-a-time basis, i.e.,
is made available at time
. Our objective is to infer the predictive distribution of a new, unseen output
given the corresponding input
and data available up to time
,
.