Abstract Despite attempts to unify the different theoretical accounts of the mismatch negativity (MMN), there is still an ongoing debate on the neurophysiological mechanisms underlying this complex brain response. On one hand, neuronal adaptation to recurrent stimuli is able to explain many of the observed properties of the MMN, such as its sensitivity to controlled experimental parameters. On the other hand, several modeling studies reported evidence in favor of Bayesian learning models for explaining the trial-to-trial dynamics of the human MMN. However, direct comparisons of these two main hypotheses are scarce, and previous modeling studies suffered from methodological limitations. Based on reports indicating spatial and temporal dissociation of physiological mechanisms within the timecourse of mismatch responses in animals, we hypothesized that different computational models would best fit different temporal phases of the human MMN. Using electroencephalographic data from two independent studies of a simple auditory oddball task (n = 82), we compared adaptation and Bayesian learning models’ ability to explain the sequential dynamics of auditory deviance detection in a time-resolved fashion. We first ran simulations to evaluate the capacity of our design to dissociate the tested models and found that they were sufficiently distinguishable above a certain level of signal-to-noise ratio (SNR). In subjects with a sufficient SNR, our time-resolved approach revealed a temporal dissociation between the two model families, with high evidence for adaptation during the early MMN window (from 90 to 150-190 ms post-stimulus depending on the dataset) and for Bayesian learning later in time (170-180 ms or 200-220ms). In addition, Bayesian model averaging of fixed-parameter models within the adaptation family revealed a gradient of adaptation rates, resembling the anatomical gradient in the auditory cortical hierarchy reported in animal studies. Author summary The ability to detect and adapt to changes in the environment is an essential feature for survival of living beings. Two main theories have been proposed to explain how the brain performs such an automatic task in the auditory domain. The first one, adaptation, emphasizes the ability of auditory cortical and sub-cortical neurons to attenuate their response to repeated stimuli, which renders the brain more sensitive to deviations from expected sensory inputs. The second one, Bayesian learning, further involves higher-level cortical regions which would update their predictions about incoming stimuli, depending on their performance at predicting previous ones. These two views may not be mutually exclusive, but few experimental works compared them directly. We used computational models inspired from both accounts to assess which view may provide a better fit of two independent electrophysiological datasets from similar auditory experiments. Evidence from a large sample of 82 human subjects provided a complex picture, with adaptation processes seemingly dominating the early phase of auditory brain response, and Bayesian learning processes appearing later on. Our results converge with other recent works in animals and points to the necessary reconciliation of those two theories for a better understanding of auditory perception and statistical learning.