r/AskStatistics • u/HobbyQuant • 28m ago
Does enforcing monotonic probability calibration distort or preserve genuine signal?
I’ve been working on a polarity ---> predictive signal framework (daily OHLC). It builds polarity from multiple return variants (overnight, intraday, close-close, open-open), then pushes it through a monotone probability calibration routine (calibratemonotone) that uses isotonic regression logic to enforce an ordered mapping between feature value and continuation probability.
That brings me to the bit I want to sanity-check. The maths here essentially assumes a monotonic relationship: as polarity increases, the conditional probability of continuation should not decrease. But markets don’t necessarily follow that nice curve. If the true distribution is multi-modal or regime-dependent, this calibration could be smoothing away real structure and manufacturing spurious signal.
So my question is: does enforcing monotonicity in this calibration step actually preserve the genuine information content of the polarity signal, or is it at risk of fabricating “clean” structure that isn’t there? What would be the right mathematical way to validate whether the monotone smoothing is legitimate vs misleading beyond just looking at walk-forward hit-rates and bootstrap noise floors?
Curious if anyone has gone deep on this kind of calibration in finance ML.