What is the Value of Censored Data? An Exact Analysis for the Data-driven Newsvendor
arXiv:2602.16842v1 Announce Type: new Abstract: We study the offline data-driven newsvendor problem with censored demand data. In contrast to prior works where demand is fully observed, we consider the setting where demand is censored at the inventory level and only sales are observed; sales match demand when there is sufficient inventory, and equal the available inventory otherwise. We provide a general procedure to compute the exact worst-case regret of classical data-driven inventory policies, evaluated over all demand distributions. Our main technical result shows that this infinite-dimensional, non-convex optimization problem can be reduced to a finite-dimensional one, enabling an exact characterization of the performance of policies for any sample size and censoring levels. We leverage this reduction to derive sharp insights on the achievable performance of standard inventory policies under demand censoring. In particular, our analysis of the Kaplan-Meier policy shows that while
arXiv:2602.16842v1 Announce Type: new Abstract: We study the offline data-driven newsvendor problem with censored demand data. In contrast to prior works where demand is fully observed, we consider the setting where demand is censored at the inventory level and only sales are observed; sales match demand when there is sufficient inventory, and equal the available inventory otherwise. We provide a general procedure to compute the exact worst-case regret of classical data-driven inventory policies, evaluated over all demand distributions. Our main technical result shows that this infinite-dimensional, non-convex optimization problem can be reduced to a finite-dimensional one, enabling an exact characterization of the performance of policies for any sample size and censoring levels. We leverage this reduction to derive sharp insights on the achievable performance of standard inventory policies under demand censoring. In particular, our analysis of the Kaplan-Meier policy shows that while demand censoring fundamentally limits what can be learned from passive sales data, just a small amount of targeted exploration at high inventory levels can substantially improve worst-case guarantees, enabling near-optimal performance even under heavy censoring. In contrast, when the point-of-sale system does not record stockout events and only reports realized sales, a natural and commonly used approach is to treat sales as demand. Our results show that policies based on this sales-as-demand heuristic can suffer severe performance degradation as censored data accumulates, highlighting how the quality of point-of-sale information critically shapes what can, and cannot, be learned offline.
Executive Summary
This article examines the value of censored data in the context of the data-driven newsvendor problem, where demand is censored at the inventory level and only sales are observed. The authors provide a general procedure to compute the exact worst-case regret of classical data-driven inventory policies and derive sharp insights on the achievable performance of standard inventory policies under demand censoring. The results highlight the limitations of learning from passive sales data and the importance of targeted exploration to improve worst-case guarantees.
Key Points
- ▸ The article studies the offline data-driven newsvendor problem with censored demand data
- ▸ The authors provide a general procedure to compute the exact worst-case regret of classical data-driven inventory policies
- ▸ The results show that demand censoring fundamentally limits what can be learned from passive sales data
Merits
Methodological Contribution
The article provides a general procedure to compute the exact worst-case regret of classical data-driven inventory policies, which is a significant methodological contribution to the field.
Demerits
Limited Scope
The article focuses on the newsvendor problem and may not be directly applicable to other domains, which could limit its scope and generalizability.
Expert Commentary
The article's methodological contribution is significant, as it provides a general procedure to compute the exact worst-case regret of classical data-driven inventory policies. The results also have important implications for inventory management practices, highlighting the limitations of learning from passive sales data and the importance of targeted exploration. However, the article's focus on the newsvendor problem may limit its scope and generalizability. Overall, the article provides valuable insights into the value of censored data and has the potential to inform the development of more effective inventory management strategies.
Recommendations
- ✓ Practitioners should consider implementing targeted exploration strategies to improve worst-case guarantees in inventory management
- ✓ Future research should aim to generalize the article's findings to other domains and explore the implications of censored data in different contexts