Logo image
Dynamic lead time promising
Conference proceeding

Dynamic lead time promising

M. J. Reindorp and M. C. Fu
2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), pp 176-183
Apr 2011

Abstract

Learning Markov processes Nickel Q factor Schedules Supply chains
We consider a make-to-order business that serves customers in multiple priority classes. Orders from customers in higher classes bring greater revenue, but they expect shorter lead times than customers in lower classes. In making lead time promises, the firm must recognize preexisting order commitments, uncertainty over future demand from each class, and the possibility of supply chain disruptions. We model this scenario as a Markov decision problem and use reinforcement learning to determine the firm's lead time policy. In order to achieve tractability on large problems, we utilize a sequential decision-making approach that effectively allows us to eliminate one dimension from the state space of the system. Initial numerical results from the sequential dynamic approach suggest that the resulting policies more closely approximate optimal policies than static optimization approaches.

Metrics

14 Record Views
2 citations in Scopus

Details

Logo image