This article was downloaded by: [18.172.5.91] On: 05 August 2016, At: 07:10 Publisher: Institute for Operations Research and the Management Sciences (INFORMS) INFORMS is located in Maryland, USA
Transportation Science
Publication details, including instructions for authors and subscription information: http://pubsonline.informs.org Revenue Management in a Dynamic Network Environment
Dimitris Bertsimas, Ioana Popescu,
To cite this article: Dimitris Bertsimas, Ioana Popescu, (2003) Revenue Management in a Dynamic Network Environment. Transportation Science 37(3):257-277. http://dx.doi.org/10.1287/trsc.37.3.257.16047
Full terms and conditions of use: http://pubsonline.informs.org/page/terms-and-conditions
This article may be used only for the purposes of research, teaching, and/or private study. Commercial use or systematic downloading (by robots or other automatic processes) is prohibited without explicit Publisher approval, unless otherwise noted. For more information, contact [email protected]. The Publisher does not warrant or guarantee the article’s accuracy, completeness, merchantability, fitness for a particular purpose, or non-infringement. Descriptions of, or references to, products or publications, or inclusion of an advertisement in this article, neither constitutes nor implies a guarantee, endorsement, or support of claims made of that product, publication, or service. © 2003 INFORMS
Please scroll down for article—it is on subsequent pages
INFORMS is the largest professional society in the world for professionals in the fields of operations research, management science, and analytics. For more information on INFORMS, its publications, membership, or meetings visit http://www.informs.org Revenue Management in a Dynamic Network Environment
Dimitris Bertsimas • Ioana Popescu Sloan School of Management, MIT, Cambridge, Massachusetts 02139 INSEAD, Fontainebleau Cedex 77305, France [email protected] • [email protected]
e investigate dynamic policies for allocating scarce inventory to stochastic demand for Wmultiple fare classes,in a network environment so as to maximize total expected rev- enues. Typical applications include sequential reservations for an airline network,hotel,or car rental service. We propose and analyze a new algorithm based on approximate dynamic programming,both theoretically and computationally. This algorithm uses adaptive,nonad- ditive bid prices from a linear programming relaxation. We provide computational results that give insight into the performance of the new algorithm and the widely used bid-price control,for several networks and demand scenarios. We extend the proposed algorithm to handle cancellations and no-shows by incorporating oversales decisions in the underlying linear programming formulation. We report encouraging computational results that show that the new algorithm leads to higher revenues and more robust performance than bid-price control.
Introduction policies for allocating inventory to correlated,stochas- Capacity constrained service industries,such as trans- tic demand for multiple classes,in a network environ- portation,tourism,entertainment,media,and internet ment. Specifically,we design a decision support tool, providers are constantly faced with the problem of based on stochastic and dynamic optimization tech- intelligently allocating their limited,perishable inven- niques,that at each point in time accepts or rejects a tories to demand from different market segments, reservation request,based on the currently available with the objective of maximizing total revenues. Rev- inventory,past sales and future potential demand,so enue management is concerned with the theory and as to maximize total expected revenues. practice underlying this type of problem. Following airline deregulation,revenue management techniques Problem Definition have had an important impact on the development of the industry,providing up to 4%–10% increases The main problem we address in this paper is as fol- in company revenues (Fuchs 1987). For example,in lows. We are given an airline (hotel,car rental) net- 1997,American Airlines collected one billion dollars work composed of l legs (pairs of consecutive days for hotels,car rentals),which are used to serve a total
Downloaded from informs.org by [18.172.5.91] on 05 August 2016, at 07:10 . For personal use only, all rights reserved. by implementing revenue management,representing most of the company’s profit (Cook 1998). of m demand classes. The initial inventory is given by
Optimization techniques have been essential in the a vector N = N1 Nl of leg capacities. The net- development of revenue management tools,particu- work is described by a l ×m matrix A and a m-vector
larly for seat allocation models. In this research,we R = R1 Rm: Rj is the fare category of class j,
are interested in investigating the design of dynamic which utilizes aij units of resource (leg) i. In this way,
0041-1655/03/3703/0257$05.00 Transportation Science © 2003 INFORMS 1526-5447 electronic ISSN Vol. 37,No. 3,August 2003,pp. 257–277 BERTSIMAS AND POPESCU Revenue Management in a Dynamic Network Environment
a demand class j is defined by its itinerary Aj (a col- does not matter). Usually,we have partial information
umn of matrix A) and its fare category Rj . In an airline about the demand process,which might consist of the network without group discounts, A is a 0–1 matrix expected demand to come Dt = Et,and possibly which may contain repeated columns for each fare other type of information available from forecasting class on a given itinerary. To account for special group tools,such as cancellation or no-show probabilities. fares,integer multiples of the itinerary-incidence vec- The state of the system S is given by the time t tor are allowed. (t periods to departure) and the sales-to-date record For example,consider a very simple network cor- s = s1 sm for each demand class. If cancellations responding to a weekend in a hotel: There are three and no-shows are not allowed,then it is sufficient to nodes Fri, Sat, Sun and l = 2 legs (1) Fri–Sat and define the state based on the remaining inventory n = (2) Sat–Sun with total capacities N = N N . Sup- 1 2 n n. A common practice is to use the latter pose there is demand for all types of stays,i.e., 1 l model and account for cancellations and no shows by “itineraries:” (1) Fri–Sat,(2) Sat–Sun,and (3) Fri–Sat– incorporating a virtual increase,called overbooking Sun,with two ( high and low) fare classes for each type. pads,in the initial capacity definition. Moreover,suppose there are discounts for groups of 10 The general stochastic,dynamic inventory control size k1 = 10 for (1) Fri–Sat night stays,at a rate of R1 per group,that is a total of m = 7 classes. The leg- problem for network revenue management (NRM) class incidence matrix,together with the correspond- can thus be stated as follows: At time t,and given ing fare structure R,is given by: that the state of the network is S,should we accept or reject a new class j request? The overall objective 10 Rh Rl Rh Rl Rh Rl R is to maximize total expected revenues. 1 1 2 2 3 3 1 R = 110011 10 The decision to accept or reject determines an A admission control policy,which is in general a func- 001111 0 tion of the current network configuration (s or n),
We assume a finite booking horizon of length T , the time-to-go t,the currently requested fare Rj ,as with the time line sufficiently discretized so as to well as partial information from demand forecasts. If allow at most one request (reservation or cancellation) we accept the request,the new state becomes s +
per time period almost surely (a.s.). Time is counted kj · ej t− 1,where kj is the size of the class j group
backwards: Time t = T is the beginning of the book- request,and ej is the jth unit vector; when can- ing horizon and time t = 0 is the end of the reserva- cellations are not allowed,the state of the network tion period,where the no-shows are being counted. nt becomes simply n − Aj t. If the request is Customers who do not consume their reservations get rejected,only the time component of the state vector full,partial,or no refund,depending on their fare is changed to t − 1. class. If at the end of the horizon,the inventory is In reality,the control policy has an impact on oversold,at time t =−1 redistribution decisions are the demand process,because customer choice may being made. These can be class upgrades or customer depend on the opportunity set. Capturing and quan- bumping,in which case companies pay overbooking tifying this type of feedback phenomenon is a sub- penalties. For the case of hotels and car rentals,an tle task that goes beyond the scope of this work. We infinite time horizon could be more appropriate; how- will thus make the simplifying assumption that the ever,one can decompose the problem in fixed length demand process is independent of the control policy.
Downloaded from informs.org by [18.172.5.91] on 05 August 2016, at 07:10 . For personal use only, all rights reserved. time periods (one month,one year,etc.). The demand (to come) process at time t is denoted by t,and t represents the corresponding ran- Notation t We will use the following notation throughout the dom vector of cumulative demands. That is, j is a random variable representing the number of class j paper. Vectors will be denoted in bold,and random requests to come from time t until departure (order variables and processes in calygrahic style.
258 Transportation Science/Vol. 37,No. 3,August 2003 BERTSIMAS AND POPESCU Revenue Management in a Dynamic Network Environment
Time: NRM problem. To compare these different policies, T = length of time horizon (number of time peri- we provide structural and computational results. ods); The most popular technique developed in the cur- t = time periods left until departure (count-down). rent literature is an additive bid-pricing approach. Network: These are mechanisms whereby the opportunity cost l = number of legs in the network; of each itinerary is estimated as the sum of the m = number of classes (itineraries with fare cate- shadow prices of the incident legs,obtained from a gories); linear programming formulation of the problem (see N = total initial network capacity (l-vector); Formulation (2) in §2.2). However,there are two obvi- A = leg-class incidence l × m-matrix; ous drawbacks to additive bid prices: aij = quantity of resource i utilized by bundle j. (a) They are not well defined if there are multiple Sales and inventory: dual solutions. n = remaining inventory (l-vector); (b) They are restrictive in their way of taking into s = sales to date vector (m-vector); account bundles by their predefined additive struc- t sj = the number of itineraries sold to class j until ture. In particular,they do not account for changes of time t; a dual basis in response to accepting large-group and so = the number of class j itineraries overbooked at j multileg itinerary requests. departure. The contributions of this paper are as follows: Fares,refunds,and penalties: (1) We propose an efficient control policy,that is R = revenue collected for one class j sold; j well defined if there are multiple dual solutions, Rc = the refund per class j cancellation; j and does not have an additive structure. The pro- Rns = the refund per class j no-show; j posed control policy,which we call certainty equiva- C = the overbooking penalty per demand class j. j lent control (CEC),belongs in the class of approximate Demand: dynamic programming mechanisms (see Bertsekas pt = probability of request for class j at time t; j and Tsitsiklis 1998),in which the cost-to-go function is pct = the probability of a class j cancellation occur- j approximated by the value of a linear programming ring at time t; (LP) relaxation. We remark that this LP is equivalent pt = the probability of no request (reservation or 0 to a network flow problem in the case of origin- cancellation) at time t; destination (OD) demands or linear networks (where pns = the probability that a class j reservation will j nodes can be ordered so that the arcs are pairs of con- not show up; secutive nodes ii + 1 i = 1 l). t = demand (to come) process (m-dimensional); t = aggregate demand (to come) distribution (2) We provide structural properties that com- (m-dimensional); pare the behavior of the proposed CEC policy with Dt = Et = expected aggregate demand to come the additive bid-price approach. These results offer (m-dimensional); insight into the behavior of both methods. = the cancellation process,adapted to the sales (3) We propose several algorithmic improvements history (not a decision); of the CEC policy based on approximate dynamic = the no-show distribution (adapted to final programming. sales,adjusted for cancellations). (4) We provide computational results that give We will use the operator x+ = maxx0,for insight into the performance of these algorithms and + Downloaded from informs.org by [18.172.5.91] on 05 August 2016, at 07:10 . For personal use only, all rights reserved. x ∈ R,which naturally extends for vectors: x = several variations,for different networks and demand + + n scenarios. We observe that the CEC algorithm per- x1 xn for x = x1 xn ∈ R . forms very well in practice,giving results that are Contributions very close to optimum. For high load factors,we In this paper,we evaluate from different perspectives observe an average 5%–10% improvement over exist- several policies for solving the dynamic and stochastic ing policies (additive bid pricing). We describe and
Transportation Science/Vol. 37,No. 3,August 2003 259 BERTSIMAS AND POPESCU Revenue Management in a Dynamic Network Environment
simulate extensions of this algorithm that result in sig- and Stone (1991). An analogous discrete time solu- nificantly higher improvements (up to 20%). Interest- tion is provided by Lee and Hersh (1993),who ingly,the CEC policy appears to be significantly more also provide a method for estimating arrival rates. robust to noise and bias in the demand forecast. Subramanian et al. (1999) propose a dynamic pro- (5) We extend these algorithms to handle cancel- gramming model that handles cancellations and over- lations and no-shows by incorporating overbooking booking,by analogy to a problem in the optimal control in the underlying mathematical programming control of admission in a queueing system. formulation. This extension preserves several struc- A natural,but much harder question is to deter- tural properties. We report computational results that mine which of the single leg results can be extended show that the proposed algorithm improves upon the to network (multiproduct) settings,and how. A major performance of the bid-price control policy. conceptual advance in the study and practice of net- work revenue management was introduced by bid- Structure price control. These are additive,leg-based shadow The remainder of the paper is organized as follows: In prices used to approximate the opportunity cost of the next section,we present an overview of the litera- itinerary capacity. The concept was proposed by ture. Section 2 describes several (dynamic,stochastic, Simpson (1989),and further analyzed by Williamson linear,and network flow) models and formulations (1992) in extensive simulation studies. A deficiency of for the NRM problem and evaluates the relationships such mathematical programming based models is that between them. Section 3 presents efficient algorithms they do not account for nesting of the fare classes. for the NRM problem based on ideas from approxi- To overcome this problem,Curry (1992) proposes a mate dynamic programming. In §4,we present struc- virtual nesting method. In all cases,however,the allo- tural properties for the proposed CEC policy and cation of capacity to itinerary demand is decided by contrast them with additive bid pricing. In §5,we extend our model and algorithms to handle cancella- a one-time,static rule (one fixed set of bid prices). tions,no-shows,and overbooking. Finally,in §6,we The most realistic and relevant,yet least investi- present computational results. The last section sum- gated model for NRM is the dynamic network model. marizes our conclusions. Talluri and van Ryzin (1998) study a dynamic net- work model using bid-price control mechanisms and argue why bid-price policies are not optimal in gen- 1.Literature Review eral. They provide an asymptotic regime when certain and Positioning bid-price controls,based on a probabilistic program- ming formulation of the problem,are asymptoti- The NRM problem can be viewed as a particular cally optimal. In the context of hotels,Bitran and instance of the general class of perishable asset rev- enue management problems (PARM). Initial devel- Mondschein (1995) propose a dynamic policy that opments in static single leg revenue management extends to multiple-night stays,but do not give any are due to Littlewood (1972),followed by Simpson further analysis. (1989) and Belobaba (1987),who proposed a subop- Chen et al. (2000) formulate the problem as a timal policy for computing protection levels based Markov decision problem,and use linear program- on expected marginal seat revenues (EMSR). Curry ming and regression splines to approximate the value function. Gunther et al. (2000) introduce a new
Downloaded from informs.org by [18.172.5.91] on 05 August 2016, at 07:10 . For personal use only, all rights reserved. (1989),Wollmer (1992),and Brumelle and McGill (1993),derive the optimal solution for the single leg method to compute bid price for single hub airline static model. Robinson (1995) proposes an extension networks. Both studies report encouraging simulation that handles nonmonotonic fare classes. results. None of these take into account cancellations. A characterization of the optimal dynamic policy For further pointers to related literature,we refer based on a threshold time property is due to Diamond the interested reader the survey of McGill and
260 Transportation Science/Vol. 37,No. 3,August 2003 BERTSIMAS AND POPESCU Revenue Management in a Dynamic Network Environment
van Ryzin (1999),and for a schematic summary the demands,this can be computed via the Bellman equa- Ph.D. thesis of Popescu (1999). tion as follows:
Relative Positioning. Our approach can be viewed DPnt as extending the single-leg models investigated by m t j Lee and Hersh (1993) and Bitran and Mondschein = pj maxDPnt− 1 Rj + DPn − A t− 1 (1995),who actually propose a similar LP-based j=1 heuristic for the multiple-night booking problem in t + p0DPnt− 1 a hotel,but without any further analysis. Our over- m t booking model is similar to the early single-leg DP = DPnt− 1 + pj Rj − DPnt− 1 formulation of Rothstein (1971,1974). As far as static j=1 network models are concerned,our LP formulation is + DPn − Aj t− 1 + (1) similar to the one proposed by Williamson (1992),but we handle multiple classes and group bookings. The for all n ≤ N , t ≤ T ,with the boundary conditions: network flow formulation we provide for linear net- DPnt=− if n < 0 for some i and works and origin-destination fare (ODF) demands,is i the same as those proposed by Glover et al. (1982) for DPn 0 = 0 for n ≥ 0 airlines and Chen (1998) for hotels.
We define the opportunity cost OCj nt to be
OC nt= DPnt− 1 − DPn − Aj t− 1 2.General Models j and Relationships Then,the optimal policy accepts a request if and only
The problem of dynamic inventory control for NRM if the corresponding fare Rj exceeds its current oppor- belongs in the class of finite horizon decision prob- tunity cost. The value function can thus be expressed lems under uncertainty. In this section,we present as follows: several optimization models for addressing the NRM m t + problem,and discuss various relationships between DPnt= DPnt− 1 + pj Rj − OCj nt them. Again,we assume that the control policy does j=1 not feed back into the evolution of the demand pro- One can easily show that the value function is non- cess. We restrict to the case when cancellations and decreasing in n and t. Moreover,for the single leg overbooking are not permitted; this case is discussed case without batch arrivals,the value function is con- in detail in §5. cave and opportunity costs decrease with t and n (see Diamond and Stone 1991). Based on these properties, 2.1. The Dynamic Programming Model one can show that the optimal policy is characterized The stochastic dynamic programming model provides by threshold times. Threshold times are points in time the optimal policy for the NRM problem,by evaluat- during the booking horizon before which requests are ing the whole tree of possibilities and making at each rejected,and after which requests are accepted. In point in time the decision (to sell or not to sell) that the general network case,however,this is not true,as would imply higher future expected revenues. we will show in an example in §4.1. The states S = nt are defined by the current Downloaded from informs.org by [18.172.5.91] on 05 August 2016, at 07:10 . For personal use only, all rights reserved. available capacity vector n when the remaining time 2.2. The Integer and Linear Programming is t. The stochasticity is given by the demand process Model (IP, LP) to come t for the remaining t periods. We define The most frequently utilized formulations for the DPnt to be the maximum expected revenue to NRM problem are static models. These are determin- be collected from state nt. Assuming independent istic analogues of the stochastic dynamic problem,
Transportation Science/Vol. 37,No. 3,August 2003 261 BERTSIMAS AND POPESCU Revenue Management in a Dynamic Network Environment
that use only expected demand information,and are Proposition 1. DPnt≤ LPn Dt. usually much simpler to solve. For a full proof see Popescu (1999). The idea is Suppose that all the available demand information that for each pathwise realization of demand,the rev- consists of (unbiased) forecasts of the expected aggre- enue collected by the DP-policy along that path is gate demand to come Dt = Et over the remaining upper bounded by the value of a “perfect hindsight” t periods. The integer programming model computes stochastic program. The perfect hindsight model, the optimal allocation y∗ of available inventory n denoted PI,determines the optimal allocation of to the expected itinerary demand Dt,by maximiz- inventory for each particular realization of demand, ing total revenues subject to capacity and itinerary and then computes the expected reward over all demand constraints. For all n ≤ N and t ≤ T ,we have scenarios: IPn Dt = max R · y DPnt ≤ PIn t = E LPn t s.t. A · y ≤ n t t t 0 ≤ y ≤ D ≤ LPnE = LPn D
y integer t Here, is the cumulative demand corresponding The linear programming relaxation of this problem to scenario . Since the LP value is concave in the provides an efficient way to compute the best possible demand vector,the second inequality follows from “fractional” allocation of inventory,and is defined by Jensen’s inequality. Furthermore,the two values con- simply relaxing the integrality constraint: verge as demand becomes very large (see Popescu 1999). LPn Dt = max R · y s.t. A · y ≤ n 2.3. Origin-Destination RM and t 0 ≤ y ≤ D (2) Linear Networks Consider a particular case of the NRM problem, Clearly,we have that IP n Dt ≤ LPn Dt. This where demand is by ODF,as opposed to itinerary inequality can be strict,but there are particular specific. This is by default the case for linear net- instances when equality holds,one of which (linear works,where there is no question of routing,such as networks) we describe in the next section. One can in hotel or car rental revenue management (where the obtain further insight into the optimal LP-allocation nodes are days). In the case of airline revenue man- by looking at the dual problem: agement this situation occurs when there are perfectly LPn Dt = min v · n + u · Dt substitutable routes (same price,same travel time,and s.t. v · A + u ≥ R so on). In these cases,the Model (2) can be repre- u v ≥ 0 sented as a network flow problem. The advantage of this formulation is that it is very easy to solve in prac- This formulation can be equivalently written as: tice and reoptimization is very fast. For airlines,this model provides (for the company) optimal routing of LPn Dt = min v · n + R − v · A+ · Dt v≥0 path-indifferent customers. t The network representation is as follows: The nodes = min vk · n + uk · D k∈K are the origins and destinations (days for hotels,{air- where K denotes the index set of extreme points port,time}-pairs for airlines). There are multiple for-
Downloaded from informs.org by [18.172.5.91] on 05 August 2016, at 07:10 . For personal use only, all rights reserved. vk uk of the dual polyhedron. Thus,the objective ward arcs odf representing the flow from origin o value of Model (2) is a piecewise linear, concave, and to destination d from fare-class f . The capacity of each nondecreasing function of the expected demand to forward arc is the corresponding aggregate demand t f f come D and available capacity n. We next relate the Dod. The revenue collected along arc odf is Rod. For objective values of the dynamic and static formula- each “leg” ij (flight leg,respectively,pair of consec- tions at any state nt. utive days) there is a backward arc of the type ji,
262 Transportation Science/Vol. 37,No. 3,August 2003 BERTSIMAS AND POPESCU Revenue Management in a Dynamic Network Environment
capacitated by nij ,the available inventory of leg ij. prices v from the linear programming formulation (2). The revenue collected along backward arcs is zero. Then itinerary bid prices are computed additively,at This model has been proposed by Glover et al. (1982) each state S = nt,and for each class j request as in the context of airlines,and for hotel revenue man- S j agement by Chen (1998),who proves that it reduces BPj S = v · A (3) to solving a network flow problem. Notice that bid prices depend on the choice of Proposition 2 (Integrality of Solution). For optimal dual variables vS. This technique was ini- ODF models with integer data, the optimal LP-solution is tially proposed by Simpson (1989),then studied by t t integral, that is IPn D = LPn D . Williamson (1992) in her Ph.D. thesis. Several proba- bilistic models have been investigated by Glover et al. 3.Approximate Dynamic (1982),Williamson (1992),and Talluri and van Ryzin Programming Algorithms (1998). It has been observed (Williamson 1992) that the LP model achieves a better performance,and is In this section,we present several algorithms for the more efficient. For recent work on bid-price control NRM problem. Most of these algorithms belong in the see Talluri and van Ryzin (1998) and Gunther et al. generic class of approximate dynamic programming (2000). methods (see Bertsekas and Tsitsiklis 1998),in which an approximate value to the exact value function is 3.2. Certainty Equivalent Control used in the Bellman equation. The main disadvantages that are apparent from the Given a certain efficient mathematical program- definition of leg-based additive bid prices is that ming formulation (MP) of the NRM problem,a (a) they are not uniquely defined (several sets of generic approximate DP algorithm for the NRM prob- lem has the following structure: shadow prices may be optimal),and (b) they provide an additive approximation of the opportunity costs, Generic MP-Policy. Identify an efficient formula- which are not necessarily additive due to “bundle tion MP of the NRM problem. effects” (group or multileg itinerary requests may At any current state S = nt, determine basis changes in the dual LP). (1) For a class j request,compute an MP-based esti- We provide a different approximation scheme for mate of the opportunity cost OCMPS. j opportunity costs,based on certainty equivalent (2) Sell to class j if and only if its fare R exceeds j adaptive control. The idea is to approximate the value its opportunity cost estimate,i.e., function of the dynamic program DPnt defined in MP Equation (1) by the value of the linear programming Rj ≥ OCj S problem LPn Dt defined in §2.2. Thus,in Step 1 of (3) Go to Step 1 and ITERATE. the generic algorithm,a request for a given class j We will denote the expected value of this policy at will be accepted if and only if its price Rj exceeds its an initial state S as MPS. The difference between current opportunity cost estimate given by: various algorithms comes from the approximating MP and the MP-based opportunity cost measure,that is LP t−1 j t−1 OCj nt= LPn D − LPn − A D from Step 1. Because the opportunity cost estimate is calculated 3.1. Bid-Price Control in terms of LP objectives,it is uniquely defined,in
Downloaded from informs.org by [18.172.5.91] on 05 August 2016, at 07:10 . For personal use only, all rights reserved. Bid-price control is a popular method in NRM, that its value will not depend on the choice of (dual) whereby the opportunity cost (shadow price,or bid solution. Thereby,the first drawback of bid prices is price) of an itinerary is approximated by the sum of resolved. opportunity costs of the legs along that itinerary. First, This is the certainty equivalent control (CEC) pol- opportunity cost estimates (bid prices) are determined icy,and we denote its expected value for an initial
for each leg in the network,usually as the leg-shadow state S as LPS = CECS. For more on certainty
Transportation Science/Vol. 37,No. 3,August 2003 263 BERTSIMAS AND POPESCU Revenue Management in a Dynamic Network Environment
equivalence,see Bertsekas (1995). A similar policy is Simulations Using Monte Carlo Demand Esti- proposed by Bitran and Mondschein (1995) in the con- mation. One problem with the certainty equivalent text of hotel revenue management,but no analysis is policy is that it only considers expected demand infor- provided. Notice that in the case of linear networks, mation,and uses a deterministic approach to a highly it is actually desirable to use the equivalent network stochastic problem. We propose a variation on the flow formulation as described in §2.3 instead of the certainty equivalent policy that uses Monte Carlo usual LP model because reoptimization at each stage demand estimation to capture demand variability. becomes a much simpler task. Suppose we have information that the cumula- More generally,one can use this technique with vir- tive demand to come t−1 follows a certain distribu- tually any mathematical programming (MP) model tion. We generate r samples from this distribution: that provides an approximation of the value func- t−1 t−1 D1 Dr . In Step 1 of the generic algorithm,we tion. The corresponding OC estimate for an itinerary j calculate the opportunity cost estimate of itinerary j MP request at the current state S is defined as OCj S = as a weighted average, MPS − MPS ,where S and S are the rejj accj accj rejj r states corresponding to the accept,and respectively MC LP t−1 OCj nt= $i · OCj n Di reject decision for the current request j. For exam- i=1 ple,for the static LP model without cancellations and where $ = Pt−1 = D t−1 t−1 ∈ &D