Coalitional Bargaining with Agent Type Uncertainty

Size: px
Start display at page:

Download "Coalitional Bargaining with Agent Type Uncertainty"

Transcription

1 Coaltonal Barganng wth Agent Type Uncertanty Georgos Chalkadaks and Crag Boutler Department of Computer Scence Unversty of Toronto, Toronto, Canada { gehalk, cebly }@cs.toronto.edu Abstract Coalton formaton s a problem of great nterest n AI, allowng groups of autonomous, ndvdually ratonal agents to form stable teams. Automatng the negotatons underlyng coalton formaton s, naturally, of specal concern. However, research to date n both AI and economcs has largely gnored the potental presence of uncertanty n coaltonal barganng. We present a model of dscounted coaltonal barganng where agents are uncertan about the types (or capabltes) ofpotental partners, and hence the value of a coalton. We cast the problem as a Bayesan game n extensve form, and descrbe ts Perfect Bayesan Equlbra as the solutons to a polynomal program. We then present a heurstc algorthm usng teratve coalton formaton to approxmate the optmal soluton, and evaluate ts performance. 1 Introducton Coalton formaton, wdely studed n game theory and economcs [8], has attracted much attenton n AI as means of dynamcally formng partnershps or teams of cooperatng agents. Whle most models of coalton formaton (e.g., coaltonal barganng processes) assume that agents have full knowledge of types of ther potental partners, n most natural settngs ths wll not be the case. Generally, agents wll be uncertan about varous characterstcs of others (e.g., ther capabltes), whch n turn mposes uncertanty on the value of any coalton. Ths presents the opportunty to learn about the types of others based on ther behavor durng negotaton and by observng ther performance n settngs where coaltons form repeatedly. Agents must be able to form coaltons and dvde the generated value even n such settngs. Here we present a model of dscounted coaltonal barganng under agent type uncertanty. We formulate ths as a Bayesan extensve game wth observable actons [8], where the actons correspond to proposng choces of potental partners and a payoff allocaton, or acceptng or reectng such proposals. Our model generalzes related barganng models by explctly dealng wth uncertanty about agent types (or capabltes) and coaltonal values. We formulate the perfect Bayesan equlbrum (PBE) soluton of ths game as a decdable polynomal program. The complexty of the program makes t ntractable for all but trval problems, so we propose an alternatve heurstc algorthm to fnd good agent strateges n the coaltonal barganng game. Prelmnary experments llustrate the performance of ths heurstc approach. Although there s a consderable body of work on coaltonal barganng, no exstng models deal wth explct type uncertanty. Okada [7] suggests a form of coaltonal barganng where agreement can be reached n one barganng round f the proposer s chosen randomly. Chatteree et al. [3] present a barganng model wth a fxed proposer order, whch results n a delay of agreement. Nether model deals wth type uncertanty nstead, they focus on calculatng subgame-perfect equlbra (SPE). Sus et al. [9] ntroduce stochastc cooperatve games (SCGs), comprsng a set of agents, a set of coaltonal actons, and a functon assgnng to each acton a random varable wth fnte expectaton, representng acton-dependent coalton payoff. Though they provde strong theoretcal foundatons for games wth ths restrcted form of acton uncertanty, they do not model explctly a coalton formaton process. Kraus et al. [4] model coalton formaton under a restrcted form of uncertanty regardng coaltonal values n a request for proposal doman. However, type uncertanty s not captured; rather, the mean value of coaltons s common knowledge, and a manager handles proposals (they also focus on socal welfare maxmzaton rather than ndvdual ratonalty). Chalkadaks and Boutler [2] proposean explct model of type uncertanty and show how ths translates nto coaltonal value uncertanty. We adopt ther model n our paper. However, ther results focus on stablty concepts and how coaltons evolve durng repeated nteracton, as agents gradually learn more about each other s capabltes (n renforcement learnng style). The actual coalton formaton processes used are farly smple and are not nfluenced by strategc consderatons, nor do agents update ther belefs about other agents types durng barganng. Our work analyzes the actual barganng process n more depth. 2 Bayesan Coaltonal Barganng We begn by descrbng the Bayesan coalton formaton model and then defne our coaltonal barganng game. We assume a set of agents N = {1,...,n}, and for each agent a fnte set of possble types T. Each agent has a specfc type t T.WeletT = N T denote the set of type profles. Each knows ts own type t, but not those of other agents. Agent s belefs μ comprse a ont dstrbuton 1227

2 over T,whereμ (t ) s the probablty assgns to other agents havng type profle t. Intutvely, s type reflects ts abltes; and ts belefs about the types of others capture ts uncertanty about ther abltes. For nstance, f a carpenter wants to fnd a plumber and electrcan wth whom to buld a house, her decson to propose (or on) such a partnershp, to engage n a specfc type of proect, and to accept a specfc share of the surplus generated should all depend on her probablstc assessment of ther abltes. A coalton C N of members wth actual types t C has a value V (t C ), representng the value ths group can acheve by actng optmally. However, ths smple characterstc functon representaton of the model [8] s nsuffcent, snce ths value s not common knowledge. An agent can only assess the expected value of such a coalton based on ts belefs: V (C) = t C T C μ (t C )V (t C ). A coalton structure CS parttons N nto coaltons of agents. A payoff allocaton P = x, gven the stochastc nature of payoffs n ths settng, assgns to each agent n coalton C ts share of the value attaned by C (and must be such that C x =1for each C CS ). Chalkadaks and Boutler [2] defne the Bayesan core as a generalzaton of the standard core concept, capturng an ntutve noton of stablty n the Bayesan coalton formaton game. Whle coalton structures and allocatons can sometmes be computed centrally, n many stuatons they emerge as the result of some barganng process among the agents, who propose, accept and reect partnershp agreements [3]. We now defne a (Bayesan) coaltonal barganng game for the model above as a Bayesan extensve game wth observable actons. The game proceeds n stages, wth a randomly chosen agent proposng a coalton and allocaton of payments to partners, who then accept or reect the proposal. A fnte set of barganng actons s avalable to the agents. A barganng acton corresponds to ether some proposal π = C, P C to form a coalton C wth a specfc payoff allocaton P C specfyng payoff shares x to each C, or to the acceptance or reecton of such a proposal. The fntehorzongame proceedsn S stages, and ntally all agents are actve. At the begnnng of stage s S, one of the (say n) actve agents s chosen randomly wth probablty γ = 1 n to make a proposal C, P C (wth C). Each other C smultaneously (wthout knowledge of other responses) ether accepts or reects ths proposal. If all C accept, the agents n C are made nactve and removed from the game. Value V s (t C )=δ s 1 V (t C ) s realzed by C at s, and splt accordng to P C,whereδ (0, 1) s the dscount factor. 1 If any C reects the proposal, the agents reman actve (no coalton s formed). At the end of a stage, the responses are observed by all partcpants. At the end of stage S, any not n any coalton receves ts dscounted reservaton value δ S 1 V (t ) (dscounted sngleton coalton value). 3 Perfect Bayesan Equlbrum The coaltonal barganng game descrbed above s clearly an extensve form Bayesan game. We assume each agent wll 1 Agents could have dfferent δ s. As long as these are common knowledge, our analyss holds wth only trval modfcatons. adopt a sutable behavoral strategy, assocatng wth each node n the game tree at whch t must make a decson a dstrbuton over acton choces for each of ts possble types. Furthermore, snce t s uncertan about the types of other agents, ts observed hstory of other agents proposals and responses gve t nformaton about ther types (assumng they are ratonal). Thus, the preferred soluton concept s that of a perfect Bayesan equlbrum (PBE) [8]. A PBE comprses a profle of behavoral strateges for each agent as well a system of belefs dctatng what each agent beleves about the types of ts counterparts at each node n the game tree. The standard ratonalty requrements must also hold: the strategy for each agent maxmzes ts expected utlty gven ts belefs; and each agent s belefs are updated from stage to stage usng Bayes rule, gven the specfc strateges beng played. In ths secton, we formulate the constrants that must hold on both strateges and belefs n order to form a PBE. Let σ denote a behavoral strategy for, mappng nformaton sets (or observable hstores h) n the game tree at whch must act nto dstrbutons over admssble actons A(h). If s a proposer at h (at stage s), let A(h) =P, the fnte set of proposals avalable at h. Thenσ h,t (π) denotes the (behavoral strategy) probablty that makes proposal π Pat h gven ts type s t. If s a responder at h, thenσ h,t (y) s the probablty wth whch accepts the proposal on the table (says yes)ath (and σ h,t (n) =1 σ h,t (y) s the probablty says no). Let μ denote s belefs wth μ h,t (t ) beng s belefs about the types of others at h gven ts own type s t. We defne the PBE constrants for the game by frst defnng the values to (generc) agent at each node and nformaton set n the game tree, gven a fxed strategy for other agents, and the ratonalty constrants on hs strateges and belefs. We proceed n stages. (1) Let ξ be a proposal node for at hstory h at stage s. Snce the only uncertanty n nformaton set h nvolves the types of other agents, each ξ h corresponds to one such type vector t T ;leth(t ) denote ths node n h. The value to of a proposal π = C, P C at h(t ) s: q h(t ),t (π) =p h(t ) acc (π)x X V s(t C)+ r p h(t ) (π,r)q ξ/π/r,t where: pacc h(t ) (π) s the probablty that all C (other than ) accept π (ths s easly defned n terms of ther fxed strateges); x s s payoff share n P C ; r ranges over response vectors n whch at least one C refuses the proposal; p h(t ) (π, r) denotes the probablty of such a response; and q ξ/π/r,t denotes the contnuaton payoff for at stage s +1at the node ξ/π/r (followng n after proposal π and responses r). Ths contnuaton payoff s defned (recursvely) below. The value of π at hstory h (as opposed to a node) s determned by takng the expectaton w.r.t. possble types: q h,t (π) = t μ h,t (t )q h(t ),t (π). (2) Suppose s a responder at node ξ = h(t ) n hstory h at stage s. As above, ξ corresponds to specfc t n h. W.l.o.g. we can assume s the frst responder (snce all responses are smultaneous). Let pacc h(t ) (π) denote the probablty that all other responders accept π. We then defne the 1228

3 value to of acceptng π at ξ as: q h(t ),t (y) =pacc h(t ) (π)x V s (t C )+ p h(t ) (π, r)q ξ/y/r,t r where agan r ranges over response vectors n whch at least one C,, refuses π; p h(t ) (π, r) s the probablty of such a response; and q ξ/y/r,t s the contnuaton payoff for at stage s +1after responses r by ts counterparts. The value of acceptng at h s gven by the expectaton over type vectors t C w.r.t. s belefs μ h,t as above. The value of reectng π at ξ = h(t ) s the expected contnuaton payoff at stage s +1: q h(t ),t (n) = r p h(t ) (π, r)q ξ/n/r,t (where r ranges over all responses, ncludng pure postve responses, of the others). (3) We have defned the value for takng a specfc acton at any of ts nformaton sets. It s now straghtforward to defne the value to of reachng any other stage s node controlled by or by nature (.e., chance nodes where a random proposer s chosen). Frst we note that, by assumng responds frst to any proposal, our defnton above means that we need not compute the value to at any response node (or nformaton set) controlled by. For an nformaton set h where makes a proposal, consder a node ξ = h (t ) where s assumed to be of type t. Then, s strategy σ h,t specfes a dstrbuton over proposals π (determned gven the values q h,t (π) whch can be calculated as above, and s type t ). Agent s value q t,h(t) at ths node s gven by the expectaton (w.r.t. ths strategy dstrbuton) of ts accept or reect values (or f t s not nvolved n a proposal, ts expected contnuaton value at stage s +1gven the responses of others). Its value at h s then Q t (h )= t μ h,t (t )q t,h(t).wedefneq t (h ) (where s the proposer) as n Case 1 above. Fnally, s value at nformaton set h that defnes the stage s contnuaton game (.e., where nature chooses proposer) s q h,t = 1 Q t m (h ) m where m s the number of actve agents, and h s the nformaton set followng h n whch s the proposer. (4) We are now able to defne the ratonalty constrants. We requre that the payoff from the equlbrum behavoral strategy σ exceeds the payoffs of usng pure strateges. Specfcally, n PBE, for all, t T,allh that correspond to one of s nformaton sets, and all actons b A(h), we have: X X μ h (t ) t a A(h) σ h,t (a)q h(t ),t (a) X μ h (t )q h(t ),t (b) t We also add constrants for the Bayesan update of belef varables for any agent regardng type t κ of agent performng a at any h (for all, t T,allhand all a ): μ h a,t (t κ )=μ h,t (t κ )σ h,tκ (a )/ μ h,t (t k )σ h,tk (a ) t k T Fnally, we add the obvous constrants specfyng the doman of the varous varables denotng strateges or belefs (they take values n [0, 1] and sum up to 1 as approprate). Ths ends the formulaton of the program descrbng the PBE. Ths s a polynomal constrant satsfacton problem: fndng a soluton to ths system of constrants s equvalent to decdng whether a system of polynomal equatons and nequaltes has a soluton [1]. The problem s decdable, but s ntractable. For example, an algorthm for decdng ths problem has been proposed wth exponental complexty [1]. Specfcally, the complexty of decdng whether a system of s polynomals, each of degree at most d n k varables has a soluton s s k+1 d O(k). In our case, assumng a random choce of proposer at each of S rounds, we can show that f α s the number of pure strateges, N the number of agents, T the number of types, then s = O(N S ), d = NS and k = O(αNT ). Ths s due to a varety of combnatoral nteractons evdent n the constrants above, creatng as they do nterdependences between belef and strategy varables. In summary, the formulaton above characterzes the PBE soluton of our coaltonal barganng game as a soluton of a polynomal program. However, t does not seem possble that ths soluton can be effcently computed n general. Nevertheless, ths PBE formulaton may prove useful for the computaton of a PBE n a barganng settng wth a lmted number of agents, types, proposals and barganng stages. 4 Approxmatons The calculaton of the PBE soluton s extremely complex due to both the sze of the strategy space (as a functon of the sze of the game tree, whch grows exponentally wth the problem horzon), and the dependence between varables representng strateges and belefs, as explaned above. We present an approxmaton strategy that crcumvents these ssues to some degree by: (a) performng only a small lookahead n the game tree n order to decde on a acton at any stage of the game; and (b) fxng the belefs of each agent durng ths process. Ths latter approach, n partcular, allows us to solve the game tree by backward nducton, essentally computng an equlbrum for ths fxed-belefs game. Note that whle belefs are held fxed durng the lookahead (whle computng an mmedate acton), they do get updated once the acton s selected and executed, and thus do evolve based on the actons of others (ths s n the sprt of recedng horzon control). Furthermore, we allow samplng of type vectors n the computaton to further reduce the tree sze. More precsely, at any stage of the game, wth a partcular collecton of actve agents (each wth ther own belefs), we mplement the followng steps: 1. An agent (e.g., proposer) constructs a game tree consstng of the next d rounds of barganng (for some small lookahead d). 2 All actve agents are assumed to have fxed belefs at each node n ths tree correspondng to ther belefs at the current stage. The agent computes ts optmal acton for the current round usng backward nducton to approxmate an equlbrum (smlar n nature to an SPE) of ths lmted depth game. (We elaborate below.) Furthermore, they sample partners types when calculatng the values of coaltons and proposals. 2 If less than d rounds reman, the tree s sutably truncated. 1229

4 2. Each player executes ts acton computed for the current round of barganng. If a coalton s formed, t breaks away, leavng the remanng players as actve. 3. All actve agents update ther belefs, gven the observed actons of others n the current round, usng Bayesan updatng. Further, each agent keeps track of the belef updates that any other agent of a specfc type would perform at ths pont. 4. The next barganng round s mplemented by repeatng these steps untl a complete coalton structure s determned or the maxmum number of barganng rounds s reached. We stress that the algorthm above does not approxmate the PBE soluton; gettng good bounds for a true PBE approxmaton would only be lkely by assumng belef updatng at every node of the game tree mentoned n Step 1. However, f our algorthmc assumptons are shared by all agents, each can determne ther best responses to others (approxmately) optmal play, and thus ther play approxmates an equlbrum of the fxed-belefs game. Indeed, we can defne a sequental equlbrum under fxed belefs (SEFB) as an extenson of the SPE and a restrcton of the PBE for a fxed-belefs barganng game, and can show the followng (stated nformally here): Theorem 1 If the Bayesan core (BC) of a Bayesan coaltonal game G [2] s non-empty, and so s the BC of each one of G s subgames, then regardless of nature s choce of proposers there s an SEFB strategy profle of the correspondng fxed-belefs dscounted Bayesan coaltonal barganng game that produces a BC element; and conversely, f there s an order ndependent 3 SEFB profle for a Bayesan coaltonal barganng game, then t leads to a confguraton that s n the BC of the underlyng G. Ths result descrbes some noton of equvalence between cooperatve and non-cooperatve Bayesan coalton formaton soluton concepts, and s smlar to results (e.g., Moldovanu et al. [5]) for non-stochastc envronments. It also motvates further Step 1 of our heurstc algorthm, equatng fxed belef equlbrum computaton wth determnaton of ( s part of) the Bayesan core. We now elaborate on ths process. We assume that the agents proceed to negotatons that wll last d rounds (correspondng to the algorthm s lookahead value d) under the assumpton that all belefs wll reman fxed to ther present values throughout the (Step 1) process. We wll present the delberatons of agent durng negotatons. For fxed types t of possble partners, drawn accordng to μ, wll reason about the game tree and assume fxed belefs of other agents. (Agents wll track of the updates of other agents belefs after ths stage of barganng; see Step 2 above). Then, can calculate the optmal acton of any t agent (ncludng hmself) at any nformaton set by takng expectatons over the correspondng tree nodes. We begn our analyss at the last stage d of negotatons. In any node ξ after hstory h where of type t s a responder to proposal π P and assumes a specfc type vector for partners, he expects a value for acceptng that s dfferent to hs (dscounted) reservaton value only f all other responders accept the proposal as well: q h(t ),t xv (y) = d (t C) f all t t C accept (1) V d (t ) otherwse 3 Astrategyproflesorder ndependent ff when played t leads to a specfc CS,P, ndependently of the choce of proposers. However, to evaluate ths acceptance condton, would need to know the other responders strateges (whch n turn depend on s strategy). Therefore, wll make the smplfyng assumpton that all other responders evaluate ther response to π by assumng that the rest of the agents (ncludng ) wll accept the proposal. Thus, any wth t t s assumed by to accept f he evaluates hs expected payoff from acceptance as beng greater than hs (dscounted) reservaton payoff: X x μ (t )V d ({t,t }) V d (t ) (2) t t C Wth ths assumpton, s able to evaluate the acceptance condton n Eq. 1 above, and so calculate a specfc q h(t ),t (y) value. Note that the use of ths assumpton can sometmes lead to an overestmate of the value of a node. At node ξ = h(t ), can also evaluate hs refusal value as q h(t ),t (n) =V d (t ) n ths last round. Then, responder s actual strategy at h can be evaluated as the strategy maxmzng s expected value gven μ h,t : σ h,t =arg max { μ h,t r {y,n} t t C (t )q h(t ),t (r)} If s a proposer of type t delberatng at ξ = h(t ),the value of makng proposal π s: q h(t ),t (π) = xv d (t C) f σ h,t = y, (3) V d (t ) otherwse (.e., wll get hs reservaton value unless all the responders of the specfc type confguraton agree to ths proposal). Furthermore, s expected value q h,t (π) from makng proposal π to coalton C at h can be determned gven μ h,t. Thus, the best proposal that of type t can make to coalton C s the one wth maxmum expected payoff: σ C;h,t = arg max π q h,t (π) wth expected payoff q C;h,t. However, can also propose to other coaltons at h as well. Therefore, the coalton C to whch should propose s the one that guarantees hm the maxmum expected payoff: C =argmax C {q C;h,t }. If P s the payoff allocaton assocated wth that proposal, then the optmal coaltonallocaton par that t can propose n ths subgame (that starts wth proposng at h)s:σ ;h,t = {C,P } wth maxmum expected payoff q C ;h,t. Fnally, f there exst more than one optmal proposal for, randomly selects any of them (ths s taken nto account n agents delberatons accordngly). Of course, when the subgame starts an agent does not know who the proposer n ths subgame wll be; and has only probablstc belefs about the types of hs potental partners. Thus, has to calculate hs contnuaton payoff q d:ξ,t at stage d (that starts at node ξ) wth m partcpants, n the way explaned n the prevous secton. Ths s straghtforward, as can calculate hs expected payoffs from partcpatng n any subgame where some proposes, gven that any can calculate the optmal strateges (and assocated payoffs) for any n ths round d subgame. Now consder play n a subgame startng n perod d 1, agan wth the partcpaton of m agents. The analyss for ths round can be performed n a way completely smlar to 1230

5 the one performed for the last round of negotatons. However, there s one man dfference: the payoffs n the case of a reecton are now the contnuaton payoffs (for agents of specfc type) from the last round subgame. We have to ncorporate ths dfference n our calculatons. Other than that, we can employ a smlar lne of argument to the one used for dentfyng the equlbrum strateges n the last perod. Proceedng n ths way, we defne the contnuaton payoffs and players strateges for each pror round, and fnally determne the frst round actons for any proposer of type t or any responder of type t respondng to any proposal. 5 Expermental Evaluaton To evaluate our approach, we frst conducted experments n two settngs, each wth 5 agents havng 5 possble types. Agents repeatedly engage n epsodes of coalton formaton, each epsode consstng of a number of negotaton rounds. We compare our Bayesan equlbrum approxmaton method (BE) wth KST, an algorthm nspred by a method presented by Kraus et al. [4]. Though ther method s better talored to other settngs, focusng on socal welfare maxmzaton, t s a rare example of a successfully tested dscounted coaltonal barganng method under some restrcted form of uncertanty, whch combnes heurstcs wth prncpled game theoretc technques. It essentally calculates an approxmaton of a kernel-stable allocaton for coaltons that form n each negotaton round wth agents ntentonally compromsng part of ther payoff n order to form coaltons. Lke [4], our KST uses a compromse factor of 0.8, but we assume no central authorty, only one agent proposng per round, and coalton values estmated gven type uncertanty. Durng an epsode, agents progressvely buld a coalton structure and agree on a payment allocaton. The acton executed by a coalton at the end of an epsode (the coaltonal acton) results n one of three possble stochastc outcomes o O = {0, 1, 2} each of dfferng value. Each agent s type determnes ts qualty and the qualty of a coalton s dctated by the sum of the qualty of ts members less a penalty for coalton sze. 4 Coalton qualty then determnes the odds of realzng a specfc outcome (hgher qualty coaltons have greater potental). Fnally, the value of a coalton gven member types s the expected value w.r.t. the dstrbuton over outcomes. In our frst settng, sngleton coaltons receve a penalty of -1 qualty ponts. We compare BE and KST under varous learnng models by measurng average total reward garnered by all coaltons n 30 runs of 500 formaton epsodes each, wth a lmt of 10 barganng rounds per epsode and a barganng dscount factor of δ =0.9. We also compare average reward to the reward that can be attaned usng the optmal, fxed kernel-stable coalton structure { 1, 2, 3, 4, 0 }. We compared BE and KST usng agents that update ther pror over partner types after observng coaltonal actons thus learnng by renforcement (RL) after each epsode and those that do not (No RL). In all cases, BE agents update ther belefs after observng the barganng actons of others 4 We omt the detals here. We only note that agent 0 (of type 0) s detrmental to any coalton (n our 2 frst settngs). durng each negotaton round. There are 388 proposals a BE agent consders when negotatng n a stage wth all fve agents present (fewer n other cases). Table 1(a) shows performance when each agent has a unform pror regardng the types of others. The BE algorthm consstently outperforms KST, even though KST promotes socal welfare (.e., s well-algned wth total reward crteron) rather than ndvdual ratonalty. KST agents wthout RL always converge to the coalton structure { 4, 3, 2, 0, 1 }; ths s due to the fact that they are dscouraged from cooperatng due to the lack of nformaton about ther counterparts. When KST agents learn from observed actons after each epsode (KST-Un-RL) they form the coaltons { 2, 3, 4, 0, 1 } n the last epsode n 16 of 30 runs. BE agents, n contrast, form coaltons based on evolvng belefs about others, and do not form the optmal structure { 1, 2, 3, 4, 0 }. 5 Rather they tend to form coaltons of 2 or 3 members whch exclude agent 0 from beng ther partner. In addton, payoff dvson for BE agents s more algned wth ndvdual ratonalty than t s wth KST. The shares of (averaged) total payoff of KST- Un-RL agents 0 4 are 0.8%, 0.7%, 28.8%, 29.6%, 40.1%, respectvely, whle for BE-Un-RL (SS:10, LA:2) they are 1.3%, 13.4%, 18.8%, 29.5%, 37%; ths more accurately reflects the power [6] of the agents. BE results are reasonably robust wth changng sample sze and lookahead value (at least n ths envronment wth 3125 possble type vectors n a 5-agent coalton). We attrbute the poor performance of KST agents to the fact that they make ther proposals wthout n any way takng nto consderaton the changng belefs of others. Wth the belefs of the agents varyng, negotatons drag (up to the maxmum of 10 rounds) due to refusals, resultng n reduced payoffs. BE agents do not suffer from ths problem, snce they keep track of all possble partners updated belefs, and use them durng negotaton. Thus, they typcally form a coalton structure wthn the frst four rounds of an epsode. We also expermented wth a second settng n whch sngleton coaltons receve a penalty of -2 qualty ponts (rather than -1 above), and where q( t C ) = t t C q(t )/ C (as coaltons get bgger they get penalzed to reflect coordnaton dffcultes). Ths settng makes the qualty of coaltons more dffcult to dstngush. Here, a near-optmal confguraton contans the structure { 4, 3, 2, 1, 0 }. Weusethree dfferent prors: unform, msnformed (agents have an ntal belef of 0.8 that an agent wth type t has type t +2), and nformed (belef 0.8 n the true type of each other agent). The results (Table 1(b)) ndcate that KST agents agan do not do very well, engagng n long negotatons due to unaccounted-for dfferences n belefs among the varous agents. KST-Un-RL agents, for example, typcally use all ten barganng rounds; n contrast, BE-Un-RL usually form structures wthn 3 rounds. Even when KST uses nformed prors, the fact that the expected value of coaltons s not common knowledge takes ts toll. BE agents, on the other hand, derve the true types of ther partners wth 5 Nor should they, gven barganng horzon and δ the kernel and other stablty concepts do not consder barganng dynamcs. 1231

6 Method Reward Optmal CS (expected) KST-Un-NoRL (49.4%) KST-Un-RL (67.3%) BE-Un-NoRL SS=20, LA= (91.2%) BE-Un-RL SS=20, LA= (87.8%) BE-Un-NoRL SS=10, LA= (93.4%) BE Un-RL SS=10, LA= (91.3%) BE-Un-NoRL SS=3, LA= (93.1%) BE-Un-RL SS=3, LA= (91.6%) Method Reward Optmal CS (expected) KST-Un-NoRL (59.6)% KST-Un-RL (59.5)%) BE-Un-NoRL (93.7%) BE-Un-RL (95.2%) KST-Ms-NoRL (59.6)% KST-Ms-RL (63.9)% BE-Ms-NoRL (93.5%) BE-Ms-RL (95.3%) KST-Inf-NoRL (65.6%) KST-Inf-RL (73%) BE-Inf-NoRL (93.3%) BE-Inf-RL 32401(95.6%) Method Q A/B KST-NoRL BE-NoRL KST-NoRL BE-NoRL KST-RL BE-RL KST-RL BE-RL (c) Settng C; Unform Prors; BE (a) Settng A uses SS=5, LA=2; A/B denotes observed relatve power of A over B (b) Settng B; (BE uses SS=10, LA=2) Table 1: Settngs results (average). SS :sample sze; LA :lookahead; Un :unform, Ms :msnformed, Inf :nformedpror. certanty n all experments, and typcally form proftable confguratons wth structures such as { 4, 3, 2, 1, 0 } or { 4, 2, 3, 1, 0 }. We can also see that RL enhances the performance of BE agents slghtly, helpng them further dfferentate the qualty of varous partners. We also report brefly on the results n a settng wth 8 agents, of 2 possble types per agent (4 agents of type A, 4 of type B). The relatve power of type A over B s In ths settng, formng coaltons by mxng agent types s detrmental, wth the excepton of the A, A, B, B ( optmal ), A, A, B and A, B coaltons. There are 2841 proposals an agent consders when negotatng n a stage wth all 8 agents present. The settng makes dscovery of opponent types dffcult, and thus ratonal agents should settle for suboptmal coaltons (hopefully usng them as steppng stones to form better ones later). We also vared the barganng δ (0.95 and 0.5). Agents do not accumulate much reward n ths settng, barganng for many rounds. Instead of reportng reward, we report expected value Q of formaton decsons, Q = C f CV (C), wth f C beng the observed average frequency wth whch coalton C forms and V (C) ts expected value. Results (Table 1(c)) show that BE agents outperform KST agents both n terms of socal welfare and ndvdual ratonalty (the observed relatve power of types the fracton of respectve observed payoffs s close to the true power), and that RL updates are qute benefcal. Further, lowerng the dscount rate to 0.5 forces the agents to form coaltons early, but also contrbutes to better decsons, because t enables the agents to dscover the types of opponents wth more accuracy, effectvely reducng the number of possble opponent responses durng barganng (ntutvely, gven more tme, both a strong and a weak type mght refuse a proposal, whle f tme s pressng the weak mght be the only one to accept). 6 Concludng Remarks and Future Work We proposed an algorthm for coaltonal barganng under uncertanty about the capabltes of potental partners. It uses 6 Relatve power A/B s the expected payoff of A n coaltons excludng B, over the expected payoff of B n coaltons wthout A. teratve coalton formaton wth belef updatng based on the observed actons of others durng barganng, and s motvated by our formulaton of the PBE soluton of a coaltonal barganng game. The algorthm performs well emprcally, and can be combned wth belef updates after observng the results of coaltonal actons (n renforcement learnng style). Future and current work ncludes mplementng a contnuous barganng acton space verson of our algorthm, and also ncorporatng t wthn a broader RL framework facltatng coalton formaton and sequental coaltonal decson makng under uncertanty. We are also nvestgatng approxmaton bounds for our heurstc algorthm. Acknowledgments Thanks to Vangels Markaks for extremely useful dscussons and helpful comments. References [1] S. Basu, R. Pollack, and M.-F. Roy. On the Combnatoral and Algebrac Complexty of Quantfer Elmnaton. Journal of the ACM, 43(6): , [2] G. Chalkadaks and C. Boutler. Bayesan Renforcement Learnng for Coalton Formaton Under Uncertanty. In Proc. of AAMAS 04, [3] K. Chatteree, B. Dutta, and K. Sengupta. A Noncooperatve Theory of Coaltonal Barganng. Revew of Economc Studes, 60: , [4] S. Kraus, O. Shehory, and G. Taase. The Advantages of Compromsng n Coalton Formaton wth Incomplete Informaton. In Proc. of AAMAS 04, [5] B. Moldovanu and E. Wnter. Order Independent Equlbra. Games and Economc Behavor, 9, [6] R.B. Myerson. Game Theory: Analyss of Conflct [7] A. Okada. A Noncooperatve Coaltonal Barganng Game Wth Random Proposers. Games and Econ. Behavor, 16, [8] M.J. Osborne and A. Rubnsten. A course n game theory [9] J. Sus, P. Borm, A. De Wagenaere, and S. Ts. Cooperatve games wth stochastc payoffs. European Journal of Operatonal Research, 113: ,

A MODEL OF COMPETITION AMONG TELECOMMUNICATION SERVICE PROVIDERS BASED ON REPEATED GAME

A MODEL OF COMPETITION AMONG TELECOMMUNICATION SERVICE PROVIDERS BASED ON REPEATED GAME A MODEL OF COMPETITION AMONG TELECOMMUNICATION SERVICE PROVIDERS BASED ON REPEATED GAME Vesna Radonć Đogatovć, Valentna Radočć Unversty of Belgrade Faculty of Transport and Traffc Engneerng Belgrade, Serba

More information

OPERATIONS RESEARCH. Game Theory

OPERATIONS RESEARCH. Game Theory OPERATIONS RESEARCH Chapter 2 Game Theory Prof. Bbhas C. Gr Department of Mathematcs Jadavpur Unversty Kolkata, Inda Emal: bcgr.umath@gmal.com 1.0 Introducton Game theory was developed for decson makng

More information

Introduction to game theory

Introduction to game theory Introducton to game theory Lectures n game theory ECON5210, Sprng 2009, Part 1 17.12.2008 G.B. Ashem, ECON5210-1 1 Overvew over lectures 1. Introducton to game theory 2. Modelng nteractve knowledge; equlbrum

More information

General Examination in Microeconomic Theory. Fall You have FOUR hours. 2. Answer all questions

General Examination in Microeconomic Theory. Fall You have FOUR hours. 2. Answer all questions HARVARD UNIVERSITY DEPARTMENT OF ECONOMICS General Examnaton n Mcroeconomc Theory Fall 2010 1. You have FOUR hours. 2. Answer all questons PLEASE USE A SEPARATE BLUE BOOK FOR EACH QUESTION AND WRITE THE

More information

Single-Item Auctions. CS 234r: Markets for Networks and Crowds Lecture 4 Auctions, Mechanisms, and Welfare Maximization

Single-Item Auctions. CS 234r: Markets for Networks and Crowds Lecture 4 Auctions, Mechanisms, and Welfare Maximization CS 234r: Markets for Networks and Crowds Lecture 4 Auctons, Mechansms, and Welfare Maxmzaton Sngle-Item Auctons Suppose we have one or more tems to sell and a pool of potental buyers. How should we decde

More information

references Chapters on game theory in Mas-Colell, Whinston and Green

references Chapters on game theory in Mas-Colell, Whinston and Green Syllabus. Prelmnares. Role of game theory n economcs. Normal and extensve form of a game. Game-tree. Informaton partton. Perfect recall. Perfect and mperfect nformaton. Strategy.. Statc games of complete

More information

Problems to be discussed at the 5 th seminar Suggested solutions

Problems to be discussed at the 5 th seminar Suggested solutions ECON4260 Behavoral Economcs Problems to be dscussed at the 5 th semnar Suggested solutons Problem 1 a) Consder an ultmatum game n whch the proposer gets, ntally, 100 NOK. Assume that both the proposer

More information

Lecture Note 1: Foundations 1

Lecture Note 1: Foundations 1 Economcs 703 Advanced Mcroeconomcs Prof. Peter Cramton ecture Note : Foundatons Outlne A. Introducton and Examples B. Formal Treatment. Exstence of Nash Equlbrum. Exstence wthout uas-concavty 3. Perfect

More information

CS 286r: Matching and Market Design Lecture 2 Combinatorial Markets, Walrasian Equilibrium, Tâtonnement

CS 286r: Matching and Market Design Lecture 2 Combinatorial Markets, Walrasian Equilibrium, Tâtonnement CS 286r: Matchng and Market Desgn Lecture 2 Combnatoral Markets, Walrasan Equlbrum, Tâtonnement Matchng and Money Recall: Last tme we descrbed the Hungaran Method for computng a maxmumweght bpartte matchng.

More information

- contrast so-called first-best outcome of Lindahl equilibrium with case of private provision through voluntary contributions of households

- contrast so-called first-best outcome of Lindahl equilibrium with case of private provision through voluntary contributions of households Prvate Provson - contrast so-called frst-best outcome of Lndahl equlbrum wth case of prvate provson through voluntary contrbutons of households - need to make an assumpton about how each household expects

More information

Lecture 7. We now use Brouwer s fixed point theorem to prove Nash s theorem.

Lecture 7. We now use Brouwer s fixed point theorem to prove Nash s theorem. Topcs on the Border of Economcs and Computaton December 11, 2005 Lecturer: Noam Nsan Lecture 7 Scrbe: Yoram Bachrach 1 Nash s Theorem We begn by provng Nash s Theorem about the exstance of a mxed strategy

More information

II. Random Variables. Variable Types. Variables Map Outcomes to Numbers

II. Random Variables. Variable Types. Variables Map Outcomes to Numbers II. Random Varables Random varables operate n much the same way as the outcomes or events n some arbtrary sample space the dstncton s that random varables are smply outcomes that are represented numercally.

More information

Tests for Two Ordered Categorical Variables

Tests for Two Ordered Categorical Variables Chapter 253 Tests for Two Ordered Categorcal Varables Introducton Ths module computes power and sample sze for tests of ordered categorcal data such as Lkert scale data. Assumng proportonal odds, such

More information

UNIVERSITY OF NOTTINGHAM

UNIVERSITY OF NOTTINGHAM UNIVERSITY OF NOTTINGHAM SCHOOL OF ECONOMICS DISCUSSION PAPER 99/28 Welfare Analyss n a Cournot Game wth a Publc Good by Indraneel Dasgupta School of Economcs, Unversty of Nottngham, Nottngham NG7 2RD,

More information

3: Central Limit Theorem, Systematic Errors

3: Central Limit Theorem, Systematic Errors 3: Central Lmt Theorem, Systematc Errors 1 Errors 1.1 Central Lmt Theorem Ths theorem s of prme mportance when measurng physcal quanttes because usually the mperfectons n the measurements are due to several

More information

ECE 586GT: Problem Set 2: Problems and Solutions Uniqueness of Nash equilibria, zero sum games, evolutionary dynamics

ECE 586GT: Problem Set 2: Problems and Solutions Uniqueness of Nash equilibria, zero sum games, evolutionary dynamics Unversty of Illnos Fall 08 ECE 586GT: Problem Set : Problems and Solutons Unqueness of Nash equlbra, zero sum games, evolutonary dynamcs Due: Tuesday, Sept. 5, at begnnng of class Readng: Course notes,

More information

Osaka University of Economics Working Paper Series No Hart Mas-Colell Implementation of the Discounted Shapley Value

Osaka University of Economics Working Paper Series No Hart Mas-Colell Implementation of the Discounted Shapley Value Osaka Unversty of Economcs Workng Paper Seres No 2014-2 Hart Mas-Colell Implementaton of the Dscounted Shapley Value Tomohko Kawamor Faculty of Economcs, Osaka Unversty of Economcs November, 2014 Hart

More information

Scribe: Chris Berlind Date: Feb 1, 2010

Scribe: Chris Berlind Date: Feb 1, 2010 CS/CNS/EE 253: Advanced Topcs n Machne Learnng Topc: Dealng wth Partal Feedback #2 Lecturer: Danel Golovn Scrbe: Chrs Berlnd Date: Feb 1, 2010 8.1 Revew In the prevous lecture we began lookng at algorthms

More information

Price and Quantity Competition Revisited. Abstract

Price and Quantity Competition Revisited. Abstract rce and uantty Competton Revsted X. Henry Wang Unversty of Mssour - Columba Abstract By enlargng the parameter space orgnally consdered by Sngh and Vves (984 to allow for a wder range of cost asymmetry,

More information

Quiz on Deterministic part of course October 22, 2002

Quiz on Deterministic part of course October 22, 2002 Engneerng ystems Analyss for Desgn Quz on Determnstc part of course October 22, 2002 Ths s a closed book exercse. You may use calculators Grade Tables There are 90 ponts possble for the regular test, or

More information

Random Variables. b 2.

Random Variables. b 2. Random Varables Generally the object of an nvestgators nterest s not necessarly the acton n the sample space but rather some functon of t. Techncally a real valued functon or mappng whose doman s the sample

More information

Tests for Two Correlations

Tests for Two Correlations PASS Sample Sze Software Chapter 805 Tests for Two Correlatons Introducton The correlaton coeffcent (or correlaton), ρ, s a popular parameter for descrbng the strength of the assocaton between two varables.

More information

15-451/651: Design & Analysis of Algorithms January 22, 2019 Lecture #3: Amortized Analysis last changed: January 18, 2019

15-451/651: Design & Analysis of Algorithms January 22, 2019 Lecture #3: Amortized Analysis last changed: January 18, 2019 5-45/65: Desgn & Analyss of Algorthms January, 09 Lecture #3: Amortzed Analyss last changed: January 8, 09 Introducton In ths lecture we dscuss a useful form of analyss, called amortzed analyss, for problems

More information

Bid-auction framework for microsimulation of location choice with endogenous real estate prices

Bid-auction framework for microsimulation of location choice with endogenous real estate prices Bd-aucton framework for mcrosmulaton of locaton choce wth endogenous real estate prces Rcardo Hurtuba Mchel Berlare Francsco Martínez Urbancs Termas de Chllán, Chle March 28 th 2012 Outlne 1) Motvaton

More information

COS 511: Theoretical Machine Learning. Lecturer: Rob Schapire Lecture #21 Scribe: Lawrence Diao April 23, 2013

COS 511: Theoretical Machine Learning. Lecturer: Rob Schapire Lecture #21 Scribe: Lawrence Diao April 23, 2013 COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture #21 Scrbe: Lawrence Dao Aprl 23, 2013 1 On-Lne Log Loss To recap the end of the last lecture, we have the followng on-lne problem wth N

More information

Production and Supply Chain Management Logistics. Paolo Detti Department of Information Engeneering and Mathematical Sciences University of Siena

Production and Supply Chain Management Logistics. Paolo Detti Department of Information Engeneering and Mathematical Sciences University of Siena Producton and Supply Chan Management Logstcs Paolo Dett Department of Informaton Engeneerng and Mathematcal Scences Unversty of Sena Convergence and complexty of the algorthm Convergence of the algorthm

More information

Jeffrey Ely. October 7, This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License.

Jeffrey Ely. October 7, This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. October 7, 2012 Ths work s lcensed under the Creatve Commons Attrbuton-NonCommercal-ShareAlke 3.0 Lcense. Recap We saw last tme that any standard of socal welfare s problematc n a precse sense. If we want

More information

Instituto de Engenharia de Sistemas e Computadores de Coimbra Institute of Systems Engineering and Computers INESC - Coimbra

Instituto de Engenharia de Sistemas e Computadores de Coimbra Institute of Systems Engineering and Computers INESC - Coimbra Insttuto de Engenhara de Sstemas e Computadores de Combra Insttute of Systems Engneerng and Computers INESC - Combra Joana Das Can we really gnore tme n Smple Plant Locaton Problems? No. 7 2015 ISSN: 1645-2631

More information

Problem Set #4 Solutions

Problem Set #4 Solutions 4.0 Sprng 00 Page Problem Set #4 Solutons Problem : a) The extensve form of the game s as follows: (,) Inc. (-,-) Entrant (0,0) Inc (5,0) Usng backwards nducton, the ncumbent wll always set hgh prces,

More information

Sequential equilibria of asymmetric ascending auctions: the case of log-normal distributions 3

Sequential equilibria of asymmetric ascending auctions: the case of log-normal distributions 3 Sequental equlbra of asymmetrc ascendng auctons: the case of log-normal dstrbutons 3 Robert Wlson Busness School, Stanford Unversty, Stanford, CA 94305-505, USA Receved: ; revsed verson. Summary: The sequental

More information

Equilibrium in Prediction Markets with Buyers and Sellers

Equilibrium in Prediction Markets with Buyers and Sellers Equlbrum n Predcton Markets wth Buyers and Sellers Shpra Agrawal Nmrod Megddo Benamn Armbruster Abstract Predcton markets wth buyers and sellers of contracts on multple outcomes are shown to have unque

More information

/ Computational Genomics. Normalization

/ Computational Genomics. Normalization 0-80 /02-70 Computatonal Genomcs Normalzaton Gene Expresson Analyss Model Computatonal nformaton fuson Bologcal regulatory networks Pattern Recognton Data Analyss clusterng, classfcaton normalzaton, mss.

More information

Problem Set 6 Finance 1,

Problem Set 6 Finance 1, Carnege Mellon Unversty Graduate School of Industral Admnstraton Chrs Telmer Wnter 2006 Problem Set 6 Fnance, 47-720. (representatve agent constructon) Consder the followng two-perod, two-agent economy.

More information

FORD MOTOR CREDIT COMPANY SUGGESTED ANSWERS. Richard M. Levich. New York University Stern School of Business. Revised, February 1999

FORD MOTOR CREDIT COMPANY SUGGESTED ANSWERS. Richard M. Levich. New York University Stern School of Business. Revised, February 1999 FORD MOTOR CREDIT COMPANY SUGGESTED ANSWERS by Rchard M. Levch New York Unversty Stern School of Busness Revsed, February 1999 1 SETTING UP THE PROBLEM The bond s beng sold to Swss nvestors for a prce

More information

Bargaining over Strategies of Non-Cooperative Games

Bargaining over Strategies of Non-Cooperative Games Games 05, 6, 73-98; do:0.3390/g603073 Artcle OPEN ACCESS games ISSN 073-4336 www.mdp.com/ournal/games Barganng over Strateges of Non-Cooperatve Games Guseppe Attanas, *, Aurora García-Gallego, Nkolaos

More information

Applications of Myerson s Lemma

Applications of Myerson s Lemma Applcatons of Myerson s Lemma Professor Greenwald 28-2-7 We apply Myerson s lemma to solve the sngle-good aucton, and the generalzaton n whch there are k dentcal copes of the good. Our objectve s welfare

More information

Elements of Economic Analysis II Lecture VI: Industry Supply

Elements of Economic Analysis II Lecture VI: Industry Supply Elements of Economc Analyss II Lecture VI: Industry Supply Ka Hao Yang 10/12/2017 In the prevous lecture, we analyzed the frm s supply decson usng a set of smple graphcal analyses. In fact, the dscusson

More information

Parallel Prefix addition

Parallel Prefix addition Marcelo Kryger Sudent ID 015629850 Parallel Prefx addton The parallel prefx adder presented next, performs the addton of two bnary numbers n tme of complexty O(log n) and lnear cost O(n). Lets notce the

More information

Finance 402: Problem Set 1 Solutions

Finance 402: Problem Set 1 Solutions Fnance 402: Problem Set 1 Solutons Note: Where approprate, the fnal answer for each problem s gven n bold talcs for those not nterested n the dscusson of the soluton. 1. The annual coupon rate s 6%. A

More information

Static (or Simultaneous- Move) Games of Complete Information

Static (or Simultaneous- Move) Games of Complete Information Statc (or Smultaneous- Move) Games of Complete Informaton Nash Equlbrum Best Response Functon F. Valognes - Game Theory - Chp 3 Outlne of Statc Games of Complete Informaton Introducton to games Normal-form

More information

Topics on the Border of Economics and Computation November 6, Lecture 2

Topics on the Border of Economics and Computation November 6, Lecture 2 Topcs on the Border of Economcs and Computaton November 6, 2005 Lecturer: Noam Nsan Lecture 2 Scrbe: Arel Procacca 1 Introducton Last week we dscussed the bascs of zero-sum games n strategc form. We characterzed

More information

Economic Design of Short-Run CSP-1 Plan Under Linear Inspection Cost

Economic Design of Short-Run CSP-1 Plan Under Linear Inspection Cost Tamkang Journal of Scence and Engneerng, Vol. 9, No 1, pp. 19 23 (2006) 19 Economc Desgn of Short-Run CSP-1 Plan Under Lnear Inspecton Cost Chung-Ho Chen 1 * and Chao-Yu Chou 2 1 Department of Industral

More information

Linear Combinations of Random Variables and Sampling (100 points)

Linear Combinations of Random Variables and Sampling (100 points) Economcs 30330: Statstcs for Economcs Problem Set 6 Unversty of Notre Dame Instructor: Julo Garín Sprng 2012 Lnear Combnatons of Random Varables and Samplng 100 ponts 1. Four-part problem. Go get some

More information

The Integration of the Israel Labour Force Survey with the National Insurance File

The Integration of the Israel Labour Force Survey with the National Insurance File The Integraton of the Israel Labour Force Survey wth the Natonal Insurance Fle Natale SHLOMO Central Bureau of Statstcs Kanfey Nesharm St. 66, corner of Bach Street, Jerusalem Natales@cbs.gov.l Abstact:

More information

Lecture Note 2 Time Value of Money

Lecture Note 2 Time Value of Money Seg250 Management Prncples for Engneerng Managers Lecture ote 2 Tme Value of Money Department of Systems Engneerng and Engneerng Management The Chnese Unversty of Hong Kong Interest: The Cost of Money

More information

Creating a zero coupon curve by bootstrapping with cubic splines.

Creating a zero coupon curve by bootstrapping with cubic splines. MMA 708 Analytcal Fnance II Creatng a zero coupon curve by bootstrappng wth cubc splnes. erg Gryshkevych Professor: Jan R. M. Röman 0.2.200 Dvson of Appled Mathematcs chool of Educaton, Culture and Communcaton

More information

A Case Study for Optimal Dynamic Simulation Allocation in Ordinal Optimization 1

A Case Study for Optimal Dynamic Simulation Allocation in Ordinal Optimization 1 A Case Study for Optmal Dynamc Smulaton Allocaton n Ordnal Optmzaton Chun-Hung Chen, Dongha He, and Mchael Fu 4 Abstract Ordnal Optmzaton has emerged as an effcent technque for smulaton and optmzaton.

More information

Clearing Notice SIX x-clear Ltd

Clearing Notice SIX x-clear Ltd Clearng Notce SIX x-clear Ltd 1.0 Overvew Changes to margn and default fund model arrangements SIX x-clear ( x-clear ) s closely montorng the CCP envronment n Europe as well as the needs of ts Members.

More information

TCOM501 Networking: Theory & Fundamentals Final Examination Professor Yannis A. Korilis April 26, 2002

TCOM501 Networking: Theory & Fundamentals Final Examination Professor Yannis A. Korilis April 26, 2002 TO5 Networng: Theory & undamentals nal xamnaton Professor Yanns. orls prl, Problem [ ponts]: onsder a rng networ wth nodes,,,. In ths networ, a customer that completes servce at node exts the networ wth

More information

An Application of Alternative Weighting Matrix Collapsing Approaches for Improving Sample Estimates

An Application of Alternative Weighting Matrix Collapsing Approaches for Improving Sample Estimates Secton on Survey Research Methods An Applcaton of Alternatve Weghtng Matrx Collapsng Approaches for Improvng Sample Estmates Lnda Tompkns 1, Jay J. Km 2 1 Centers for Dsease Control and Preventon, atonal

More information

occurrence of a larger storm than our culvert or bridge is barely capable of handling? (what is The main question is: What is the possibility of

occurrence of a larger storm than our culvert or bridge is barely capable of handling? (what is The main question is: What is the possibility of Module 8: Probablty and Statstcal Methods n Water Resources Engneerng Bob Ptt Unversty of Alabama Tuscaloosa, AL Flow data are avalable from numerous USGS operated flow recordng statons. Data s usually

More information

Solution of periodic review inventory model with general constrains

Solution of periodic review inventory model with general constrains Soluton of perodc revew nventory model wth general constrans Soluton of perodc revew nventory model wth general constrans Prof Dr J Benkő SZIU Gödöllő Summary Reasons for presence of nventory (stock of

More information

A Comparison of Statistical Methods in Interrupted Time Series Analysis to Estimate an Intervention Effect

A Comparison of Statistical Methods in Interrupted Time Series Analysis to Estimate an Intervention Effect Transport and Road Safety (TARS) Research Joanna Wang A Comparson of Statstcal Methods n Interrupted Tme Seres Analyss to Estmate an Interventon Effect Research Fellow at Transport & Road Safety (TARS)

More information

MgtOp 215 Chapter 13 Dr. Ahn

MgtOp 215 Chapter 13 Dr. Ahn MgtOp 5 Chapter 3 Dr Ahn Consder two random varables X and Y wth,,, In order to study the relatonshp between the two random varables, we need a numercal measure that descrbes the relatonshp The covarance

More information

University of Toronto November 9, 2006 ECO 209Y MACROECONOMIC THEORY. Term Test #1 L0101 L0201 L0401 L5101 MW MW 1-2 MW 2-3 W 6-8

University of Toronto November 9, 2006 ECO 209Y MACROECONOMIC THEORY. Term Test #1 L0101 L0201 L0401 L5101 MW MW 1-2 MW 2-3 W 6-8 Department of Economcs Prof. Gustavo Indart Unversty of Toronto November 9, 2006 SOLUTION ECO 209Y MACROECONOMIC THEORY Term Test #1 A LAST NAME FIRST NAME STUDENT NUMBER Crcle your secton of the course:

More information

University of Toronto November 9, 2006 ECO 209Y MACROECONOMIC THEORY. Term Test #1 L0101 L0201 L0401 L5101 MW MW 1-2 MW 2-3 W 6-8

University of Toronto November 9, 2006 ECO 209Y MACROECONOMIC THEORY. Term Test #1 L0101 L0201 L0401 L5101 MW MW 1-2 MW 2-3 W 6-8 Department of Economcs Prof. Gustavo Indart Unversty of Toronto November 9, 2006 SOLUTION ECO 209Y MACROECONOMIC THEORY Term Test #1 C LAST NAME FIRST NAME STUDENT NUMBER Crcle your secton of the course:

More information

Consumption Based Asset Pricing

Consumption Based Asset Pricing Consumpton Based Asset Prcng Mchael Bar Aprl 25, 208 Contents Introducton 2 Model 2. Prcng rsk-free asset............................... 3 2.2 Prcng rsky assets................................ 4 2.3 Bubbles......................................

More information

Measures of Spread IQR and Deviation. For exam X, calculate the mean, median and mode. For exam Y, calculate the mean, median and mode.

Measures of Spread IQR and Deviation. For exam X, calculate the mean, median and mode. For exam Y, calculate the mean, median and mode. Part 4 Measures of Spread IQR and Devaton In Part we learned how the three measures of center offer dfferent ways of provdng us wth a sngle representatve value for a data set. However, consder the followng

More information

MULTIPLE CURVE CONSTRUCTION

MULTIPLE CURVE CONSTRUCTION MULTIPLE CURVE CONSTRUCTION RICHARD WHITE 1. Introducton In the post-credt-crunch world, swaps are generally collateralzed under a ISDA Master Agreement Andersen and Pterbarg p266, wth collateral rates

More information

Notes on experimental uncertainties and their propagation

Notes on experimental uncertainties and their propagation Ed Eyler 003 otes on epermental uncertantes and ther propagaton These notes are not ntended as a complete set of lecture notes, but nstead as an enumeraton of some of the key statstcal deas needed to obtan

More information

CHAPTER 3: BAYESIAN DECISION THEORY

CHAPTER 3: BAYESIAN DECISION THEORY CHATER 3: BAYESIAN DECISION THEORY Decson makng under uncertanty 3 rogrammng computers to make nference from data requres nterdscplnary knowledge from statstcs and computer scence Knowledge of statstcs

More information

Cyclic Scheduling in a Job shop with Multiple Assembly Firms

Cyclic Scheduling in a Job shop with Multiple Assembly Firms Proceedngs of the 0 Internatonal Conference on Industral Engneerng and Operatons Management Kuala Lumpur, Malaysa, January 4, 0 Cyclc Schedulng n a Job shop wth Multple Assembly Frms Tetsuya Kana and Koch

More information

AC : THE DIAGRAMMATIC AND MATHEMATICAL APPROACH OF PROJECT TIME-COST TRADEOFFS

AC : THE DIAGRAMMATIC AND MATHEMATICAL APPROACH OF PROJECT TIME-COST TRADEOFFS AC 2008-1635: THE DIAGRAMMATIC AND MATHEMATICAL APPROACH OF PROJECT TIME-COST TRADEOFFS Kun-jung Hsu, Leader Unversty Amercan Socety for Engneerng Educaton, 2008 Page 13.1217.1 Ttle of the Paper: The Dagrammatc

More information

INTRODUCTION TO MACROECONOMICS FOR THE SHORT RUN (CHAPTER 1) WHY STUDY BUSINESS CYCLES? The intellectual challenge: Why is economic growth irregular?

INTRODUCTION TO MACROECONOMICS FOR THE SHORT RUN (CHAPTER 1) WHY STUDY BUSINESS CYCLES? The intellectual challenge: Why is economic growth irregular? INTRODUCTION TO MACROECONOMICS FOR THE SHORT RUN (CHATER 1) WHY STUDY BUSINESS CYCLES? The ntellectual challenge: Why s economc groth rregular? The socal challenge: Recessons and depressons cause elfare

More information

Bilateral Bargaining with Externalities

Bilateral Bargaining with Externalities Unversty of Toronto From the SelectedWorks of Joshua S Gans October, 2007 Blateral Barganng wth Externaltes Catherne C de Fontenay, Melbourne Busness School Joshua S Gans Avalable at: https://works.bepress.com/joshuagans/14/

More information

Prospect Theory and Asset Prices

Prospect Theory and Asset Prices Fnance 400 A. Penat - G. Pennacch Prospect Theory and Asset Prces These notes consder the asset prcng mplcatons of nvestor behavor that ncorporates Prospect Theory. It summarzes an artcle by N. Barbers,

More information

Optimal Service-Based Procurement with Heterogeneous Suppliers

Optimal Service-Based Procurement with Heterogeneous Suppliers Optmal Servce-Based Procurement wth Heterogeneous Supplers Ehsan Elah 1 Saf Benjaafar 2 Karen L. Donohue 3 1 College of Management, Unversty of Massachusetts, Boston, MA 02125 2 Industral & Systems Engneerng,

More information

Facility Location Problem. Learning objectives. Antti Salonen Farzaneh Ahmadzadeh

Facility Location Problem. Learning objectives. Antti Salonen Farzaneh Ahmadzadeh Antt Salonen Farzaneh Ahmadzadeh 1 Faclty Locaton Problem The study of faclty locaton problems, also known as locaton analyss, s a branch of operatons research concerned wth the optmal placement of facltes

More information

Advisory. Category: Capital

Advisory. Category: Capital Advsory Category: Captal NOTICE* Subject: Alternatve Method for Insurance Companes that Determne the Segregated Fund Guarantee Captal Requrement Usng Prescrbed Factors Date: Ths Advsory descrbes an alternatve

More information

REPUTATION WITHOUT COMMITMENT IN FINITELY-REPEATED GAMES

REPUTATION WITHOUT COMMITMENT IN FINITELY-REPEATED GAMES REPUTATION WITHOUT COMMITMENT IN FINITELY-REPEATED GAMES JONATHAN WEINSTEIN AND MUHAMET YILDIZ A. In the reputaton lterature, players have commtment types whch represent the possblty that they do not have

More information

2) In the medium-run/long-run, a decrease in the budget deficit will produce:

2) In the medium-run/long-run, a decrease in the budget deficit will produce: 4.02 Quz 2 Solutons Fall 2004 Multple-Choce Questons ) Consder the wage-settng and prce-settng equatons we studed n class. Suppose the markup, µ, equals 0.25, and F(u,z) = -u. What s the natural rate of

More information

In this appendix, we present some theoretical aspects of game theory that would be followed by players in a restructured energy market.

In this appendix, we present some theoretical aspects of game theory that would be followed by players in a restructured energy market. Market Operatons n Electrc Power Systes: Forecastng, Schedulng, and Rsk Manageentg Mohaad Shahdehpour, Hat Yan, Zuy L Copyrght 2002 John Wley & Sons, Inc. ISBNs: 0-47-44337-9 (Hardback); 0-47-2242-X (Electronc)

More information

EDC Introduction

EDC Introduction .0 Introducton EDC3 In the last set of notes (EDC), we saw how to use penalty factors n solvng the EDC problem wth losses. In ths set of notes, we want to address two closely related ssues. What are, exactly,

More information

Likelihood Fits. Craig Blocker Brandeis August 23, 2004

Likelihood Fits. Craig Blocker Brandeis August 23, 2004 Lkelhood Fts Crag Blocker Brandes August 23, 2004 Outlne I. What s the queston? II. Lkelhood Bascs III. Mathematcal Propertes IV. Uncertantes on Parameters V. Mscellaneous VI. Goodness of Ft VII. Comparson

More information

A DUAL EXTERIOR POINT SIMPLEX TYPE ALGORITHM FOR THE MINIMUM COST NETWORK FLOW PROBLEM

A DUAL EXTERIOR POINT SIMPLEX TYPE ALGORITHM FOR THE MINIMUM COST NETWORK FLOW PROBLEM Yugoslav Journal of Operatons Research Vol 19 (2009), Number 1, 157-170 DOI:10.2298/YUJOR0901157G A DUAL EXTERIOR POINT SIMPLEX TYPE ALGORITHM FOR THE MINIMUM COST NETWORK FLOW PROBLEM George GERANIS Konstantnos

More information

REPUTATION WITHOUT COMMITMENT

REPUTATION WITHOUT COMMITMENT REPUTATION WITHOUT COMMITMENT JONATHAN WEINSTEIN AND MUHAMET YILDIZ A. In the reputaton lterature, players have commtment types whch represent the possblty that they do not have standard payoffs but nstead

More information

Privatization and government preference in an international Cournot triopoly

Privatization and government preference in an international Cournot triopoly Fernanda A Ferrera Flávo Ferrera Prvatzaton and government preference n an nternatonal Cournot tropoly FERNANDA A FERREIRA and FLÁVIO FERREIRA Appled Management Research Unt (UNIAG School of Hosptalty

More information

Quadratic Games. First version: February 24, 2017 This version: December 12, Abstract

Quadratic Games. First version: February 24, 2017 This version: December 12, Abstract Quadratc Games Ncolas S. Lambert Gorgo Martn Mchael Ostrovsky Frst verson: February 24, 2017 Ths verson: December 12, 2017 Abstract We study general quadratc games wth mult-dmensonal actons, stochastc

More information

c slope = -(1+i)/(1+π 2 ) MRS (between consumption in consecutive time periods) price ratio (across consecutive time periods)

c slope = -(1+i)/(1+π 2 ) MRS (between consumption in consecutive time periods) price ratio (across consecutive time periods) CONSUMPTION-SAVINGS FRAMEWORK (CONTINUED) SEPTEMBER 24, 2013 The Graphcs of the Consumpton-Savngs Model CONSUMER OPTIMIZATION Consumer s decson problem: maxmze lfetme utlty subject to lfetme budget constrant

More information

Understanding price volatility in electricity markets

Understanding price volatility in electricity markets Proceedngs of the 33rd Hawa Internatonal Conference on System Scences - 2 Understandng prce volatlty n electrcty markets Fernando L. Alvarado, The Unversty of Wsconsn Rajesh Rajaraman, Chrstensen Assocates

More information

Appendix - Normally Distributed Admissible Choices are Optimal

Appendix - Normally Distributed Admissible Choices are Optimal Appendx - Normally Dstrbuted Admssble Choces are Optmal James N. Bodurtha, Jr. McDonough School of Busness Georgetown Unversty and Q Shen Stafford Partners Aprl 994 latest revson September 00 Abstract

More information

Economics 1410 Fall Section 7 Notes 1. Define the tax in a flexible way using T (z), where z is the income reported by the agent.

Economics 1410 Fall Section 7 Notes 1. Define the tax in a flexible way using T (z), where z is the income reported by the agent. Economcs 1410 Fall 2017 Harvard Unversty Yaan Al-Karableh Secton 7 Notes 1 I. The ncome taxaton problem Defne the tax n a flexble way usng T (), where s the ncome reported by the agent. Retenton functon:

More information

Formation of Coalition Structures as a Non-Cooperative Game

Formation of Coalition Structures as a Non-Cooperative Game Formaton of Coalton Structures as a Non-Cooperatve Game Dmtry Levando Natonal Research Unversty Hgher School of Economcs, Moscow, Russa dlevando@hse.ru Abstract. The paper proposes a lst of requrements

More information

4. Greek Letters, Value-at-Risk

4. Greek Letters, Value-at-Risk 4 Greek Letters, Value-at-Rsk 4 Value-at-Rsk (Hull s, Chapter 8) Math443 W08, HM Zhu Outlne (Hull, Chap 8) What s Value at Rsk (VaR)? Hstorcal smulatons Monte Carlo smulatons Model based approach Varance-covarance

More information

Efficient Project Portfolio as a Tool for Enterprise Risk Management

Efficient Project Portfolio as a Tool for Enterprise Risk Management Effcent Proect Portfolo as a Tool for Enterprse Rsk Management Valentn O. Nkonov Ural State Techncal Unversty Growth Traectory Consultng Company Enterprse Rsk Management Symposum Socety of Actuares Chcago,

More information

Quadratic Games. First version: February 24, 2017 This version: August 3, Abstract

Quadratic Games. First version: February 24, 2017 This version: August 3, Abstract Quadratc Games Ncolas S. Lambert Gorgo Martn Mchael Ostrovsky Frst verson: February 24, 2017 Ths verson: August 3, 2018 Abstract We study general quadratc games wth multdmensonal actons, stochastc payoff

More information

Introduction to PGMs: Discrete Variables. Sargur Srihari

Introduction to PGMs: Discrete Variables. Sargur Srihari Introducton to : Dscrete Varables Sargur srhar@cedar.buffalo.edu Topcs. What are graphcal models (or ) 2. Use of Engneerng and AI 3. Drectonalty n graphs 4. Bayesan Networks 5. Generatve Models and Samplng

More information

IND E 250 Final Exam Solutions June 8, Section A. Multiple choice and simple computation. [5 points each] (Version A)

IND E 250 Final Exam Solutions June 8, Section A. Multiple choice and simple computation. [5 points each] (Version A) IND E 20 Fnal Exam Solutons June 8, 2006 Secton A. Multple choce and smple computaton. [ ponts each] (Verson A) (-) Four ndependent projects, each wth rsk free cash flows, have the followng B/C ratos:

More information

A Utilitarian Approach of the Rawls s Difference Principle

A Utilitarian Approach of the Rawls s Difference Principle 1 A Utltaran Approach of the Rawls s Dfference Prncple Hyeok Yong Kwon a,1, Hang Keun Ryu b,2 a Department of Poltcal Scence, Korea Unversty, Seoul, Korea, 136-701 b Department of Economcs, Chung Ang Unversty,

More information

Games and Decisions. Part I: Basic Theorems. Contents. 1 Introduction. Jane Yuxin Wang. 1 Introduction 1. 2 Two-player Games 2

Games and Decisions. Part I: Basic Theorems. Contents. 1 Introduction. Jane Yuxin Wang. 1 Introduction 1. 2 Two-player Games 2 Games and Decsons Part I: Basc Theorems Jane Yuxn Wang Contents 1 Introducton 1 2 Two-player Games 2 2.1 Zero-sum Games................................ 3 2.1.1 Pure Strateges.............................

More information

Analogy-Based Expectation Equilibrium

Analogy-Based Expectation Equilibrium Analogy-Based Expectaton Equlbrum Phlppe Jehel March 2001 Abstract It s assumed that players bundle nodes n whch other players must move nto analogy classes, and players only have expectatons about the

More information

Common Belief Foundations of Global Games

Common Belief Foundations of Global Games Common Belef Foundatons of Global Games Stephen Morrs Prnceton Unversty Hyun Song Shn Bank for Internatonal Settlements Aprl 2015 Muhamet Yldz M.I.T. Abstract Ths paper makes two related contrbutons to

More information

Mechanisms for Efficient Allocation in Divisible Capacity Networks

Mechanisms for Efficient Allocation in Divisible Capacity Networks Mechansms for Effcent Allocaton n Dvsble Capacty Networks Antons Dmaks, Rahul Jan and Jean Walrand EECS Department Unversty of Calforna, Berkeley {dmaks,ran,wlr}@eecs.berkeley.edu Abstract We propose a

More information

Networks of Influence Diagrams: A Formalism for Representing Agents Beliefs and Decision-Making Processes

Networks of Influence Diagrams: A Formalism for Representing Agents Beliefs and Decision-Making Processes Journal of Artfcal Intellgence Research 33 (2008) 109-147 Submtted 11/07; publshed 9/08 Networks of Influence Dagrams: A Formalsm for Representng Agents Belefs and Decson-Makng Processes Ya akov Gal MIT

More information

Practical Distributed Coalition Formation via Heuristic Negotiation in Social Networks

Practical Distributed Coalition Formation via Heuristic Negotiation in Social Networks Practcal Dstrbuted Coalton Formaton va Heurstc Negotaton n Socal Networks Sarvapal D. Ramchurn, Enrco Gerdng, Ncholas R. Jennngs, Hu Jun Agents, Interacton, and Complexty Group School of Electroncs and

More information

A Bootstrap Confidence Limit for Process Capability Indices

A Bootstrap Confidence Limit for Process Capability Indices A ootstrap Confdence Lmt for Process Capablty Indces YANG Janfeng School of usness, Zhengzhou Unversty, P.R.Chna, 450001 Abstract The process capablty ndces are wdely used by qualty professonals as an

More information

Evaluating Performance

Evaluating Performance 5 Chapter Evaluatng Performance In Ths Chapter Dollar-Weghted Rate of Return Tme-Weghted Rate of Return Income Rate of Return Prncpal Rate of Return Daly Returns MPT Statstcs 5- Measurng Rates of Return

More information

Strategic games (Normal form games)

Strategic games (Normal form games) Strategc Normal form ECON5200 Advanced mcroeconomc Lecture n game theory Fall 2010, Part 1 07.07.2010 G.B. Ahem, ECON5200-1 1 Game theory tude mult-peron decon problem, and analyze agent that are ratonal

More information

A Single-Product Inventory Model for Multiple Demand Classes 1

A Single-Product Inventory Model for Multiple Demand Classes 1 A Sngle-Product Inventory Model for Multple Demand Classes Hasan Arslan, 2 Stephen C. Graves, 3 and Thomas Roemer 4 March 5, 2005 Abstract We consder a sngle-product nventory system that serves multple

More information

Simulation Budget Allocation for Further Enhancing the Efficiency of Ordinal Optimization

Simulation Budget Allocation for Further Enhancing the Efficiency of Ordinal Optimization Dscrete Event Dynamc Systems: Theory and Applcatons, 10, 51 70, 000. c 000 Kluwer Academc Publshers, Boston. Manufactured n The Netherlands. Smulaton Budget Allocaton for Further Enhancng the Effcency

More information