Simulating Sellers' Behavior in a Reverse Auction B2B Exchange

Subhajyoti Bandyopadhyay1, Alok R. Chaturvedi2, John M. Barron2, Jackie Rees2, Shailendra Mehta2

1 University of Florida, 364 Stuzin Hall, Gainesville, FL 32611-1310, U.S.A.

2 1310 Krannert Graduate School of Management, Purdue University, West Lafayette, IN 47907-1310, U.S.A.

Abstract. Previous research on reverse auction B2B exchanges found that, in an environment where the sellers can collectively meet the total demand and the final (i.e., the highest-priced) seller serves only the residual demand, the sellers resort to a mixed-strategy equilibrium [2]. While price randomization in industrial bids is an accepted norm, it may be argued that managers do not in practice perform advanced game-theoretic calculations when bidding for an order. It is more likely that managers learn the strategy over time and eventually converge towards the theoretical equilibrium. To test this assertion, we model the two-player game in a synthetic environment in which the agents use a simple reinforcement learning algorithm that places progressively more weight on the price bands in which they earn higher profits. We find that, after a sufficient number of iterations, the agents do indeed converge towards the theoretical equilibrium.
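
The learning rule described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a cumulative-payoff (Roth-Erev style) reinforcement rule, and the function name `simulate`, the number of price bands, the demand, the capacity, the unit cost, and the tie-breaking rule are all hypothetical choices made only for illustration. Two sellers each pick a price band with probability proportional to its weight; the lower bidder sells up to capacity, the higher bidder serves the residual, and each seller adds the profit earned to the weight of the band just chosen.

```python
import random


def simulate(num_bands=10, rounds=20000, demand=100.0, capacity=70.0,
             unit_cost=0.0, max_price=1.0, seed=0):
    """Two sellers learn bid prices by payoff reinforcement (illustrative only).

    All parameter values are assumptions, not figures from the paper.
    Returns the band prices and each seller's learned bid distribution.
    """
    rng = random.Random(seed)
    # Representative price for each band, evenly spaced up to max_price.
    prices = [max_price * (k + 1) / num_bands for k in range(num_bands)]
    # Every band starts with the same weight, so early bids are uniform.
    weights = [[1.0] * num_bands for _ in range(2)]
    # Quantity left for the higher bidder once the lower bidder is served.
    low_qty = min(capacity, demand)
    residual = max(demand - low_qty, 0.0)

    for _ in range(rounds):
        # Each seller draws a band with probability proportional to its weight.
        picks = [rng.choices(range(num_bands), weights=w)[0] for w in weights]
        bids = [prices[k] for k in picks]

        if bids[0] == bids[1]:
            # Assumed tie rule: split the demand equally.
            share = min(capacity, demand / 2)
            qty = [share, share]
        else:
            low = 0 if bids[0] < bids[1] else 1
            qty = [0.0, 0.0]
            qty[low] = low_qty
            qty[1 - low] = residual

        # Reinforce the chosen band by the profit it earned this round.
        for i in range(2):
            profit = (bids[i] - unit_cost) * qty[i]
            weights[i][picks[i]] += profit

    probs = [[x / sum(w) for x in w] for w in weights]
    return prices, probs


if __name__ == "__main__":
    band_prices, bid_probs = simulate()
    for price, p0, p1 in zip(band_prices, bid_probs[0], bid_probs[1]):
        print(f"price {price:.2f}: seller 1 {p0:.3f}, seller 2 {p1:.3f}")
```

Comparing the learned distributions with the mixed-strategy equilibrium of [2] would require the equilibrium density for the chosen demand and capacity values, which this sketch does not compute.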

LNCS 2660, pp. 365-374.
