DISTRIBUTED REINFORCEMENT LEARNING FOR A TRAFFIC ENGINEERING APPLICATION

In this paper, the authors describe how a distributed reinforcement learning problem, in which the returns of many agents are simultaneously updating a single shared policy, is addressed by applying novel reinforcement learning techniques. A traffic simulator is used in the learning process. Two new algorithms are introduced: a value function-based algorithm and one that uses a direct policy evaluation approach. Both algorithms are shown to perform comparably well.

Language

  • English

Media Info

  • Pagination: p. 404-411

Subject/Index Terms

Filing Info

  • Accession Number: 00821745
  • Record Type: Publication
  • Source Agency: UC Berkeley Transportation Library
  • Files: PATH
  • Created Date: Dec 31 2001 12:00AM