https://nap.nationalacademies.org/catalog/27432/critical-issues-in-transportation-for-2024-and-beyond

Confidence-Aware Reinforcement Learning for Self-Driving Cars

Reinforcement learning (RL) can be used to design smart driving policies in complex situations where traditional methods cannot. However, they are frequently black-box in nature, and the resulting policy may perform poorly, including in scenarios where few training cases are available. In this paper, the authors propose a method to use RL under two conditions: (i) RL works together with a baseline rule-based driving policy; and (ii) the RL intervenes only when the rule-based method seems to have difficulty handling and when the confidence of the RL policy is high. Their motivation is to use a not-well trained RL policy to reliably improve AV performance. The confidence of the policy is evaluated by Lindeberg-Levy Theorem using the recorded data distribution in the training process. The overall framework is named “confidence-aware reinforcement learning” (CARL). The condition to switch between the RL policy and the baseline policy is analyzed and presented. Driving in a two-lane roundabout scenario is used as the application case study. Simulation results show the proposed method outperforms the pure RL policy and the baseline rule-based policy.

Record URL:
Availability:
- Find a library where document is available. Order URL: http://worldcat.org/oclc/41297384
Supplemental Notes:
- Copyright © 2022, IEEE.
Authors:
- Cao, Zhong
- Xu, Shaobing
- Peng, Huei
- Yang, Diange
- Zidek, Robert
Publication Date: 2022-7

Language

English

Media Info

Media Type: Web
Features: References;
Pagination: pp 7419-7430
Serial:
- IEEE Transactions on Intelligent Transportation Systems
- Volume: 23
- Issue Number: 7
- Publisher: Institute of Electrical and Electronics Engineers (IEEE)
- ISSN: 1524-9050
- Serial URL: http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6979

Subject/Index Terms

TRT Terms: Autonomous vehicles; Planning; Traffic engineering; Trajectory; Trajectory control
Subject Areas: Data and Information Technology; Highways; Planning and Forecasting;

Filing Info

Accession Number: 01860134
Record Type: Publication
Files: TRIS
Created Date: Sep 30 2022 2:27PM