Dynamic recovery sequence optimization of subway-bus network with constrained reinforcement learning

Subway and bus networks constitute the backbone of urban public transportation and cater to the daily commuting needs of city residents. For damaged subway-bus networks caused by extreme events, aligning the repair sequence with dynamic passenger flow is crucial to reducing passenger delays. Most existing studies on station recovery strategies ignore the dynamic changes of passenger flow, and this study aims to design the station recovery sequence for the subway-bus network considering dynamic passenger flow. To enhance practicality, a constrained reinforcement learning framework is formulated to incorporate passenger demands, mobility distance, and repair duration into a Markov decision process. Moreover, dynamic graph convolutional networks are adopted to capture time-varying characteristics of passenger flow. The reward function proposed in this work integrates node importance, recovery progress, transferred passengers, and excessive distance penalty to enable efficient post-disaster recovery. Empirical analysis reveals that this methodology increases the number of transferred passengers by 5.53%-11.30%.

Language

  • English

Media Info

Subject/Index Terms

Filing Info

  • Accession Number: 01979590
  • Record Type: Publication
  • Files: TRIS
  • Created Date: Feb 17 2026 1:11PM