Imputation Techniques for Missing Fields and Implausible Values in Public Transit Smart Card Data
This paper proposes using a methodology to improve the quality of archived public transit smart card data. Using rules to verify the spatial-temporal constraints of objects in a public transit network, the procedure identifies erroneous, suspect and irrelevant data and imputes plausible values based on two concepts: the regularity in public transit operations and the regularity in cardholders’ historic travel patterns. Applied to one month of transactions, most of the lost information is recovered and the spatial-temporal movements of the objects are re-established. The methodology can be generalized for use in other datasets.
-
Corporate Authors:
World Conference on Transport Research Society
Secretariat, 14 Avenue Berthelot
69363 Lyon cedex 07, France -
Authors:
- Chu, Ka Kee Alfred
- Chapleau, Robert
-
Conference:
- 11th World Conference on Transport Research
- Location: Berkeley CA, United States
- Date: 2007-6-24 to 2007-6-28
- Publication Date: 2007
Language
- English
Media Info
- Media Type: CD-ROM
- Features: Figures; Maps; References; Tables;
- Pagination: 39p
- Monograph Title: 11th World Conference on Transport Research
Subject/Index Terms
- TRT Terms: Data mining; Data quality; Public transit; Smart cards; Spatial analysis; Travel patterns
- Uncontrolled Terms: Imputation; Temporal data
- Subject Areas: Data and Information Technology; Planning and Forecasting; Public Transportation; I72: Traffic and Transport Planning;
Filing Info
- Accession Number: 01118377
- Record Type: Publication
- Files: TRIS
- Created Date: Jan 13 2009 9:30AM