Markov Chain Monte Carlo Multiple Imputation Using Bayesian Networks for Incomplete Intelligent Transportation Systems Data
The rich data from intelligent transportation systems (ITS) are a precious resource for transportation researchers and practitioners. However, the usability of this resource is greatly limited by missing data. Many imputation methods have been proposed in the past decade, but some issues remain unaddressed or insufficiently addressed, for example, records that are missing entirely, temporal correlation in observations, natural characteristics in raw data, and unbiased estimates for missing values. This paper proposes an advanced imputation method based on recent developments in other disciplines, especially applied statistics. The method uses a Bayesian network to learn from the raw data and a Markov chain Monte Carlo technique to sample from the probability distributions learned by the Bayesian network. It imputes the missing data multiple times and makes statistical inferences about the result. In addition, the method incorporates a time series model so that it can handle data missing in entire rows, an unfavorable missing pattern frequently seen in ITS data. An empirical study shows that the proposed method is robust and accurate. It is ideal for use as a high-quality imputation method for off-line applications.
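The general idea of imputing several times and then pooling the results can be illustrated with a minimal sketch. This is not the paper's method: the abstract gives no implementation details, so the Bayesian network and MCMC sampler are replaced here by a simple normal distribution fitted to the observed values, and the pooling step uses Rubin's rules, a standard way to combine estimates across imputed data sets. The function names (`impute_once`, `rubin_pool`, `multiple_impute_mean`) and the sample speed values are hypothetical, chosen only for illustration.

```python
import random
import statistics


def rubin_pool(estimates, variances):
    """Combine m point estimates and their variances using Rubin's rules."""
    m = len(estimates)
    q_bar = sum(estimates) / m            # pooled point estimate
    within = sum(variances) / m           # average within-imputation variance
    # Between-imputation variance (sample variance of the m estimates).
    between = statistics.variance(estimates) if m > 1 else 0.0
    total = within + (1 + 1 / m) * between
    return q_bar, total


def impute_once(values, rng):
    """Fill None entries with random draws from a normal fitted to observed data.

    Assumption: this stands in for sampling from a learned model such as a
    Bayesian network; it ignores parameter uncertainty and temporal correlation.
    """
    observed = [v for v in values if v is not None]
    mu = statistics.mean(observed)
    sigma = statistics.stdev(observed)
    return [v if v is not None else rng.gauss(mu, sigma) for v in values]


def multiple_impute_mean(values, m=5, seed=0):
    """Impute m times, estimate the mean each time, and pool the results."""
    rng = random.Random(seed)
    estimates, variances = [], []
    for _ in range(m):
        completed = impute_once(values, rng)
        estimates.append(statistics.mean(completed))
        # Variance of the sample mean for this completed data set.
        variances.append(statistics.variance(completed) / len(completed))
    return rubin_pool(estimates, variances)


# Hypothetical detector speeds (mph); None marks a missing observation.
speeds = [62.0, 58.5, None, 60.1, None, 55.3, 61.7, 59.0]
pooled_mean, total_var = multiple_impute_mean(speeds, m=5, seed=42)
print(pooled_mean, total_var)
```

The pooled variance is larger than the variance from any single completed data set because it adds the spread between imputations, which is how multiple imputation reflects the uncertainty introduced by the missing values.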
Availability:
- Order URL: http://www.trb.org/Main/Public/Blurbs/155479.aspx
Authors:
- Ni, Daiheng
- Leonard II, John D
- Publication Date: 2005
Language
- English
Media Info
- Media Type: Print
- Features: Figures; References; Tables;
- Pagination: pp 57-67
- Monograph Title: Information Systems and Technology
Serial:
- Transportation Research Record: Journal of the Transportation Research Board
- Issue Number: 1935
- Publisher: Transportation Research Board
- ISSN: 0361-1981
Subject/Index Terms
- TRT Terms: Accuracy; Distributions (Statistics); Intelligent transportation systems; Markov chains; Monte Carlo method
- Uncontrolled Terms: Bayesian networks; Missing data; Multiple imputation; Robustness
- Subject Areas: Highways; Operations and Traffic Management; Planning and Forecasting; I72: Traffic and Transport Planning;
Filing Info
- Accession Number: 01023236
- Record Type: Publication
- ISBN: 0309094097
- Files: TRIS, TRB, ATRI
- Created Date: Apr 24 2006 1:01PM