Real-Time Twitter Data Mining Approach to Infer User Perception Toward Active Mobility
This study evaluates the level of service of shared transportation facilities through mining geotagged data from social media and analyzing the perceptions of road users. An algorithm is developed adopting a text classification approach with contextual understanding to filter out relevant information related to users’ perceptions toward active mobility. Using a heuristic-based keyword matching approach produces about 75% tweets that are out of context, so that approach is deemed unsuitable for information extraction from Twitter. This study implements six different text classification models and compares the performance of these models for tweet classification. The model is applied to real-world data to filter out relevant information, and content analysis is performed to check the distribution of keywords within the filtered data. The text classification model “term frequency-inverse document frequency” vectorizer-based logistic regression model performed best at classifying the tweets. To select the best model, the performances of the models are compared based on precision, recall, F1 score (geometric mean of precision and recall), and accuracy metrics. The findings from the analysis show that the proposed method can help produce more relevant information on walking and biking facilities as well as safety concerns. By analyzing the sentiments of the filtered data, the existing condition of biking and walking facilities in the DC area can be inferred. This method can be a critical part of the decision support system to understand the qualitative level of service of existing transportation facilities.
- Record URL:
-
Availability:
- Find a library where document is available. Order URL: http://worldcat.org/issn/03611981
-
Supplemental Notes:
- © National Academy of Sciences: Transportation Research Board 2021.
-
Authors:
- Rahman, Rezaur
- Redwan Shabab, Kazi
- Chandra Roy, Kamol
- Zaki, Mohamed H
- Hasan, Samiul
- Publication Date: 2021-9
Language
- English
Media Info
- Media Type: Web
- Features: Figures; References; Tables;
- Pagination: pp 947-960
-
Serial:
- Transportation Research Record: Journal of the Transportation Research Board
- Volume: 2675
- Issue Number: 9
- Publisher: Sage Publications, Incorporated
- ISSN: 0361-1981
- EISSN: 2169-4052
- Serial URL: http://journals.sagepub.com/home/trr
Subject/Index Terms
- TRT Terms: Bicycle facilities; Data mining; Evaluation; Level of service; Logistic regression analysis; Nonmotorized transportation; Pedestrian areas; Real time information; Social media
- Identifier Terms: Twitter
- Subject Areas: Data and Information Technology; Pedestrians and Bicyclists; Terminals and Facilities;
Filing Info
- Accession Number: 01764224
- Record Type: Publication
- Report/Paper Numbers: TRBAM-21-02955
- Files: TRIS, TRB, ATRI
- Created Date: Feb 4 2021 11:00AM