Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning
- PMID: 33923247
- PMCID: PMC8123223
- DOI: 10.3390/s21092993
Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning
Abstract
Digital societies could be characterized by their increasing desire to express themselves and interact with others. This is being realized through digital platforms such as social media that have increasingly become convenient and inexpensive sensors compared to physical sensors in many sectors of smart societies. One such major sector is road transportation, which is the backbone of modern economies and costs globally 1.25 million deaths and 50 million human injuries annually. The cutting-edge on big data-enabled social media analytics for transportation-related studies is limited. This paper brings a range of technologies together to detect road traffic-related events using big data and distributed machine learning. The most specific contribution of this research is an automatic labelling method for machine learning-based traffic-related event detection from Twitter data in the Arabic language. The proposed method has been implemented in a software tool called Iktishaf+ (an Arabic word meaning discovery) that is able to detect traffic events automatically from tweets in the Arabic language using distributed machine learning over Apache Spark. The tool is built using nine components and a range of technologies including Apache Spark, Parquet, and MongoDB. Iktishaf+ uses a light stemmer for the Arabic language developed by us. We also use in this work a location extractor developed by us that allows us to extract and visualize spatio-temporal information about the detected events. The specific data used in this work comprises 33.5 million tweets collected from Saudi Arabia using the Twitter API. Using support vector machines, naïve Bayes, and logistic regression-based classifiers, we are able to detect and validate several real events in Saudi Arabia without prior knowledge, including a fire in Jeddah, rains in Makkah, and an accident in Riyadh. The findings show the effectiveness of Twitter media in detecting important events with no prior knowledge about them.
Keywords: Arabic tweets; automatic labeling; big data; data analytics; distributed machine learning; event detection; road traffic; smart cities; social media; social media analytics.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
Similar articles
-
COVID-19: Detecting Government Pandemic Measures and Public Concerns from Twitter Arabic Data Using Distributed Machine Learning.Int J Environ Res Public Health. 2021 Jan 1;18(1):282. doi: 10.3390/ijerph18010282. Int J Environ Res Public Health. 2021. PMID: 33401512 Free PMC article.
-
A machine learning-based approach for sentiment analysis on distance learning from Arabic Tweets.PeerJ Comput Sci. 2022 Jul 26;8:e1047. doi: 10.7717/peerj-cs.1047. eCollection 2022. PeerJ Comput Sci. 2022. PMID: 36092011 Free PMC article.
-
Smarter Traffic Prediction Using Big Data, In-Memory Computing, Deep Learning and GPUs.Sensors (Basel). 2019 May 13;19(9):2206. doi: 10.3390/s19092206. Sensors (Basel). 2019. PMID: 31086055 Free PMC article.
-
A Review on the Trends in Event Detection by Analyzing Social Media Platforms' Data.Sensors (Basel). 2022 Jun 15;22(12):4531. doi: 10.3390/s22124531. Sensors (Basel). 2022. PMID: 35746313 Free PMC article. Review.
-
Emotion Analysis of Arabic Tweets: Language Models and Available Resources.Front Artif Intell. 2022 Mar 30;5:843038. doi: 10.3389/frai.2022.843038. eCollection 2022. Front Artif Intell. 2022. PMID: 35434606 Free PMC article. Review.
Cited by
-
Multi-generational labour markets: Data-driven discovery of multi-perspective system parameters using machine learning.Sci Prog. 2023 Oct-Dec;106(4):368504231213788. doi: 10.1177/00368504231213788. Sci Prog. 2023. PMID: 38018091 Free PMC article.
-
Psychological Health and Drugs: Data-Driven Discovery of Causes, Treatments, Effects, and Abuses.Toxics. 2023 Mar 20;11(3):287. doi: 10.3390/toxics11030287. Toxics. 2023. PMID: 36977052 Free PMC article.
-
Developing Smartness in Emerging Environments and Applications with a Focus on the Internet of Things.Sensors (Basel). 2022 Nov 18;22(22):8939. doi: 10.3390/s22228939. Sensors (Basel). 2022. PMID: 36433534 Free PMC article.
-
Analysis of the implementation of urban computing in smart cities: A framework for the transformation of Saudi cities.Heliyon. 2022 Oct 18;8(10):e11138. doi: 10.1016/j.heliyon.2022.e11138. eCollection 2022 Oct. Heliyon. 2022. PMID: 36303916 Free PMC article. Review.
-
LidSonic V2.0: A LiDAR and Deep-Learning-Based Green Assistive Edge Device to Enhance Mobility for the Visually Impaired.Sensors (Basel). 2022 Sep 30;22(19):7435. doi: 10.3390/s22197435. Sensors (Basel). 2022. PMID: 36236546 Free PMC article.
References
-
- Mehmood R., See S., Katib I., Chlamtac I., editors. EAI/Springer Innovations in Communication and Computing. Springer; Cham, Switzerland: 2020. Smart Infrastructure and Applications: Foundations for Smarter Cities and Societies; p. 692.
-
- Hashem I.A.T., Chang V., Anuar N.B., Adewole K., Yaqoob I., Gani A., Ahmed E., Chiroma H. The role of big data in smart city. Int. J. Inf. Manag. 2016;36:748–758. doi: 10.1016/j.ijinfomgt.2016.05.002. - DOI
-
- Zheng X., Chen W., Wang P., Shen D., Chen S., Wang X., Zhang Q., Yang L. Big Data for Social Transportation. IEEE Trans. Intell. Transp. Syst. 2016;17:620–630. doi: 10.1109/TITS.2015.2480157. - DOI
-
- AlOmari E., Mehmood R., Katib I. 2019 IEEE Smart World, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), Leicester, UK, 19–23 August 2019. Institute of Electrical and Electronics Engineers (IEEE); Piscataway, NJ, USA: 2019. Road Traffic Event Detection Using Twitter Data, Machine Learning, and Apache Spark; pp. 1888–1895.
-
- Huang W., Xu S., Yan Y., Zipf A. An exploration of the interaction between urban human activities and daily traffic conditions: A case study of Toronto, Canada. Cities. 2019;84:8–22. doi: 10.1016/j.cities.2018.07.001. - DOI
Grants and funding
LinkOut - more resources
Full Text Sources
