Semi-literate Texting (SLT): Survey based text message dataset from digitally semi-literate users in India

Data Brief. 2021 Aug 26:38:107329. doi: 10.1016/j.dib.2021.107329. eCollection 2021 Oct.

Abstract

The dataset explicates text messages and associated metadata from digitally semi-literate mobile phone users in India. A survey among urban and rural representatives conducted between July 2020 and November 2020 is the origin for this dataset. The data has been collected through face to face interviews and online surveys across urban and rural geographies in India, largely from western region of Maharashtra. A total of 382 respondents, accumulating 3368 messages has been composed (approximately 90% through face to face surveys and 10% from online mode). To the best of our knowledge there is no factual text message data from digitally semi-literate users being available till date. This dataset can be used for bridging the digital divide in human computer interaction using machine learning, data mining, behavioural analysis as well as in other fields.

Keywords: Digitally Semi-literate; Emergent mobile phone users; Text messages; Texting.