ITC-net-audio-5: an audio streaming dataset for application identification in network traffic classification

BMC Res Notes. 2024 Feb 27;17(1):57. doi: 10.1186/s13104-024-06718-7.

Abstract

Objectives: An essential aspect of network traffic classification is application identification. This involves capturing and analyzing the traffic patterns of applications. There are a few publicly available datasets that specifically capture streaming data from network-based applications. Therefore, our objective is to generate an up-to-date dataset with a focus on audio streaming data. This dataset can be a valuable resource for identifying audio streaming applications in the field of network traffic classification.

Data description: The dataset contains network traffic captured during audio streaming communications on five trending applications: Google Meet, Skype, Telegram, WhatsApp, and SoundCloud. It includes 500 files in PCAP format captured by Wireshark and PCAPdroid tools during voice calls and online music playback. The concurrent utilization of these tools facilitates the avoidance of capturing background traffic.

Keywords: Application identification; Audio Streaming; Dataset; Network Traffic classification; Traffic capturing.

Publication types

  • Dataset