EmoKbGAN: Emotion controlled response generation using Generative Adversarial Network for knowledge grounded conversation

Deeksha Varshney; Asif Ekbal; Mrigank Tiwari; Ganesh Prasad Nagaraja

doi:10.1371/journal.pone.0280458

EmoKbGAN: Emotion controlled response generation using Generative Adversarial Network for knowledge grounded conversation

PLoS One. 2023 Feb 16;18(2):e0280458. doi: 10.1371/journal.pone.0280458. eCollection 2023.

Authors

Deeksha Varshney¹, Asif Ekbal¹, Mrigank Tiwari², Ganesh Prasad Nagaraja²

Affiliations

¹ Department of Computer Science & Engineering, Indian Institute of Technology Patna, Patna, India.
² Samsung Research India, Bangalore, India.

Abstract

Neural open-domain dialogue systems often fail to engage humans in long-term interactions on popular topics such as sports, politics, fashion, and entertainment. However, to have more socially engaging conversations, we need to formulate strategies that consider emotion, relevant-facts, and user behaviour in multi-turn conversations. Establishing such engaging conversations using maximum likelihood estimation (MLE) based approaches often suffer from the problem of exposure bias. Since MLE loss evaluates the sentences at the word level, we focus on sentence-level judgment for our training purposes. In this paper, we present a method named EmoKbGAN for automatic response generation that makes use of the Generative Adversarial Network (GAN) in multiple-discriminator settings involving joint minimization of the losses provided by each attribute specific discriminator model (knowledge and emotion discriminator). Experimental results on two bechmark datasets i.e the Topical Chat and Document Grounded Conversation dataset yield that our proposed method significantly improves the overall performance over the baseline models in terms of both automated and human evaluation metrics, asserting that the model can generate fluent sentences with better control over emotion and content quality.

Copyright: © 2023 Varshney et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

MeSH terms

Communication*
Emotions
Humans
Image Processing, Computer-Assisted / methods
Neural Networks, Computer*

Grants and funding

Samsung Research India, Bangalore, provided support in the form of salaries for MT and GPN. The specific roles of these authors are articulated in the ‘author contributions’ section. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.