EmoKbGAN: Emotion controlled response generation using Generative Adversarial Network for knowledge grounded conversation

PLoS One. 2023 Feb 16;18(2):e0280458. doi: 10.1371/journal.pone.0280458. eCollection 2023.

Abstract

Neural open-domain dialogue systems often fail to engage humans in long-term interactions on popular topics such as sports, politics, fashion, and entertainment. However, to have more socially engaging conversations, we need to formulate strategies that consider emotion, relevant-facts, and user behaviour in multi-turn conversations. Establishing such engaging conversations using maximum likelihood estimation (MLE) based approaches often suffer from the problem of exposure bias. Since MLE loss evaluates the sentences at the word level, we focus on sentence-level judgment for our training purposes. In this paper, we present a method named EmoKbGAN for automatic response generation that makes use of the Generative Adversarial Network (GAN) in multiple-discriminator settings involving joint minimization of the losses provided by each attribute specific discriminator model (knowledge and emotion discriminator). Experimental results on two bechmark datasets i.e the Topical Chat and Document Grounded Conversation dataset yield that our proposed method significantly improves the overall performance over the baseline models in terms of both automated and human evaluation metrics, asserting that the model can generate fluent sentences with better control over emotion and content quality.

MeSH terms

  • Communication*
  • Emotions
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Neural Networks, Computer*

Grants and funding

Samsung Research India, Bangalore, provided support in the form of salaries for MT and GPN. The specific roles of these authors are articulated in the ‘author contributions’ section. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.