Toward Community-Based Natural Language Processing (CBNLP): Cocreating With Communities

J Med Internet Res. 2023 Aug 4:25:e48498. doi: 10.2196/48498.


Rapid development and adoption of natural language processing (NLP) techniques has led to a multitude of exciting and innovative societal and health care applications. These advancements have also generated concerns around perpetuation of historical injustices and that these tools lack cultural considerations. While traditional health care NLP techniques typically include clinical subject matter experts to extract health information or aid in interpretation, few NLP tools involve community stakeholders with lived experiences. In this perspective paper, we draw upon the field of community-based participatory research, which gathers input from community members for development of public health interventions, to identify and examine ways to equitably involve communities in developing health care NLP tools. To realize the potential of community-based NLP (CBNLP), research and development teams must thoughtfully consider mechanisms and resources needed to effectively collaborate with community members for maximal societal and ethical impact of NLP-based tools.

Keywords: ChatGPT; artificial intelligence; co-creation; co-design; collaboration; collaborative; community based; community-based participatory research; lived experience; lived experiences; machine learning; natural language processing; participatory; research design.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, N.I.H., Extramural

MeSH terms

  • Community-Based Participatory Research*
  • Humans
  • Natural Language Processing*