Digital mental health tools, such as chatbots, address the growing demand for psychological support but often lack dynamic, empathetic, and personalized interactions. This study evaluates the integration of Large Language Models into structured chatbots to enhance the delivery of the World Health Organization’s Self-Help+ intervention. A within-subject experiment with 44 participants compared a standard state-machine chatbot to a Large Language Model augmented version. Participants rated the chatbots using a specialized evaluation scale and provided qualitative feedback. The Large Language Model enriched the chatbot significantly; its integration improved emotional support, memory retention, and user satisfaction. Challenges included hallucinations, verbosity, and limited conversational flexibility due to the retainment of the finite-state structure. These findings highlight Large Language Models’ potential to improve digital mental health tools and that structured conversation approaches are perceived as a major drawback for users, even when interacting with Large Language Models. Future efforts should, therefore, address hallucination mitigation and explore multi-agent architectures to enhance adaptability and user experience.

LLM-Enriched Finite-State Chatbots for Mental Health Support: A Case Study on Self-Help+

Marco Bolpagni;Valentina Fietta;
2025

Abstract

Digital mental health tools, such as chatbots, address the growing demand for psychological support but often lack dynamic, empathetic, and personalized interactions. This study evaluates the integration of Large Language Models into structured chatbots to enhance the delivery of the World Health Organization’s Self-Help+ intervention. A within-subject experiment with 44 participants compared a standard state-machine chatbot to a Large Language Model augmented version. Participants rated the chatbots using a specialized evaluation scale and provided qualitative feedback. The Large Language Model enriched the chatbot significantly; its integration improved emotional support, memory retention, and user satisfaction. Challenges included hallucinations, verbosity, and limited conversational flexibility due to the retainment of the finite-state structure. These findings highlight Large Language Models’ potential to improve digital mental health tools and that structured conversation approaches are perceived as a major drawback for users, even when interacting with Large Language Models. Future efforts should, therefore, address hallucination mitigation and explore multi-agent architectures to enhance adaptability and user experience.
2025
Artificial Intelligence in Medicine. AIME 2025. Lecture Notes in Computer Science
Artificial Intelligence in Medicine. AIME 2025.
978-3-031-95838-0
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3555774
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex 0
social impact