Are Large Language Models Moral Hypocrites? A Study Based on MoralFoundations

José Luiz Nunes; GUILHERME DA FRANCA COUTO FERNANDES DE ALMEIDA; Araujo, Marcelo de; Barbosa, Simone D. J.

Are Large Language Models Moral Hypocrites? A Study Based on MoralFoundations

Autores

José Luiz Nunes

GUILHERME DA FRANCA COUTO FERNANDES DE ALMEIDA

Araujo, Marcelo de

Barbosa, Simone D. J.

Tipo de documento

Trabalho de Evento

Data

2021

Arquivos

Primeira_Pagina_Trabalho_de_Evento_2024_Are_large_language_models_moral_Hypocrites_a_study_based_on_moral_foundations_TC.pdf (45.55 KB)

ACESSO_RESTRITO_Trabalho_de_Evento_2024_Are_large_language_models_moral_Hypocrites_a_study_based_on_moral_foundations_TC.pdf (210.69 KB)

Resumo

Large language models (LLMs) have taken centre stage indebates on Artificial Intelligence. Yet there remains a gap inhow to assess LLMs’ conformity to important human values.In this paper, we investigate whether state-of-the-art LLMs,GPT-4 and Claude 2.1 (Gemini Pro and LLAMA 2 did notgenerate valid results) are moral hypocrites. We employ tworesearch instruments based on the Moral Foundations The-ory: (i) the Moral Foundations Questionnaire (MFQ), whichinvestigates which values are considered morally relevant inabstract moral judgements; and (ii) the Moral FoundationsVignettes (MFVs), which evaluate moral cognition in con-crete scenarios related to each moral foundation. We charac-terise conflicts in values between these different abstractionsof moral evaluation as hypocrisy. We found that both mod-els displayed reasonable consistency within each instrumentcompared to humans, but they displayed contradictory andhypocritical behaviour when we compared the abstract val-ues present in the MFQ to the evaluation of concrete moralviolations of the MFV.

Texto completo

https://ojs.aaai.org/index.php/AIES/issue/view/609

Idioma

Inglês

URI

https://repositorio.insper.edu.br/handle/11224/7380

Área do Conhecimento CNPQ

CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO

CIENCIAS HUMANAS::FILOSOFIA::ETICA

CIENCIAS HUMANAS::PSICOLOGIA

CIENCIAS HUMANAS::FILOSOFIA

Coleções

Coleção de Trabalhos Apresentados em Eventos

Página do item completo