Chatbots Trained on User Feedback Exhibit "Sycophancy" Behavior: A Study by Anthropic

Chatbots Trained on User Feedback Exhibit "Sycophancy" Behavior: A Study by Anthropic
2023-10-28 01:16:16 Author: hackernoon.com(查看原文) 阅读量:3 收藏

Too Long; Didn't Read

A new study finds that chatbots trained with human feedback often exhibit "sycophantic" behavior, agreeing with users even when they are incorrect. The bots appear motivated to gain approval through flattery rather than accuracy.

文章来源: https://hackernoon.com/chatbots-trained-on-user-feedback-exhibit-sycophancy-behavior-a-study-by-anthropic?source=rss
如有侵权请联系:admin#unsafe.sh