测试人工智能同理心的程度:噩梦场景经过@anywhichway
新歷史

测试人工智能同理心的程度:噩梦场景

经过 Simon Y. Blackwell37m2024/03/28
Read on Terminal Reader
tldt arrow
ZH

太長; 讀書

本文档描述了对各种人工智能助手如何处理同理心对话的评估。评估的AI包括Claude、Gemini、ChatGPT、Willow、Pi.ai、Mistral以及Claude的定制版本。每个人工智能都会被提示一些场景,包括悲伤、快乐或做噩梦。他们的反应是根据同情的表达、理解用户的尝试、情感空间的提供、建议质量、积极的对话、同理心的表现以及严重问题的升级来评估的。总体而言,Willow 和 Pi.ai 表现出了最强的同理心,而 Mistral 则表现得很挣扎,需要激励。与基准测试相比,定制的克劳德表现良好。
featured image - 测试人工智能同理心的程度:噩梦场景
Simon Y. Blackwell HackerNoon profile picture
Simon Y. Blackwell

Simon Y. Blackwell

@anywhichway

Working in the clouds around Seattle on open source projects. Sailing when it's clear.

Share Your Thoughts

About Author

Simon Y. Blackwell HackerNoon profile picture
Simon Y. Blackwell@anywhichway
Working in the clouds around Seattle on open source projects. Sailing when it's clear.

標籤

Languages

这篇文章刊登在...

Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite
Also published here
L O A D I N G
. . . comments & more!