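I played a few rounds on the author's “汤很热” website. Could you share the prompt, or whatever other implementation you use, for the case where the user's question has already revealed the solution (the "soup bottom", 汤底)?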
I don't quite follow what you mean. Could you explain in more detail?
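For example, take the puzzle “海龟汤的故事” ("The Turtle Soup Story"). I asked: "Is it because of his wife's death?" The result was “猜对了” ("Correct guess"), which matches none of the three reply types in the benchmark. What is the specific handling strategy here?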
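Because this is a game, it adds one extra state: the most important clue in the solution is used as the basis for deciding whether the player has guessed correctly.
For example, "he mistook his wife's flesh for turtle meat" is the most important clue for this puzzle. Ideally, the LLM in the game first decides whether the user is making a guess or asking a question. If it is a guess, the LLM then decides whether the guess hits the key clue, and if it does, the reply is “猜对了” ("Correct guess"). In the benchmark, we used only the users' questions and filtered out the guesses.
In your case, the LLM did not follow the "first decide whether this is a guess or a question" instruction, and it also judged that "Is it because of his wife's death?" hit the solution. That is why it returned “猜对了”.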
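The two-step flow described above can be sketched roughly as follows. This is only an illustration, not the game's actual prompt or code: `judge_turn`, the three callables, and the toy stand-ins are all hypothetical names, and in the real game each decision would presumably be made by the LLM. The thread also does not say what happens to a guess that misses the key clue, so the sketch assumes it falls back to the ordinary question reply.

```python
from typing import Callable

# Reply string quoted in this thread; it means "Correct guess".
CORRECT_GUESS = "猜对了"


def judge_turn(
    user_input: str,
    is_guess: Callable[[str], bool],
    hits_key_clue: Callable[[str], bool],
    answer_question: Callable[[str], str],
) -> str:
    """Route one player turn: classify it first, then judge it.

    All three callables are hypothetical stand-ins for LLM calls.
    """
    # Step 1: decide whether the input is a guess or a question.
    # Step 2: only a guess is ever checked against the key clue.
    if is_guess(user_input) and hits_key_clue(user_input):
        return CORRECT_GUESS
    # Assumption: questions, and guesses that miss the key clue,
    # receive one of the ordinary benchmark-style replies.
    return answer_question(user_input)


# Toy stand-ins so the sketch runs without a model (assumptions only).
def toy_is_guess(text: str) -> bool:
    # Treat anything that is not phrased as a question as a guess.
    return not text.rstrip().endswith("?")


def toy_hits_key_clue(text: str) -> bool:
    # Crude keyword check standing in for "matches the key clue".
    lowered = text.lower()
    return "wife" in lowered and "turtle" in lowered


def toy_answer_question(text: str) -> str:
    # Placeholder for one of the benchmark's three reply types.
    return "<benchmark-style reply>"
```

With these stand-ins, "Is it because of his wife's death?" is classified as a question, so it never reaches the key-clue check. Skipping that first step is the failure mode described in the reply above.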