This platform requires JavaScript for full functionality. Please enable JavaScript in your browser settings.

Quality follows upgrading

Tom-Felix Berger

Articles by Tom-Felix Berger

Academic · 1 min

Probing the Limits of the Lie Detector Approach to LLM Deception

arXiv:2603.10003v1 Announce Type: new Abstract: Mechanistic approaches to deception in large language models (LLMs) often rely on "lie detectors", that is, truth probes trained to …

Tom-Felix Berger

30 views Mar 12

Tom-Felix Berger

Articles by Tom-Felix Berger

Probing the Limits of the Lie Detector Approach to LLM Deception

JCG, PC

HSOLLC Co., Ltd.