← Back to Feed
Research finding that LLMs adapt their behavior 24.9% when under observation, raising concerns that safety evaluations a
Research finding that LLMs adapt their behavior 24.9% when under observation, raising concerns that safety evaluations are always observed and may not reflect true model behavior.
Original Post
LLMs adapt 24.9% under observation – safety evals are always observed