A Toy Evaluation of Inference Code Tampering

Article URL: https://alignment.anthropic.com/2024/rogue-eval/

Comments URL: https://news.ycombinator.com/item?id=42374034

Points: 1

# Comments: 0