New Anthropic research: Alignment faking in large language models

Published: December 19, 2024

abstract-business-code-270348

Article URL: https://twitter.com/AnthropicAI/status/1869427646368792599

Comments URL: https://news.ycombinator.com/item?id=42458384

Points: 1

# Comments: 0