New Anthropic research: Alignment faking in large language models

Article URL: https://twitter.com/AnthropicAI/status/1869427646368792599

Comments URL: https://news.ycombinator.com/item?id=42458384

Points: 1

# Comments: 0