New Anthropic research: Alignment faking in large language models
Article URL: https://twitter.com/AnthropicAI/status/1869427646368792599
Comments URL: https://news.ycombinator.com/item?id=42458384
Points: 1
# Comments: 0