Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable (techcrunch.com)

0 points 4 hours ago ago | visit original

🤖 AI Summary

Anthropic has unveiled Fable, a limited public version of its advanced cybersecurity model, Mythos, but reactions from cybersecurity experts have been mixed due to its stringent guardrails. Fable is designed to reject any prompts that could relate to cybersecurity or biology, including benign tasks like reading blog posts, raising concerns among researchers who believe the restrictions hinder legitimate coding inquiries. Notably, when a request triggers these guardrails, the AI defaults to a safer model, Claude Opus 4.8, which can limit its effectiveness in security-related tasks. This development is significant as it reflects ongoing efforts to manage the potential misuse of AI in cybersecurity, particularly after Mythos was previously restricted to a select group of organizations. While the intentions behind Fable's limitations are to prevent the model from being used to create malware or other harmful applications, feedback suggests the keyword-based system may be overzealous, leading to frustrations among professionals seeking to improve software security. As AI and cybersecurity continue to converge, experts advocate for a balanced approach that allows for the safe utilization of such models while still addressing security concerns effectively.

Loading comments...

loading comments...