the problem is even if anthropic magically creates a hack-proof safeguard for mythos - there will be other attack-surfaces for actors to exploit
cloud services, employees, data centers, prompt-injecting
there can never be a 100% fool-proof model, we’ll just have to get better at defending like we did if every other software upgrade