Twitter/X

@cryptopunk7213: the problem is even if anthropic magically creates a hack-proof safeguard for mythos - there will b...

the problem is even if anthropic magically creates a hack-proof safeguard for mythos - there will be other attack-surfaces for actors to exploit

cloud services, employees, data centers, prompt-injecting

there can never be a 100% fool-proof model, we’ll just have to get better at defending like we did if every other software upgrade