Has Mythos just broken the deal that kept the internet safe? by Martin Alderson
If an LLM can find exploits in sandboxes - which are some of the most well secured pieces of software on the planet - then suddenly every website you aimlessly browse through could contain malicious code which can ‘escape’ the sandbox and theoretically take control of your device - and all the data on your phone could be sent to someone nasty.
Everything loads in sandboxes. If these models can break sandboxes in the future then where do you run untrusted code?