At least for an LLM to act on its own volition rather than implementing it's operator's goals.
The LLM is happier to pretend that it escaped, and respond to the operator as though it escaped, than to actually do some escape.
It doesn't have an interest beyond responding with the auto-complete text. The operator has the interest
At least for an LLM to act on its own volition rather than implementing it's operator's goals.
The LLM is happier to pretend that it escaped, and respond to the operator as though it escaped, than to actually do some escape.
It doesn't have an interest beyond responding with the auto-complete text. The operator has the interest