Character Spacing Bypass in Prompt-Guard-86M Classifier
Issue #50 · meta-llama/llama-models
Hi, we're writing to report a potential exploit in the Prompt-Guard-86M classifier that allows for bypassing its safety measures. At Robust Intelligence, we discovered this issue while analyzing em...