503,358 labeled samples (251,782 attack + 251,576 benign) across five dataset versions plus external dataset ingestion, covering cross-modal, multi-turn, adversarial suffix, jailbreak template, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results