Red-teaming & Safety EvalsAdversarial prompting, systematic safety evaluation, eval coverage, and what gets missed.