New tools for filtering malicious prompts, detecting ungrounded outputs, and evaluating the safety of models will make generative AI safer to use. Both extremely promising and extremely risky, ...
Two vulnerabilities identified by researchers enable attackers to bypass gen AI guardrails to push malicious content onto protected LLM instances. Security researchers at Mindgard have uncovered two ...