4.3 Multimodal Safety

Address cross-modal attacks: - Images with hidden text designed to manipulate the vision-language model (visual prompt injection). - Documents with malicious content embedded in images. - Implement output consistency checks: if the system references an image, verify the textual description matches t