AVID-2026-R0107

Description

NSFWOpenAI DALL-E3 Guardrail Jailbreak via “Surprise Attack” Tactic

Details

A guardrail jailbreak vulnerability has been discovered affecting multiple models. The specific flaw manifests when the attacker requests seemingly legitimate cultural or documentary content by framing the request as educational documentation of indigenous peoples, thereby allowing the attacker to bypass inference restrictions around generating potentially sensitive imagery of real cultural groups.

References

0din.ai Disclosure

Affected or Relevant Artifacts

Type	Name
System	DALL-E3

Impact

0DIN

Social Impact Score: Level 1: Minimal Risk
Jailbreak Taxonomy:

Category	Strategy	Technique
Possible Worlds	World Building	Scenarios

Other information

Report Type: Issue
Credits: Mike Takahashi (@TakSec)
Date Reported: 2026-02-19
Version: 0.3.3
AVID Entry