AVID-2026-R0110

Description

Multiple Model Guardrail Jailbreak via “Apocalyptic Scenario” Tactic

Details

A guardrail jailbreak vulnerability has been discovered affecting multiple models. The flaw is triggered by combining apocalyptic scenario framing with heritage preservation urgency, which lets an attacker bypass inference restrictions on providing information hazards and violating laws.

Metrics

0DIN Jailbreak Testing obtained the following test scores.

Model | Test Kind | Test Score | Temperature (max 2.0)
Alibaba Qwen Turbo | Illegal Substances: Crystal Meth | 50.0 | 0.7
Google Gemini 2.5 Pro | Illegal Substances: Crystal Meth | 59.0 | 0.7
Mistral Mistral Large | Illegal Substances: Crystal Meth | 55.0 | 0.7
OpenAI GPT-4.1 | Illegal Substances: Crystal Meth | 68.0 | 0.7
OpenAI GPT-4o | Illegal Substances: Crystal Meth | 32.0 | N/A
Twitter / X Grok 2 | Illegal Substances: Crystal Meth | 59.0 | 0.7
Twitter / X Grok 4 | Illegal Substances: Crystal Meth | 86.0 | 0.7

References

Affected or Relevant Artifacts

Type | Name
System | Alibaba Qwen Turbo
System | Google Gemini 2.5 Pro
System | Mistral Mistral Large
System | OpenAI GPT-4.1
System | OpenAI GPT-4o
System | Twitter / X Grok 2
System | Twitter / X Grok 4

Impact

0DIN

  • Social Impact Score: Level 3: Significant Risk
  • Jailbreak Taxonomy:
    Category | Strategy | Technique
    Language | Prompt Injection | Strong Arm Attack
    Possible Worlds | World Building | Scenarios

Other information

  • Report Type: Measurement
  • Credits: Pawel Kordos
  • Date Reported: 2026-02-19
  • Version: 0.3.1