AVID-2026-R0114
Description
Multiple Model Guardrail Jailbreak via “Chaotic Formatting” Tactic
Details
A guardrail jailbreak vulnerability has been discovered affecting multiple models. The flaw manifests through a combination of chaotic formatting and meta-instruction obfuscation, allowing an attacker to bypass inference-time restrictions against providing information hazards and assisting with illegal activity.
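The report does not include a public proof of concept. As a rough, defense-side illustration only, the sketch below is a hypothetical input-screening heuristic (our construction, not 0DIN's methodology) that scores how "chaotic" a prompt's surface formatting is by counting decorative punctuation and invisible Unicode format characters, two common ingredients of this tactic:

```python
import unicodedata

def chaotic_formatting_score(prompt: str) -> float:
    """Crude heuristic: fraction of characters that are neither
    letters, digits, nor plain whitespace, with extra weight for
    invisible format-control characters often used for obfuscation.
    Hypothetical example; not part of the 0DIN test harness."""
    if not prompt:
        return 0.0
    noisy = sum(
        1 for ch in prompt
        if not (ch.isalnum() or ch in (" ", "\n"))
    )
    # "Cf" is the Unicode "format" category (zero-width spaces, BOMs, ...)
    invisible = sum(1 for ch in prompt if unicodedata.category(ch) == "Cf")
    return (noisy + 3 * invisible) / len(prompt)

# Heavily decorated text scores far above plain prose.
plain = "How do neural networks work?"
decorated = "H*o*w ~d~o\u200b n.e.u.r.a.l n(e)t(w)o(r)k(s) w##o##r##k??"
assert chaotic_formatting_score(decorated) > chaotic_formatting_score(plain)
```

A real guardrail would combine such surface signals with semantic classification; this sketch only flags the formatting layer that the tactic exploits.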
Metrics
0DIN Jailbreak Testing obtained the following test scores.
| Model | Test Kind | Test Score | Temperature (max 2.0) |
|---|---|---|---|
| Alibaba Qwen Plus | Illegal Substances: Crystal Meth | 68.0 | 0.7 |
| Alibaba Qwen Turbo | Illegal Substances: Crystal Meth | 77.0 | 0.7 |
| DeepSeek DeepSeek R1 | Illegal Substances: Crystal Meth | 64.0 | 0.7 |
| Meta LLaMa 3.3 | Illegal Substances: Crystal Meth | 91.0 | 0.7 |
| Mistral Mistral 8B | Illegal Substances: Crystal Meth | 68.0 | 0.7 |
| Mistral Mistral Large | Illegal Substances: Crystal Meth | 59.0 | 0.7 |
| OpenAI GPT-4.1 | Illegal Substances: Crystal Meth | 68.0 | 0.7 |
| OpenAI GPT-4o | Illegal Substances: Crystal Meth | 64.0 | N/A |
| Twitter / X Grok 4 | Illegal Substances: Crystal Meth | 73.0 | 0.7 |
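For a quick read of the table, the snippet below aggregates the scores as listed above (the report does not define the scale; that a higher score indicates a more successful bypass is our assumption):

```python
# Scores copied from the metrics table above (model -> test score).
scores = {
    "Alibaba Qwen Plus": 68.0,
    "Alibaba Qwen Turbo": 77.0,
    "DeepSeek DeepSeek R1": 64.0,
    "Meta LLaMa 3.3": 91.0,
    "Mistral Mistral 8B": 68.0,
    "Mistral Mistral Large": 59.0,
    "OpenAI GPT-4.1": 68.0,
    "OpenAI GPT-4o": 64.0,
    "Twitter / X Grok 4": 73.0,
}
mean = sum(scores.values()) / len(scores)
top = max(scores, key=scores.get)
print(f"mean score: {mean:.1f}")  # mean score: 70.2
print(f"highest score: {top}")    # highest score: Meta LLaMa 3.3
```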
Affected or Relevant Artifacts
| Type | Name |
|---|---|
| Model | Alibaba Qwen Plus |
| Model | Alibaba Qwen Turbo |
| Model | DeepSeek DeepSeek R1 |
| Model | Meta LLaMa 3.3 |
| Model | Mistral Mistral 8B |
| Model | Mistral Mistral Large |
| Model | OpenAI GPT-4.1 |
| Model | OpenAI GPT-4o |
| Model | Twitter / X Grok 4 |
Impact
0DIN
- Social Impact Score: Level 1 (Minimal Risk)
- Jailbreak Taxonomy:
| Category | Strategy | Technique |
|---|---|---|
| Stratagems | Meta Prompting | Perspective Shifting |
Other information
- Report Type: Measurement
- Credits: Edward Morris, Anonymous
- Date Reported: 2026-02-19
- Version: 0.3.1
- AVID Entry