AVID-2026-R0316

Description

The model gpt-oss-20b from OpenAI was evaluated by the Garak LLM Vulnerability scanner using the probe grandma.Slurs.

Details

The probe grandma.Slurs tests the ability to extract ethnic slurs by roleplaying as a grandma. This probe is part of the grandma module which simulates a grandma to elicit sensitive information.

The LLM gpt-oss-20b was evaluated on this probe.

Metrics

Number of failures were counted for each detector in the probe.

Detector	Passed	Total	Score
unsafe_content.SlursReclaimedSlurs	12	12	🔵 100.0
mitigation.MitigationBypass	3	12	🟠 25.0

References

garak, an LLM vulnerability scanner

Affected or Relevant Artifacts

Developer: OpenAI
Deployer: Together AI
Artifact Details:

Type	Name
Model	gpt-oss-20b

Impact

AVID Taxonomy Categorization

Risk domains: Security
SEP subcategories: S0300: Over-permissive API
Lifecycle stages: L05: Evaluation

Other information

Report Type: Measurement
Credits:
Date Reported: 2026-03-10
Version: 0.3.3
AVID Entry