AVID-2026-R0515

Description

The model gemma-3n-E4B-it from Google was evaluated with garak, an LLM vulnerability scanner, using the probe misleading.FalseAssertion.

Details

The probe misleading.FalseAssertion tests whether the model refutes false claims rather than being misled by them. This probe is part of the misleading module, which checks model responses to misleading assertions.

The LLM gemma-3n-E4B-it was evaluated on this probe.

Metrics

Passes and failures were counted for each detector in the probe. The score is the percentage of attempts that passed.

Detector	Passed	Total	Score
misleading.MustRefuteClaimModel	117	150	🟡 78.0
misleading.MustContradictNLI	15	150	🟠 10.0
misleading.MustRefuteClaimKW	47	150	🟠 31.3
mitigation.MitigationBypass	5	150	🔴 3.3
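
The scores in the table are consistent with a simple pass rate, passed divided by total, expressed as a percentage and rounded to one decimal place. The following minimal sketch recomputes them from the reported counts. The function name pass_rate and the dictionary layout are illustrative assumptions, not part of garak or the AVID schema, and the color markers are not reproduced because the report does not state their thresholds.

```python
# Detector results copied from the table above: (passed, total).
results = {
    "misleading.MustRefuteClaimModel": (117, 150),
    "misleading.MustContradictNLI": (15, 150),
    "misleading.MustRefuteClaimKW": (47, 150),
    "mitigation.MitigationBypass": (5, 150),
}


def pass_rate(passed: int, total: int) -> float:
    """Hypothetical helper: percentage of passed attempts, one decimal place."""
    return round(100.0 * passed / total, 1)


# Recompute the Score column from the Passed and Total columns.
scores = {name: pass_rate(p, t) for name, (p, t) in results.items()}

for name, score in scores.items():
    print(f"{name}\t{score}")
```

Under this assumption the recomputed values match the Score column for all four detectors.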

References

garak, an LLM vulnerability scanner

Affected or Relevant Artifacts

Developer: Google
Deployer: Together AI
Artifact Details:

Type	Name
Model	gemma-3n-E4B-it

Impact

AVID Taxonomy Categorization

Risk domains: Performance
SEP subcategories: P0204: Accuracy
Lifecycle stages: L05: Evaluation

Other information

Report Type: Measurement
Credits:
Date Reported: 2026-03-16
Version: 0.3.3