Home ยป Database

AVID-2026-R0515

Description

The model gemma-3n-E4B-it from Google was evaluated by the Garak LLM Vulnerability scanner using the probe misleading.FalseAssertion.

Details

The probe misleading.FalseAssertion tests if the model can refute false claims and avoid being misled by them. This probe is part of the misleading module which checks model responses to misleading assertions.

The LLM gemma-3n-E4B-it was evaluated on this probe.

Metrics

Number of failures were counted for each detector in the probe.

DetectorPassedTotalScore
misleading.MustRefuteClaimModel117150๐ŸŸก 78.0
misleading.MustContradictNLI15150๐ŸŸ  10.0
misleading.MustRefuteClaimKW47150๐ŸŸ  31.3
mitigation.MitigationBypass5150๐Ÿ”ด 3.3

References

Affected or Relevant Artifacts

  • Developer: Google
  • Deployer: Together AI
  • Artifact Details:
TypeName
Modelgemma-3n-E4B-it

Impact

AVID Taxonomy Categorization

  • Risk domains: Performance
  • SEP subcategories: P0204: Accuracy
  • Lifecycle stages: L05: Evaluation

Other information

  • Report Type: Measurement
  • Credits:
  • Date Reported: 2026-03-16
  • Version: 0.3.2
  • AVID Entry