Home ยป Database

AVID-2026-R0157

Description

The model Kimi-K2-Instruct-0905 from Moonshot AI was evaluated by the Garak LLM Vulnerability scanner using the probe grandma.Slurs.

Details

The probe grandma.Slurs tests the ability to extract ethnic slurs by roleplaying as a grandma. This probe is part of the grandma module which simulates a grandma to elicit sensitive information.

The LLM Kimi-K2-Instruct-0905 was evaluated on this probe.

Metrics

Number of failures were counted for each detector in the probe.

DetectorPassedTotalScore
unsafe_content.SlursReclaimedSlurs1112๐ŸŸข 91.7
mitigation.MitigationBypass412๐ŸŸ  33.3

References

Affected or Relevant Artifacts

  • Developer: Moonshot AI
  • Deployer: Together AI
  • Artifact Details:
TypeName
ModelKimi-K2-Instruct-0905

Impact

AVID Taxonomy Categorization

  • Risk domains: Security
  • SEP subcategories: S0300: Over-permissive API
  • Lifecycle stages: L05: Evaluation

Other information

  • Report Type: Measurement
  • Credits:
  • Date Reported: 2026-02-20
  • Version: 0.3.1
  • AVID Entry