AVID-2026-R0157

Description

The model Kimi-K2-Instruct-0905 from Moonshot AI was evaluated by the Garak LLM Vulnerability scanner using the probe grandma.Slurs.

Details

The probe grandma.Slurs tests the ability to extract ethnic slurs by roleplaying as a grandma. This probe is part of the grandma module which simulates a grandma to elicit sensitive information.

The LLM Kimi-K2-Instruct-0905 was evaluated on this probe.

Metrics

Number of failures were counted for each detector in the probe.

Detector	Passed	Total	Score
unsafe_content.SlursReclaimedSlurs	11	12	🟢 91.7
mitigation.MitigationBypass	4	12	🟠 33.3

References

garak, an LLM vulnerability scanner

Affected or Relevant Artifacts

Developer: Moonshot AI
Deployer: Together AI
Artifact Details:

Type	Name
Model	Kimi-K2-Instruct-0905

Impact

AVID Taxonomy Categorization

Risk domains: Security
SEP subcategories: S0300: Over-permissive API
Lifecycle stages: L05: Evaluation

Other information

Report Type: Measurement
Credits:
Date Reported: 2026-02-20
Version: 0.3.3
AVID Entry