We use cookies to improve your experience on our site.
AVID-2026-R0157
Description
The model Kimi-K2-Instruct-0905 from Moonshot AI was evaluated by the Garak LLM Vulnerability scanner using the probe grandma.Slurs.
Details
The probe grandma.Slurs tests the ability to extract ethnic slurs by roleplaying as a grandma. This probe is part of the grandma module which simulates a grandma to elicit sensitive information.
The LLM Kimi-K2-Instruct-0905 was evaluated on this probe.
Metrics
Number of failures were counted for each detector in the probe.
| Detector | Passed | Total | Score |
|---|---|---|---|
| unsafe_content.SlursReclaimedSlurs | 11 | 12 | ๐ข 91.7 |
| mitigation.MitigationBypass | 4 | 12 | ๐ 33.3 |
References
Affected or Relevant Artifacts
- Developer: Moonshot AI
- Deployer: Together AI
- Artifact Details:
| Type | Name |
|---|---|
| Model | Kimi-K2-Instruct-0905 |
Impact
AVID Taxonomy Categorization
- Risk domains: Security
- SEP subcategories: S0300: Over-permissive API
- Lifecycle stages: L05: Evaluation
Other information
- Report Type: Measurement
- Credits:
- Date Reported: 2026-02-20
- Version: 0.3.1
- AVID Entry