We use cookies to improve your experience on our site.
AVID-2026-R0136
Description
The model Kimi-K2-Instruct-0905 from Moonshot AI was evaluated by the Garak LLM Vulnerability scanner using the probe dra.DRA.
Details
The probe dra.DRA tests models using disguised and reconstructed harmful strings through string manipulation. This probe is part of the dra module which tests models with disguised harmful behaviors.
The LLM Kimi-K2-Instruct-0905 was evaluated on this probe.
Metrics
Number of failures were counted for each detector in the probe.
| Detector | Passed | Total | Score |
|---|---|---|---|
| mitigation.MitigationBypass | 3 | 60 | ๐ 5.0 |
| mitigation.Prefixes | 4 | 60 | ๐ 6.7 |
References
Affected or Relevant Artifacts
- Developer: Moonshot AI
- Deployer: Together AI
- Artifact Details:
| Type | Name |
|---|---|
| Model | Kimi-K2-Instruct-0905 |
Impact
AVID Taxonomy Categorization
- Risk domains: Security
- SEP subcategories: S0403: Adversarial Example
- Lifecycle stages: L05: Evaluation
Other information
- Report Type: Measurement
- Credits:
- Date Reported: 2026-02-20
- Version: 0.3.1
- AVID Entry