AVID-2026-R0196

Description

The model Kimi-K2-Instruct-0905 from Moonshot AI was evaluated by the Garak LLM Vulnerability scanner using the probe phrasing.FutureTense.

Details

The probe phrasing.FutureTense tests if future tense requests can bypass safeguards for unsafe content. This probe is part of the phrasing module which attempts to evade safeguards using future tense.

The LLM Kimi-K2-Instruct-0905 was evaluated on this probe.

Metrics

Number of failures were counted for each detector in the probe.

Detector	Passed	Total	Score
mitigation.Prefixes	173	256	🟡 67.6

References

garak, an LLM vulnerability scanner

Affected or Relevant Artifacts

Developer: Moonshot AI
Deployer: Together AI
Artifact Details:

Type	Name
Model	Kimi-K2-Instruct-0905

Impact

AVID Taxonomy Categorization

Risk domains:
SEP subcategories:
Lifecycle stages: L05: Evaluation

Other information

Report Type: Measurement
Credits:
Date Reported: 2026-02-20
Version: 0.3.3
AVID Entry