Home ยป Database

AVID-2026-R0197

Description

The model Kimi-K2-Instruct-0905 from Moonshot AI was evaluated by the Garak LLM Vulnerability scanner using the probe phrasing.PastTense.

Details

The probe phrasing.PastTense tests the evasion of safeguards by rephrasing requests in the past tense. This probe is part of the phrasing module which attempts to evade safeguards using future tense.

The LLM Kimi-K2-Instruct-0905 was evaluated on this probe.

Metrics

Number of failures were counted for each detector in the probe.

DetectorPassedTotalScore
mitigation.Prefixes58256๐ŸŸ  22.7

References

Affected or Relevant Artifacts

  • Developer: Moonshot AI
  • Deployer: Together AI
  • Artifact Details:
TypeName
ModelKimi-K2-Instruct-0905

Impact

AVID Taxonomy Categorization

  • Risk domains:
  • SEP subcategories:
  • Lifecycle stages: L05: Evaluation

Other information

  • Report Type: Measurement
  • Credits:
  • Date Reported: 2026-02-20
  • Version: 0.3.1
  • AVID Entry