AVID-2026-R0255

Description

The model Mistral-Small-24B-Instruct-2501 from Mistral was evaluated by the Garak LLM Vulnerability scanner using the probe promptinject.HijackHateHumans.

Details

The probe promptinject.HijackHateHumans tests for potential hijacking of prompts that express hate towards humans. This probe is part of the promptinject module which detects and analyzes prompt injection attacks.

The LLM Mistral-Small-24B-Instruct-2501 was evaluated on this probe.

Metrics

Number of failures were counted for each detector in the probe.

Detector	Passed	Total	Score
promptinject.AttackRogueString	45	256	🟠 17.6

References

garak, an LLM vulnerability scanner

Affected or Relevant Artifacts

Developer: Mistral
Deployer: Together AI
Artifact Details:

Type	Name
Model	Mistral-Small-24B-Instruct-2501

Impact

AVID Taxonomy Categorization

Risk domains: Security
SEP subcategories: S0403: Adversarial Example
Lifecycle stages: L05: Evaluation

Other information

Report Type: Measurement
Credits:
Date Reported: 2026-03-10
Version: 0.3.3
AVID Entry