AVID-2026-R0643

Description

The model rnj-1-instruct from Essential AI was evaluated by garak, an LLM vulnerability scanner, using the probe atkgen.Tox.

Details

The probe atkgen.Tox tests automatic attack generation to elicit toxic outputs from language models. This probe is part of the atkgen module, which generates attack prompts for toxicity testing.

The LLM rnj-1-instruct was evaluated on this probe.
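
The general idea of automatic attack generation can be sketched as a loop in which one component proposes a prompt, the target model answers, and a detector judges the answer. This is only an illustrative sketch, not garak's implementation or API: the names run_attack_loop, attacker, target, and is_toxic are hypothetical, and feeding the target's previous response back to the attacker is an assumption about how such a loop might be arranged.

```python
from typing import Callable, List, Tuple

# Hypothetical type: (prompt, response, flagged_as_toxic)
Turn = Tuple[str, str, bool]


def run_attack_loop(
    attacker: Callable[[str], str],   # hypothetical: proposes the next attack prompt
    target: Callable[[str], str],     # hypothetical: the model under test
    is_toxic: Callable[[str], bool],  # hypothetical: stand-in for a toxicity detector
    max_turns: int = 5,
) -> List[Turn]:
    """Run a short attacker/target exchange and record the detector's verdicts."""
    transcript: List[Turn] = []
    last_response = ""
    for _ in range(max_turns):
        prompt = attacker(last_response)   # attack prompt conditioned on the last reply
        response = target(prompt)          # target model's output
        flagged = is_toxic(response)       # detector verdict on that output
        transcript.append((prompt, response, flagged))
        last_response = response
    return transcript


# Toy stand-ins so the sketch runs without any model or network access.
def toy_attacker(previous: str) -> str:
    return f"Follow-up to: {previous!r}"


def toy_target(prompt: str) -> str:
    return "I would rather keep this conversation respectful."


def toy_detector(text: str) -> bool:
    return "insult" in text.lower()


turns = run_attack_loop(toy_attacker, toy_target, toy_detector, max_turns=3)
failures = sum(1 for _, _, flagged in turns if flagged)
```

In this toy run no response is flagged, so failures is zero; with a real attacker, target, and detector the same bookkeeping would yield the per-detector counts reported under Metrics.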

Metrics

The number of failures was counted for each detector in the probe.

Detector	Passed	Total	Score
unsafe_content.ToxicCommentModel	25	25	🔵 100.0
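
The Score column appears consistent with a pass rate expressed as a percentage (25 of 25 passed gives 100.0). A minimal sketch of that arithmetic follows, assuming the score is simply passed / total × 100 rounded to one decimal place; the function name pass_rate is hypothetical and is not part of garak.

```python
def pass_rate(passed: int, total: int) -> float:
    """Return the percentage of attempts that passed, rounded to one decimal.

    Assumption: the reported Score equals 100 * passed / total.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= passed <= total:
        raise ValueError("passed must be between 0 and total")
    return round(100.0 * passed / total, 1)


# Values taken from the table row for unsafe_content.ToxicCommentModel.
passed, total = 25, 25
score = pass_rate(passed, total)
failures = total - passed
```

Under this assumption the row above corresponds to zero failures out of 25 attempts.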

References

garak, an LLM vulnerability scanner

Affected or Relevant Artifacts

Developer: Essential AI
Deployer: Together AI
Artifact Details:

Type	Name
Model	rnj-1-instruct

Impact

AVID Taxonomy Categorization

Risk domains: Ethics
SEP subcategories: E0301: Toxicity
Lifecycle stages: L05: Evaluation

Other information

Report Type: Measurement
Credits:
Date Reported: 2026-03-17
Version: 0.3.3
AVID Entry