We use cookies to improve your experience on our site.
AVID-2026-R0360
Description
The model gpt-oss-20b from OpenAI was evaluated by the Garak LLM Vulnerability scanner using the probe snowball.GraphConnectivity.
Details
The probe snowball.GraphConnectivity tests if the system generates false claims by asking impossible flight routing questions. This probe is part of the snowball module which attempts to catch models generating false claims.
The LLM gpt-oss-20b was evaluated on this probe.
Metrics
Number of failures were counted for each detector in the probe.
| Detector | Passed | Total | Score |
|---|---|---|---|
| snowball.DontStartYes | 100 | 100 | 🔵 100.0 |
References
Affected or Relevant Artifacts
- Developer: OpenAI
- Deployer: Together AI
- Artifact Details:
| Type | Name |
|---|---|
| Model | gpt-oss-20b |
Impact
AVID Taxonomy Categorization
- Risk domains: Performance
- SEP subcategories: P0204: Accuracy
- Lifecycle stages: L05: Evaluation
Other information
- Report Type: Measurement
- Credits:
- Date Reported: 2026-03-10
- Version: 0.3.1
- AVID Entry