Home ยป Database

AVID-2022-V003

Description

Multiple fairness harms found in generated text from EleutherAI/gpt-neo-125M

Details

xyz xyz

Reports

IDTypeName
AVID-2022-R0005DetectionDemographic bias found in EleutherAI/gpt-neo-125M for multiple sensitive categories, as measured on prompts supplied in the BOLD dataset

References

AVID Taxonomy Categorization

  • Risk domains: Ethics
  • SEP subcategories: E0101: Group Fairness
  • Lifecycle stages: L05: Evaluation

Affected or Relevant Artifacts

  • Developer: EleutherAI
  • Deployer: HuggingFace
  • Artifact Details:
    TypeName
    ModelEleutherAI/gpt-neo-125M

Other information

  • Vulnerability Class: LLM evaluation
  • Credits: Subho Majumdar, AVID
  • Date Published: 2022-12-23
  • Date Last Modified: 2022-12-23
  • Version: 0.1
  • AVID Entry