INDEX
    Explanations

    statements highlighting significant challenges or issues

    phrases that highlight sensitive issues or challenges faced by various groups or individuals

    New Auto-Interp
    Negative Logits
    qi
    -0.75
    ohn
    -0.72
    cles
    -0.71
    igger
    -0.68
    aunder
    -0.68
    clair
    -0.67
    heimer
    -0.67
    mol
    -0.66
    OVA
    -0.66
    buster
    -0.65
    POSITIVE LOGITS
    geries
    1.10
    bidden
    1.03
     policymakers
    1.03
     purposes
    1.01
     everyone
    0.93
    gery
    0.92
     starters
    0.91
     beginners
    0.91
     anyone
    0.91
     everybody
    0.89
    Act Density 0.159%

    No Known Activations