INDEX
    Explanations

    statements related to systemic injustice and societal critique

    Follows an abbreviation or initialism

    Queen, scale, correlation

    New Auto-Interp
    Negative Logits
     EconPapers
    -0.89
    aarrggbb
    -0.85
    httphttps
    -0.79
     &___
    -0.79
    HasAnnotation
    -0.76
    LookAnd
    -0.74
     Meksiku
    -0.74
     Савезне
    -0.73
    TypedDataSet
    -0.72
    verwijspagina
    -0.71
    POSITIVE LOGITS
    <eos>
    0.80
    ↵↵
    0.75
    なお
    0.53
    </tr>
    0.53
    </h3>
    0.52
    ↵↵↵
    0.52
    0.47
     Bonus
    0.46
    ________________
    0.46
    ↵↵↵↵↵
    0.46
    Act Density 0.053%

    No Known Activations