INDEX
    Explanations

    phrases that advocate for social equity and a just world

    Negative Logits
     ngu         -0.15
     slack       -0.15
    .compiler    -0.15
    entai        -0.15
    IFS          -0.14
    iaux         -0.14
     kø          -0.14
    orca         -0.14
     dán         -0.14
    uro          -0.14
    Positive Logits
    ERGY         0.15
    кид          0.15
    league       0.14
    \TestCase    0.14
    ogg          0.14
    icopter      0.14
     Bair        0.13
    hs           0.13
    torrent      0.13
     Scaffold    0.13
    Act Density 0.171%

    No Known Activations