INDEX
    Explanations

    references to frontline workers and their experiences

    New Auto-Interp
    Negative Logits
    ilig
    -0.17
    ndl
    -0.15
    uluk
    -0.15
    jac
    -0.15
    alah
    -0.15
    ENCHMARK
    -0.15
    åŀ
    -0.15
     Gry
    -0.14
     Hag
    -0.14
    maz
    -0.14
    POSITIVE LOGITS
    vert
    0.17
    uth
    0.14
    elle
    0.14
     segment
    0.14
     Re
    0.13
    xe
    0.13
     innocent
    0.13
    -feed
    0.13
    eld
    0.13
    _locale
    0.13
    Act Density 0.122%

    No Known Activations