    Explanations

    ism/syndrome

    Negative Logits
    Token           Logit
    " MAIN"         -0.07
                    -0.07
    " FIR"          -0.07
    "_BAR"          -0.07
    " heavily"      -0.07
    " cried"        -0.06
    " pošk"         -0.06
    "ickém"         -0.06
    ".low"          -0.06
                    -0.06
    Positive Logits
    Token           Logit
    " dereg"        0.06
    " αρι"          0.06
    "lias"          0.06
    "_nl"           0.06
    "inscription"   0.06
    " attribution"  0.06
    " florida"      0.06
    "осудар"        0.06
    ".gf"           0.06
    "tober"         0.06
    Activation Density: 0.008%

    No Known Activations