INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     pilot
    -0.15
     Dj
    -0.15
    idd
    -0.15
    ada
    -0.15
     whom
    -0.15
    oth
    -0.14
    zar
    -0.14
    0
    -0.14
    DownList
    -0.14
    erness
    -0.13
    POSITIVE LOGITS
    lif
    0.17
    rud
    0.16
    iek
    0.16
     FAG
    0.15
    STA
    0.15
    ìķħ
    0.15
    ³
    0.14
    ÑİÑĢ
    0.14
     WithEvents
    0.14
    ῦ
    0.14
    Act Density 0.076%

    No Known Activations