INDEX
    Explanations

    negative statements or negations

    New Auto-Interp
    Negative Logits
    aggi
    -0.16
    rees
    -0.14
     ÐľÑĸнÑĸÑģÑĤеÑĢ
    -0.14
    çıł
    -0.14
    etic
    -0.13
     MMI
    -0.13
    ettel
    -0.13
    .Module
    -0.13
    .stamp
    -0.13
     rejo
    -0.13
    POSITIVE LOGITS
    landers
    0.15
    ubo
    0.14
    lander
    0.14
     Düz
    0.14
    pard
    0.14
     Trit
    0.14
    опол
    0.14
    auge
    0.14
    çν
    0.13
    aqu
    0.13
    Act Density 0.001%

    No Known Activations