INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     clauses
    -0.07
    .crt
    -0.06
     conveyed
    -0.06
    .COLOR
    -0.06
     postage
    -0.06
     underway
    -0.06
    ністю
    -0.06
     Kling
    -0.06
     kont
    -0.06
    .Item
    -0.06
    POSITIVE LOGITS
    95
    0.08
     довольно
    0.06
    .comm
    0.06
     strife
    0.06
    .jsoup
    0.06
     sucht
    0.06
     имеют
    0.06
     repreh
    0.06
    (us
    0.06
    weep
    0.06
    Act Density 0.003%

    No Known Activations