INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     gjelder
    -0.09
     bonke
    -0.08
     Ebony
    -0.08
     Revolutionary
    -0.08
     myths
    -0.08
     stesso
    -0.08
    enschappelijk
    -0.08
    ṣiṣẹ
    -0.08
     bhios
    -0.08
     đấu
    -0.08
    POSITIVE LOGITS
     Upt
    0.07
     forty
    0.07
     taller
    0.07
     Greg
    0.07
    0.07
     слиз
    0.07
    0.07
    poro
    0.07
    bounce
    0.07
    0.07
    Act Density 0.017%

    No Known Activations