INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     como
    -0.06
     еж
    -0.06
     Brittany
    -0.06
    hide
    -0.06
    igos
    -0.06
     Houston
    -0.06
     Colorado
    -0.06
    harga
    -0.06
     jméno
    -0.06
    Colorado
    -0.06
    POSITIVE LOGITS
     ball
    0.11
     Ball
    0.11
    ball
    0.10
    Ball
    0.10
    ads
    0.09
     balls
    0.08
    .ball
    0.08
    0.08
     Ballard
    0.07
    ейств
    0.07
    Act Density 0.017%

    No Known Activations