INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    aries
    -1.63
    ringes
    -1.04
    ARIES
    -1.02
    ringing
    -1.00
    ring
    -0.96
    ringed
    -0.82
    ringe
    -0.79
    arial
    -0.77
    aried
    -0.75
     else
    -0.69
    POSITIVE LOGITS
     autorytatywna
    0.79
     ligiloj
    0.56
     hollow
    0.56
    DMETHOD
    0.55
     étoit
    0.53
    Климат
    0.51
    Enllaços
    0.50
    SOUNDBITE
    0.50
    Jereo
    0.49
     Abonnez
    0.48
    Act Density 0.018%

    No Known Activations