    Negative Logits
    Token            Logit
    " friends"       -1.11
    "friends"        -0.93
    " donors"        -0.82
    " FRIENDS"       -0.79
    " witnesses"     -0.77
    " Freunde"       -0.76
    " Donors"        -0.74
    " kasarigan"     -0.74
    "AndEndTag"      -0.73
    " vrienden"      -0.73
    Positive Logits
    Token                 Logit
    "imageshack"          0.50
    "存于互联网档案馆"    0.50
    "Décès"               0.49
    "Bioaccumulative"     0.49
    "openConnection"      0.48
    "ufort"               0.48
    " and"                0.47
                          0.47
    "SBS"                 0.47
    "hatsu"               0.47
    Activation Density: 0.127%

    No Known Activations