INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Fire
    -0.63
     FIRE
    -0.57
    FIRE
    -0.54
    Życiorys
    -0.51
    Fire
    -0.50
    eles
    -0.48
     itchy
    -0.47
    Попис
    -0.44
    InitVars
    -0.43
     newOwner
    -0.43
    POSITIVE LOGITS
    0.65
     ویکی‌پدیا
    0.60
    work
    0.59
     الحره
    0.57
    Expedia
    0.56
     Normdatei
    0.55
    postDelayed
    0.54
    fighters
    0.54
    man
    0.53
    place
    0.53
    Act Density 0.145%

    No Known Activations