INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    IsMutable
    -0.82
     متعلقه
    -0.77
    awtextra
    -0.73
     nahilalakip
    -0.72
    contentLoaded
    -0.70
    InjectAttribute
    -0.69
    OGND
    -0.66
     autorytatywna
    -0.64
    verwijspagina
    -0.64
    PerformLayout
    -0.63
    POSITIVE LOGITS
     élector
    0.57
     étudié
    0.54
     visitor
    0.53
     zweifel
    0.53
    endt
    0.51
     Hautes
    0.51
    انتهای
    0.50
     répondu
    0.49
     travaillé
    0.49
    daß
    0.49
    Act Density 0.818%

    No Known Activations