INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     autorytatywna
    -0.83
     defaultstate
    -0.77
    IsContent
    -0.66
     مشين
    -0.65
     resourceCulture
    -0.64
     مرئيه
    -0.63
     חיצוניים
    -0.62
    Personensuche
    -0.61
    Zeneca
    -0.61
    BeginContext
    -0.61
    POSITIVE LOGITS
     and
    0.66
     the
    0.61
     Fer
    0.50
     Murphy
    0.48
     Adams
    0.47
     p
    0.47
    eya
    0.47
     Haupts
    0.47
     key
    0.46
     caus
    0.46
    Act Density 0.007%

    No Known Activations