INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     sacrific
    -0.81
     behavi
    -0.69
     Juda
    -0.67
     advoc
    -0.65
     fundament
    -0.63
     Dept
    -0.62
    ioch
    -0.61
    mington
    -0.60
    yd
    -0.60
     Til
    -0.59
    POSITIVE LOGITS
     Username
    0.66
    BuyableInstoreAndOnline
    0.65
    orial
    0.64
    unctions
    0.64
     Shia
    0.63
    ippi
    0.62
    iae
    0.62
    geries
    0.61
    ativity
    0.61
    interstitial
    0.60
    Act Density 0.201%

    No Known Activations