INDEX
    Explanations

    phrases indicating legal or moral entitlement

    phrases related to individual rights and liberties

    New Auto-Interp
    Negative Logits
     Madness
    -0.60
    iries
    -0.59
     diligent
    -0.57
     Spiel
    -0.57
     unsuspecting
    -0.56
     batches
    -0.55
     Journals
    -0.54
    metics
    -0.54
     Hort
    -0.54
    duction
    -0.54
    POSITIVE LOGITS
     whatsoever
    0.85
    76561
    0.77
     vested
    0.74
    ointed
    0.74
     veto
    0.69
    kees
    0.69
     entit
    0.69
    âĺ
    0.68
    amus
    0.68
    ï¸
    0.67
    Act Density 0.159%

    No Known Activations