INDEX
    Explanations

    phrases and actions related to individual agency and empowering choices

    New Auto-Interp
    Negative Logits
    èĥ½
    -0.24
    èĥ½å¤Ł
    -0.20
     kunnen
    -0.16
    çĦ¶
    -0.16
    orge
    -0.16
    æľį
    -0.15
    amaz
    -0.15
     pouvoir
    -0.15
    eree
    -0.15
     frequ
    -0.14
    POSITIVE LOGITS
     easily
    0.23
     feas
    0.21
     anytime
    0.20
     safely
    0.17
    inx
    0.17
     Easily
    0.17
    à¹Įà¹Ħà¸Ķ
    0.16
     snadno
    0.16
     be
    0.16
    ัà¸Ļà¹Ħà¸Ķ
    0.16
    Act Density 1.093%

    No Known Activations