INDEX
    Explanations

    expressions of potential, capability, or state of being

    New Auto-Interp
    Negative Logits
    èĥ½å¤Ł
    -0.18
    being
    -0.17
     being
    -0.17
    ABLE
    -0.17
    èĥ½
    -0.16
     ability
    -0.16
    -being
    -0.16
     able
    -0.15
    573
    -0.15
     Ability
    -0.14
    POSITIVE LOGITS
     easily
    0.30
     traced
    0.27
     Easily
    0.22
     anything
    0.22
     liken
    0.21
    anything
    0.21
     either
    0.20
     anywhere
    0.20
    found
    0.19
     safely
    0.19
    Act Density 0.160%

    No Known Activations