INDEX
Explanations
expressions of potential, capability, or state of being
Negative Logits
Token     Logit
能够      -0.18
being     -0.17
being     -0.17
ABLE      -0.17
能        -0.16
ability   -0.16
-being    -0.16
able      -0.15
573       -0.15
Ability   -0.14
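Token strings from byte-level BPE vocabularies are sometimes displayed in the tokenizer's internal byte-to-character remapping, in which the two CJK entries in the negative-logit list appear as `èĥ½å¤Ł` and `èĥ½`. The sketch below shows one way to turn such strings back into readable text. It assumes the underlying tokenizer uses the GPT-2 style byte-to-character table, which this page does not state; the function names are illustrative and not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build the byte -> printable-character table used by GPT-2 style byte-level BPE."""
    # Bytes that are already printable keep their own code point.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to an unused code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_display_token(token: str) -> str:
    """Map a displayed token string back to bytes, then decode as UTF-8.

    Assumption: `token` only contains characters from the table above.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_display_token("èĥ½å¤Ł"))
print(decode_display_token("èĥ½"))
```

Under that assumption the two entries decode to 能够 and 能, Chinese words for "be able to" and "can", which is consistent with the explanation given for this feature.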
Positive Logits
Token     Logit
easily    0.30
traced    0.27
Easily    0.22
anything  0.22
liken     0.21
anything  0.21
either    0.20
anywhere  0.20
found     0.19
safely    0.19
Activations Density: 0.160%
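The density figure can be read as the share of tokens on which the feature fires at all. The sketch below illustrates that reading, assuming density means the fraction of token positions with an activation above zero; this page does not define the term, and `activation_density` and the sample values are made up for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: a position counts as "active" when its value is strictly above zero.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic sample (not taken from the dashboard): 16 active positions out of 10,000.
sample = [0.0] * 9984 + [1.2] * 16
print(f"{activation_density(sample):.3%}")
```

On this reading, a density of 0.160% corresponds to roughly 16 active positions per 10,000 tokens.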