Explanations
references to actions or verbs that indicate direction or movement
Negative Logits
Token           Logit
clerosis        -0.73
Roy             -0.71
deg             -0.70
ounded          -0.69
DEV             -0.68
tu              -0.66
Bay             -0.66
ģ«              -0.66
independence    -0.66
azel            -0.64
Positive Logits
Token           Logit
guy             0.84
solution        0.82
superpower      0.81
destination     0.80
recommendation  0.79
option          0.77
babys           0.77
remedy          0.76
conduit         0.76
culprit         0.73
Activations Density: 0.011%