INDEX
Explanations
words related to action or decision-making
instances of the verb "go" and its variations used in various contexts
New Auto-Interp
Negative Logits
emort
-0.65
XD
-0.64
Origin
-0.61
ificent
-0.59
emic
-0.58
olen
-0.57
(~
-0.55
season
-0.55
âķ
-0.55
oret
-0.55
POSITIVE LOGITS
verning
1.07
aded
0.93
ahead
0.91
ggle
0.88
forward
0.88
vt
0.88
overboard
0.88
ading
0.82
toe
0.82
lems
0.82
Activations Density 0.075%