INDEX
Explanations
phrases indicating movement or action involving physical space
instances of the word "around"
New Auto-Interp
Negative Logits
hetical
-0.79
qua
-0.75
vi
-0.71
ged
-0.70
iPhone
-0.69
igh
-0.67
jo
-0.67
IJ
-0.66
Sent
-0.66
andre
-0.66
POSITIVE LOGITS
corners
1.00
town
0.95
town
0.77
freely
0.76
uncontroll
0.70
bag
0.69
luster
0.67
bags
0.67
nervously
0.67
frantically
0.66
Activations Density 0.048%