INDEX
Explanations
phrases related to direction or movement
phrases indicating direction or movement
Negative Logits
iability    -0.72
Corpus      -0.66
sylvania    -0.62
Mub         -0.62
TON         -0.62
rehens      -0.61
illon       -0.60
asions      -0.60
bos         -0.60
orum        -0.60
Positive Logits
canon       1.09
liner       0.90
heading     0.88
lines       0.86
line        0.85
pin         0.85
pins        0.85
quarter     0.83
lander      0.82
toward      0.81
Activations Density 0.021%