INDEX
Explanations
verbs associated with movement or direction
New Auto-Interp
Negative Logits
.fp
-0.17
bred
-0.16
ebo
-0.16
acht
-0.15
iedy
-0.15
isin
-0.15
Äijiá»ĥn
-0.15
ercul
-0.14
Drv
-0.14
eday
-0.14
POSITIVE LOGITS
ahead
0.28
-ahead
0.27
ahead
0.20
hay
0.20
bers
0.19
beyond
0.19
vt
0.18
Ahead
0.18
round
0.17
-on
0.17
Activations Density 0.049%