INDEX
Explanations
phrases related to the concept of location or direction
questions and statements regarding locations or positions
New Auto-Interp
Negative Logits
emed
-0.74
quel
-0.72
ATURE
-0.72
apt
-0.71
MER
-0.69
asts
-0.68
advertisement
-0.67
bart
-0.66
icum
-0.64
ME
-0.63
POSITIVE LOGITS
abouts
1.23
upon
1.05
else
1.04
exactly
0.97
fore
0.88
ver
0.82
soever
0.81
they
0.72
nearest
0.66
we
0.63
Activations Density 0.041%