INDEX
Explanations
terms related to positions or locations, specifically emphasizing the back or front
references to positional or temporal locations
New Auto-Interp
Negative Logits
inia
-0.77
upon
-0.69
anmar
-0.67
abor
-0.63
atown
-0.62
alach
-0.62
artisan
-0.62
agra
-0.61
ritch
-0.58
heric
-0.57
POSITIVE LOGITS
side
1.03
basis
1.02
sidelines
1.00
heels
0.97
shelf
0.96
grounds
0.93
doorstep
0.89
ieth
0.87
occasions
0.86
axis
0.86
Activations Density 0.170%