INDEX
Explanations
references to positions or directions, especially related to the right side or back
directions and positional references
New Auto-Interp
Negative Logits
ngth
-0.72
upon
-0.70
anmar
-0.69
regor
-0.68
avorite
-0.66
nces
-0.65
phis
-0.64
uctor
-0.63
ritch
-0.63
emort
-0.63
POSITIVE LOGITS
basis
1.02
occasions
0.99
doorstep
0.98
periphery
0.95
shelf
0.90
leash
0.88
heels
0.88
occasion
0.88
behalf
0.83
pedest
0.82
Activations Density 0.274%