INDEX
Explanations
instances of the word "right" in close proximity to prepositions or other contextual cues
phrases indicating physical presence or location in time and space
Negative Logits
iple       -0.78
vari       -0.73
orously    -0.70
iatric     -0.70
ply        -0.70
marine     -0.68
amar       -0.68
uscript    -0.68
imens      -0.67
ravel      -0.67
Positive Logits
doorstep    0.67
Shooter     0.65
enegger     0.63
Wrong       0.63
edge        0.60
Stockholm   0.59
Bolt        0.59
Guns        0.58
Hog         0.58
Centre      0.58
Activations Density: 0.077%