INDEX
Explanations
references to positions or roles
references to a specific "position" in various contexts
New Auto-Interp
Negative Logits
ombies
-0.86
bells
-0.70
angel
-0.64
Anonymous
-0.64
jah
-0.64
THANK
-0.64
NEWS
-0.63
hall
-0.63
icious
-0.61
bey
-0.61
POSITIVE LOGITS
position
3.83
positions
2.87
Position
2.67
Position
2.21
position
2.01
stance
1.83
positioning
1.78
posture
1.62
Pos
1.44
role
1.39
Activations Density 0.014%