Negative Logits
Пол        -0.07
retiring   -0.07
bben       -0.07
aired      -0.07
artment    -0.07
(Pos       -0.07
SEN        -0.07
died       -0.06
office     -0.06
اب         -0.06
Positive Logits
stimulus      0.08
IMAGE         0.08
stimuli       0.07
Instantiate   0.06
ups           0.06
stm           0.06
ulse          0.06
Markup        0.06
Hit           0.06
Tomato        0.06
Activations Density 0.004%