INDEX
Negative Logits
ornia       -0.66
ATURES      -0.65
Ut          -0.65
ffe         -0.64
ayette      -0.64
ATURE       -0.63
enough      -0.62
RIC         -0.61
OGR         -0.60
sufficient  -0.60
Positive Logits
backdrop    1.39
perceived   0.74
intruder    0.73
wall        0.73
invaders    0.73
opponent    0.73
evils       0.73
extremes    0.73
prejudice   0.70
landlord    0.68
Activations Density: 15.354%