Negative Logits
p               0.72
old             0.65
ów              0.64
Communications  0.63
Cooper          0.62
Pace            0.62
v               0.62
Partnership     0.62
ranks           0.62
kelamin         0.62
Positive Logits
EtOH            0.89
daños           0.87
heutigen        0.84
ми              0.83
ер              0.82
Hfn             0.81
ER              0.81
jenigen         0.81
CHREIB          0.80
plumage         0.80
Activations Density: 0.001%