INDEX
Negative Logits
oppress
-0.07
.jump
-0.07
HIV
-0.07
occ
-0.07
Preconditions
-0.06
offence
-0.06
preced
-0.06
_plate
-0.06
avent
-0.06
inactive
-0.06
POSITIVE LOGITS
er
0.16
IGN
0.06
ável
0.06
Meteor
0.06
AGAIN
0.06
ENG
0.06
aking
0.06
excel
0.06
ctr
0.06
inals
0.06
Activations Density 0.002%