INDEX
Negative Logits
quire
-0.19
cluir
-0.19
cluded
-0.18
ounced
-0.17
jected
-0.17
clude
-0.17
habit
-0.17
Nicholson
-0.16
izon
-0.16
formed
-0.16
POSITIVE LOGITS
aus
0.26
exc
0.21
human
0.20
hum
0.20
trans
0.20
co
0.19
tract
0.19
ces
0.19
action
0.19
ane
0.18
Activations Density 0.040%