INDEX
Negative Logits
weise
-0.11
(s
-0.10
oise
-0.10
McKay
-0.10
urf
-0.09
imson
-0.09
ondere
-0.09
iac
-0.09
waters
-0.09
akening
-0.08
POSITIVE LOGITS
iness
0.15
0.11
-covered
0.11
plug
0.10
-filled
0.10
Cutter
0.10
INESS
0.10
bin
0.10
ey
0.09
0.09
Activations Density 0.091%