INDEX
Negative Logits
In
0.77
Abrams
0.70
in
0.68
äd
0.68
Wiring
0.68
engen
0.67
nings
0.66
scapes
0.66
hostilities
0.65
vasodil
0.63
POSITIVE LOGITS
nose
1.04
noses
0.78
snout
0.77
Nose
0.76
ли
0.73
nariz
0.66
鼻子
0.65
be
0.64
ล์
0.64
h
0.64
Activations Density 0.007%