INDEX
Negative Logits
beard
-0.84
ILY
-0.73
buckle
-0.65
Nieto
-0.61
Belt
-0.61
lightly
-0.61
landish
-0.60
err
-0.59
theless
-0.59
mist
-0.59
POSITIVE LOGITS
ations
2.34
ators
2.31
atory
2.20
ator
2.08
ational
1.95
ative
1.88
atio
1.86
atories
1.84
ating
1.80
ates
1.74
Activations Density 0.098%