INDEX
Negative Logits
grandparents
-0.68
haste
-0.67
intest
-0.66
iTunes
-0.66
ups
-0.64
dining
-0.63
grandfather
-0.63
slaughtered
-0.63
descent
-0.62
segments
-0.62
POSITIVE LOGITS
ible
1.39
ics
1.34
icism
1.30
icons
1.30
ibly
1.26
ical
1.26
icon
1.24
oric
1.21
ually
1.19
ual
1.18
Activations Density 0.056%