Negative Logits
RANT      -0.86
Sunder    -0.75
maker     -0.68
ARD       -0.66
FUL       -0.65
mble      -0.65
eer       -0.65
footing   -0.63
ously     -0.63
prick     -0.62
Positive Logits
cend       1.19
mission    1.11
lator      1.09
gender     1.08
latable    1.05
parency    1.05
missions   1.04
mitt       1.04
itional    1.03
istors     1.03
Activations Density: 0.014%