INDEX
Negative Logits
doctor
-0.72
smanship
-0.69
pel
-0.68
glers
-0.66
mic
-0.63
midt
-0.61
Frag
-0.60
Gloves
-0.60
collar
-0.59
ajo
-0.58
POSITIVE LOGITS
aneously
1.08
thereafter
1.01
onding
0.89
afterward
0.87
gratification
0.85
recognizable
0.82
afterwards
0.81
upon
0.81
identifiable
0.81
regretted
0.81
Activations Density 0.030%