INDEX
Negative Logits
ADRA
-0.68
ktop
-0.67
ATING
-0.65
ctory
-0.64
nces
-0.64
ça
-0.62
++++++++
-0.60
dancer
-0.60
stract
-0.58
UGE
-0.58
POSITIVE LOGITS
sonian
1.75
son
1.02
field
0.94
smanship
0.93
sburg
0.90
Barney
0.86
gren
0.85
ies
0.82
anity
0.79
ie
0.79
Activations Density 0.032%