INDEX
Negative Logits
kered
-0.94
ulates
-0.91
ornia
-0.90
ceed
-0.87
anooga
-0.85
ulate
-0.83
*/(
-0.83
pload
-0.82
ounty
-0.78
pter
-0.78
POSITIVE LOGITS
Dull
1.03
Gins
0.98
Allen
0.82
Robinson
0.81
ham
0.76
stown
0.76
hurst
0.72
heim
0.71
quist
0.70
Poe
0.70
Activations Density 0.027%