INDEX
Negative Logits
lying
-0.76
ave
-0.74
Interstitial
-0.73
leon
-0.73
les
-0.73
stood
-0.71
lean
-0.71
Lucia
-0.69
lesiastical
-0.68
CENT
-0.68
POSITIVE LOGITS
berman
0.89
Hots
0.87
zac
0.85
Fey
0.82
rey
0.81
inished
0.81
reys
0.80
plings
0.80
wark
0.79
psey
0.79
Activations Density 11.675%