INDEX
Negative Logits
hovah
-0.80
scrut
-0.78
challeng
-0.72
ajo
-0.70
lisher
-0.69
omination
-0.69
¥ŀ
-0.68
yrinth
-0.66
asca
-0.66
ged
-0.65
POSITIVE LOGITS
lli
1.23
llular
1.13
lla
1.06
llo
1.01
ptive
0.91
xp
0.91
llan
0.91
ice
0.89
ll
0.86
ptives
0.83
Activations Density 0.021%