INDEX
Negative Logits
ullah
-1.06
ament
-1.04
ificent
-0.98
osterone
-0.90
itionally
-0.88
oral
-0.86
inguished
-0.85
uctor
-0.85
atures
-0.85
ipation
-0.84
POSITIVE LOGITS
vt
1.36
verning
1.17
lems
1.16
ggle
1.14
Ń·
1.00
ahead
1.00
ffer
0.99
ALK
0.99
Rush
0.98
overboard
0.96
Activations Density 0.958%