INDEX
Negative Logits
어
-0.07
Appendix
-0.07
ουλίου
-0.07
commons
-0.06
Paragraph
-0.06
าษ
-0.06
Holland
-0.06
ullah
-0.06
suff
-0.06
-0.06
POSITIVE LOGITS
Win
0.17
Win
0.15
win
0.14
_win
0.13
WIN
0.12
Winchester
0.11
win
0.11
WIN
0.09
Edwin
0.09
(WIN
0.09
Activations Density 0.007%