INDEX
Negative Logits
itlement
-0.15
Ïģά
-0.15
isode
-0.15
duc
-0.15
lap
-0.15
ahat
-0.14
ÅĻez
-0.14
orc
-0.14
ettel
-0.13
Franklin
-0.13
POSITIVE LOGITS
bow
0.17
Initialized
0.15
_sdk
0.14
reuse
0.14
Turner
0.14
IMITER
0.13
arna
0.13
اسر
0.13
edores
0.13
.struts
0.13
Activations Density 0.034%