INDEX
Negative Logits
Serg
-0.07
conc
-0.06
inci
-0.06
dungeons
-0.06
banda
-0.06
valu
-0.06
quam
-0.06
Sher
-0.06
텐
-0.06
ACTER
-0.06
POSITIVE LOGITS
officials
0.07
originate
0.07
*****/↵
0.07
Enter
0.06
.gov
0.06
famously
0.06
Jes
0.06
الت
0.06
_sdk
0.06
#####
0.06
Activations Density 0.021%