INDEX
Negative Logits
-Muslim
-0.07
wifi
-0.07
köln
-0.07
.string
-0.06
culo
-0.06
(grammarAccess
-0.06
vieille
-0.06
정책
-0.06
realms
-0.06
تی
-0.06
POSITIVE LOGITS
belir
0.06
inform
0.06
safely
0.06
CLL
0.06
barred
0.06
/check
0.06
fortune
0.06
_BAR
0.05
Judicial
0.05
.Sign
0.05
Activations Density 0.314%