INDEX
Negative Logits
obedient
-0.07
racket
-0.07
countries
-0.06
_void
-0.06
(report
-0.06
.side
-0.06
Roku
-0.06
sweetness
-0.06
(enemy
-0.06
ocht
-0.06
POSITIVE LOGITS
LOPT
0.06
چشم
0.06
iginal
0.06
prim
0.06
ıyoruz
0.06
рев
0.06
REG
0.06
iná
0.06
justices
0.06
ilendir
0.06
Activations Density 0.001%