INDEX
Negative Logits
مكاف
-0.08
tjejer
-0.08
Office
-0.08
fprintf
-0.07
Publish
-0.07
MOTOR
-0.07
San
-0.07
immigr
-0.07
immun
-0.07
anni
-0.07
POSITIVE LOGITS
bih
0.07
幖
0.07
are
0.07
depth
0.06
inue
0.06
divergence
0.06
(width
0.06
!--
0.06
Exclusive
0.06
:param
0.06
Activations Density 0.002%