INDEX
Negative Logits
Hor
-0.08
_delta
-0.07
elta
-0.07
�
-0.07
انه
-0.07
joint
-0.07
sas
-0.07
ۀ
-0.07
Inherited
-0.07
iston
-0.07
POSITIVE LOGITS
igjen
0.08
Fidel
0.08
again
0.07
步骤
0.07
ន
0.07
Vul
0.07
igen
0.07
Jian
0.07
deng
0.07
Vij
0.07
Activations Density 0.011%