INDEX
Negative Logits
selves
0.96
unser
0.95
equator
0.95
0.93
một
0.90
Greater
0.90
đôi
0.88
regal
0.87
der
0.86
mselves
0.86
POSITIVE LOGITS
ان
2.11
на
1.63
an
1.59
ar
1.43
zelfde
1.42
ಾ
1.41
ية
1.27
ad
1.22
ной
1.15
quele
1.15
Activations Density 0.066%