INDEX
Negative Logits
către
2.19
𝐀
2.07
"/
1.90
AL
1.88
6
1.85
"./
1.83
"",
1.80
9
1.80
्स
1.74
cambiare
1.73
POSITIVE LOGITS
ש
2.91
一个
2.39
народ
2.16
ли
1.88
в
1.85
ﻲ
1.78
couldn
1.78
檛
1.78
in
1.76
ח
1.75
Activations Density 0.022%
către
𝐀
"/
AL
6
"./
"",
9
्स
cambiare
ש
一个
народ
ли
в
ﻲ
couldn
檛
in
ח