INDEX
NEGATIVE LOGITS
expressions
-0.86
金牛
-0.76
baza
-0.75
塾
-0.74
tones
-0.74
̣ng
-0.73
телен
-0.72
mögliche
-0.71
ਰ
-0.71
摞
-0.70
POSITIVE LOGITS
Hehe
0.77
nio
0.71
ours
0.69
tudom
0.69
svr
0.68
きち
0.68
nitrite
0.67
{_
0.67
kuitenkin
0.67
Regime
0.67
Activations Density 0.023%