INDEX
Negative Logits
ak
1.65
ો
1.64
s
1.56
ON
1.51
V
1.44
angry
1.41
runs
1.41
sal
1.39
ay
1.38
ang
1.38
POSITIVE LOGITS
Rho
1.24
Ты
1.23
Такие
1.21
satisfactorily
1.20
Trời
1.20
anxiously
1.19
िश्वत
1.19
Pengh
1.17
Equest
1.16
Southerners
1.16
Activations Density 0.004%