INDEX
Negative Logits
exemplified
0.46
salient
0.45
thinking
0.42
dissatisfaction
0.42
intervals
0.42
Clements
0.42
inska
0.41
Joaquin
0.41
childhood
0.41
Duckworth
0.40
POSITIVE LOGITS
0.44
www
0.42
интегра
0.42
(+
0.41
парази
0.41
формы
0.39
καθη
0.39
كلها
0.39
کۆ
0.39
имеют
0.39
Activations Density 0.000%