Explanations
themes of competition and achievement
Negative Logits
ıza        -0.13
utdown     -0.12
нÑĸÑı       -0.11
ÑıÑĤи      -0.11
inci       -0.11
λικά       -0.11
еÑĦ        -0.11
irement    -0.11
άβ         -0.10
udad       -0.10
Positive Logits
one        1.45
one        0.95
ONE        0.89
_one       0.89
-one       0.85
One        0.85
One        0.83
.one       0.82
uno        0.80
одного     0.75
Activations Density 2.746%