INDEX
Explanations
references to winning or victorious outcomes
New Auto-Interp
Negative Logits
ikit
-0.16
709
-0.15
iki
-0.15
bounce
-0.15
Welch
-0.15
eka
-0.14
éné
-0.14
bah
-0.14
asi
-0.14
utes
-0.13
POSITIVE LOGITS
isia
0.16
itore
0.16
ÑĢай
0.16
ustanov
0.15
olor
0.14
osu
0.14
Nej
0.14
olla
0.14
à¥ģà¤
0.14
arness
0.14
Activations Density 0.079%