INDEX
Explanations
due to followed by a reason
New Auto-Interp
Negative Logits
ovati
0.71
tools
0.69
ائو
0.69
kab
0.68
palaces
0.68
шками
0.68
sov
0.67
troughs
0.67
thm
0.67
tools
0.67
POSITIVE LOGITS
diligence
1.18
наличи
0.90
odenum
0.86
Dil
0.85
adanya
0.85
probablemente
0.73
Reasons
0.71
razones
0.70
પાણી
0.70
دل
0.70
Activations Density 0.042%