INDEX
Explanations
statements related to critique or evaluation of performance
New Auto-Interp
Negative Logits
culturali
-0.34
ectoria
-0.32
cessed
-0.32
féminine
-0.31
jillo
-0.31
eléctrico
-0.30
femeninos
-0.29
eléctricas
-0.29
eléctricos
-0.29
visitor
-0.28
POSITIVE LOGITS
@"/
0.57
surla
0.56
للاسماء
0.54
Stoke
0.54
躇
0.52
Sunderland
0.51
GetAxis
0.50
ppig
0.50
manager
0.50
underland
0.50
Activations Density 0.187%