INDEX
Explanations
statements made by individuals
New Auto-Interp
Negative Logits
aÅŁ
-0.17
onest
-0.16
imon
-0.15
uma
-0.15
oce
-0.15
byn
-0.14
addCriterion
-0.14
pk
-0.14
gd
-0.14
yre
-0.13
POSITIVE LOGITS
">ÃĹ</
0.14
Bread
0.14
premature
0.14
Sahara
0.13
segmented
0.13
å¥Ī
0.13
ίδ
0.13
dép
0.13
73
0.13
é¤
0.13
Activations Density 0.040%