INDEX
Explanations
descriptions of cleanliness and tidiness in domestic settings
New Auto-Interp
Negative Logits
uxt
-0.13
SV
-0.13
ActiveForm
-0.13
æ½ľ
-0.13
uyu
-0.13
лод
-0.13
roofing
-0.13
uniqueness
-0.13
cramped
-0.12
thin
-0.12
POSITIVE LOGITS
clean
0.39
Clean
0.35
-clean
0.35
clean
0.34
Clean
0.33
cleaned
0.33
CLEAN
0.32
cleanliness
0.31
(clean
0.28
cleaning
0.28
Activations Density 0.189%