INDEX
Explanations
phrases related to hygiene practices, specifically handwashing
New Auto-Interp
Negative Logits
oren
-0.14
uvol
-0.14
afone
-0.14
peater
-0.14
Ú¯ÙĦ
-0.14
Nack
-0.13
nét
-0.13
GRID
-0.13
":-
-0.13
lowest
-0.13
POSITIVE LOGITS
soap
0.28
washing
0.26
hands
0.25
wash
0.25
hand
0.25
Soap
0.24
Wash
0.23
Soap
0.23
soap
0.23
washed
0.22
Activations Density 0.024%