INDEX
Explanations
references to laundry and cleaning items
New Auto-Interp
Negative Logits
åĪĢ
-0.16
ãĥ³ãĥģ
-0.15
ÑĸÑĩ
-0.15
OTS
-0.15
.Serve
-0.15
solder
-0.15
sonian
-0.14
곤
-0.14
جات
-0.14
Cz
-0.14
POSITIVE LOGITS
tumble
0.32
detergent
0.30
laundry
0.29
deter
0.28
washing
0.27
washer
0.26
Laundry
0.26
tum
0.26
clothes
0.26
dryer
0.25
Activations Density 0.066%