INDEX
Explanations
mentions of cleaning or removing something
Negative Logits
yip              -0.99
abul             -0.65
XT               -0.65
akening          -0.64
trl              -0.64
ommod            -0.64
PsyNetMessage    -0.63
arsity           -0.63
Mand             -0.63
MP               -0.62

Positive Logits
liness           1.05
cleaned          0.92
ashore           0.89
cleaner          0.88
linen            0.85
clean            0.85
towels           0.80
cleaners         0.80
toilets          0.79
stains           0.79
Activations Density 2.821%
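
A minimal sketch of how a density figure like the one above could be computed, assuming it means the fraction of sampled token positions on which the feature has a nonzero activation. That definition, the function name `activation_density`, the `threshold` parameter, and the sample values are all assumptions made for illustration; none of them come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumed definition: density = (# positions with activation > threshold) / (# positions).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up activations for 8 token positions (illustrative only, not real data).
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.4, 0.0]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 25.000%
```

Under this reading, a density of 2.821% would mean the feature is active on roughly 28 of every 1,000 sampled tokens.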