Explanations
phrases related to baths or bathroom activities
references to bathtubs and bathing
Negative Logits
vernment         -0.68
srf              -0.66
bably            -0.65
deported         -0.63
vague            -0.63
VEN              -0.62
backer           -0.61
pez              -0.61
unsustainable    -0.61
KNOWN            -0.61
Positive Logits
tub              1.70
rooms            1.29
room             1.29
urst             1.26
robe             1.18
baths            0.95
salts            0.91
maid             0.91
bath             0.89
Spa              0.89
Activations Density: 0.015%