INDEX
Explanations
mentions of bathing-related activities and settings
New Auto-Interp
Negative Logits
談社
-0.72
fram
-0.65
scaff
-0.64
Dioxide
-0.64
Kepler
-0.64
phazard
-0.63
Westwood
-0.62
airs
-0.62
qvarna
-0.61
Dessert
-0.60
POSITIVE LOGITS
bath
1.11
baths
1.11
bathing
1.06
bathe
1.05
swimmers
1.04
swim
0.97
Swim
0.94
Swim
0.92
swimming
0.90
Bathing
0.90
Activations Density 0.182%