INDEX
Explanations
references to actions related to a shower
mentions of showers
New Auto-Interp
Negative Logits
rian
-0.75
folios
-0.67
Dominion
-0.66
ravel
-0.65
cius
-0.64
sem
-0.64
sup
-0.63
interest
-0.62
Libertarian
-0.61
verage
-0.60
POSITIVE LOGITS
showers
1.00
ysis
0.92
curtain
0.90
ing
0.90
robe
0.87
shower
0.86
ennes
0.81
ashore
0.79
issance
0.78
atur
0.77
Activations Density 0.008%