INDEX
Explanations
references to swimming activities
references to swimming and related activities
New Auto-Interp
Negative Logits
crushing
-0.71
oppers
-0.70
relevant
-0.70
paving
-0.65
________________________
-0.62
oned
-0.61
aples
-0.60
setback
-0.60
################
-0.59
crush
-0.59
POSITIVE LOGITS
Swim
1.42
swim
1.28
suit
1.00
suits
1.00
boats
0.95
boat
0.94
tub
0.89
swimming
0.83
fins
0.82
halla
0.82
Activations Density 0.004%