INDEX
Explanations
words related to swimming
references to swimming and swim-related activities
New Auto-Interp
Negative Logits
naires
-0.80
oned
-0.73
CRIP
-0.68
relevant
-0.65
________________________
-0.64
haar
-0.63
misplaced
-0.61
tense
-0.60
ICES
-0.60
ASE
-0.60
POSITIVE LOGITS
suit
0.97
swim
0.97
Swim
0.96
suits
0.93
swimming
0.91
boat
0.90
tub
0.87
halla
0.85
estone
0.82
upstream
0.82
Activations Density 0.018%