INDEX
Explanations
words related to outdoor activities and sports
New Auto-Interp
Negative Logits
ies
-0.24
eus
-0.21
IES
-0.20
ishly
-0.20
y
-0.19
iest
-0.19
hurst
-0.18
edException
-0.18
eenth
-0.18
boro
-0.17
POSITIVE LOGITS
ting
0.53
ging
0.50
ming
0.46
ged
0.41
bing
0.40
ning
0.39
gers
0.38
GING
0.36
ding
0.36
MING
0.35
Activations Density 0.086%