INDEX
Explanations
references to snow or winter-related terms
references to "Snow" in various contexts
New Auto-Interp
Negative Logits
igious
-0.71
ect
-0.69
Gujar
-0.65
����
-0.64
hetical
-0.63
izabeth
-0.62
ista
-0.61
ented
-0.61
arians
-0.61
arian
-0.60
POSITIVE LOGITS
flake
1.31
Snow
1.15
Snow
0.89
Leopard
0.86
storms
0.85
mathemat
0.80
dale
0.80
bats
0.79
Crash
0.79
tro
0.79
Activations Density 0.004%