INDEX
Explanations
references to the winter season or events related to it
references to winter-themed events or locations
New Auto-Interp
Negative Logits
inates
-0.88
riad
-0.84
ulhu
-0.79
onent
-0.75
-|
-0.74
atem
-0.72
tical
-0.71
Cu
-0.70
ean
-0.70
%]
-0.70
POSITIVE LOGITS
Winter
1.12
winter
1.02
Winter
1.01
rink
0.99
thur
0.97
Wonderland
0.93
fell
0.86
Wolves
0.84
Jackets
0.79
Olympics
0.76
Activations Density 0.004%