INDEX
Explanations
terms related to winter clothing and warmth
New Auto-Interp
Negative Logits
myſelf
-0.74
himſelf
-0.72
Efq
-0.71
themſelves
-0.69
itſelf
-0.67
BibitemShut
-0.64
Reſ
-0.64
ſtill
-0.63
Eſ
-0.63
Theſe
-0.62
POSITIVE LOGITS
winter
0.87
winters
0.70
Winter
0.70
winter
0.70
WINTER
0.69
shivering
0.67
Winter
0.66
warm
0.66
☃
0.65
hiver
0.65
Activations Density 0.092%