Explanations
references to cold beverages
Negative Logits
cheminée         -0.66
propOrder        -0.66
pushFollow       -0.54
ftagPool         -0.48
tagext           -0.48
Савезне          -0.47
fumée            -0.46
HomeAsUpEnabled  -0.46
UseVisualStyle   -0.45
fycat            -0.44
Positive Logits
cold             0.66
cool             0.60
COLD             0.56
coolness         0.56
cold             0.52
Cool             0.50
outdoor          0.50
涼               0.50
Cold             0.49
coldness         0.49
Activations Density 0.008%