INDEX
Explanations
mentions of the word "Frost"
references to frost and freezing conditions
Negative Logits
perature    -0.69
gae         -0.68
alogue      -0.67
terior      -0.67
selves      -0.67
Lumpur      -0.67
>>>>>>>>    -0.66
Indones     -0.66
pert        -0.66
ucha        -0.65
Positive Logits
bite        1.28
enstein     0.94
burn        0.88
ships       0.86
flake       0.83
ed          0.80
idious      0.79
fur         0.77
fall        0.77
fell        0.77
Activations Density 0.034%