INDEX
Explanations
phrases or words related to high temperature or intense activity
instances of the word "hot" used in various contexts
New Auto-Interp
Negative Logits
ufact
-0.91
ajor
-0.79
uther
-0.78
eca
-0.78
INAL
-0.73
atively
-0.72
ARDIS
-0.71
confir
-0.70
yss
-0.69
vous
-0.68
POSITIVE LOGITS
spots
0.91
Chili
0.89
ness
0.86
stove
0.86
headed
0.85
hotter
0.84
hot
0.83
dogs
0.83
potato
0.83
sauce
0.82
Activations Density 0.021%