INDEX
Explanations
terms related to high temperature or intense activity
references to the word "hot" in various contexts
New Auto-Interp
Negative Logits
guiActiveUn
-0.80
ufact
-0.80
ĸļ
-0.78
uther
-0.75
yss
-0.72
ajor
-0.72
conclud
-0.70
uth
-0.70
©¶æ
-0.67
ngth
-0.66
POSITIVE LOGITS
dogs
0.99
spots
0.96
bed
0.95
headed
0.94
dog
0.93
ness
0.91
shots
0.89
water
0.88
shot
0.87
ened
0.86
Activations Density 0.025%