INDEX
Explanations
words related to fire and flames
New Auto-Interp
Negative Logits
alian
-0.82
oral
-0.73
laus
-0.72
istan
-0.71
ortment
-0.70
llah
-0.68
ournal
-0.68
los
-0.67
ographics
-0.67
sum
-0.67
POSITIVE LOGITS
flame
1.12
flames
1.05
hotter
0.99
flies
0.97
candles
0.97
candle
0.95
extinguished
0.94
flame
0.93
blaze
0.93
ashes
0.92
Activations Density 0.048%