INDEX
Explanations
mentions of burning or being burnt
words related to burning or destruction by fire
Negative Logits
ournal         -0.72
onsense        -0.69
plom           -0.68
xus            -0.66
alian          -0.66
odore          -0.65
egal           -0.65
stood          -0.64
DonaldTrump    -0.64
IJ             -0.64
Positive Logits
ished          1.06
ishing         1.01
burning        0.96
hotter         0.95
burn           0.94
ishes          0.94
burns          0.87
houses         0.85
burned         0.85
burning        0.83
Activations Density 0.030%