Explanations
the word "explosion"
mentions of explosions
Negative Logits
hist      -0.76
secut     -0.75
stra      -0.74
tern      -0.72
nda       -0.70
ĻĤ        -0.69
Ĥ         -0.68
stood     -0.66
guyen     -0.65
posted    -0.64
Positive Logits
explosion     1.20
explosions    0.89
Explosion     0.85
blasts        0.85
boom          0.84
bursting      0.83
fireball      0.83
explodes      0.82
burst         0.81
blast         0.80
Activations Density: 0.011%