INDEX
Explanations
references to explosive events or related situations
references to explosions or blasts
New Auto-Interp
Negative Logits
ccording
-0.89
pires
-0.84
gemony
-0.81
prison
-0.77
compr
-0.76
Serial
-0.75
phis
-0.75
Decre
-0.75
guyen
-0.75
Development
-0.70
POSITIVE LOGITS
blast
1.35
blasts
1.13
furnace
0.92
blast
0.89
furn
0.85
waves
0.78
ocy
0.78
bursting
0.77
showers
0.77
astically
0.77
Activations Density 0.010%