INDEX
Explanations
phrases related to explosions or explosive devices
New Auto-Interp
Negative Logits
oldem
-0.19
ernals
-0.16
edor
-0.15
lify
-0.15
INF
-0.15
à¥Ģस
-0.14
ools
-0.14
Interpolator
-0.14
vitae
-0.14
yte
-0.14
POSITIVE LOGITS
arded
0.27
shell
0.27
arding
0.27
astic
0.20
ard
0.20
(shell
0.18
ards
0.18
astically
0.18
adil
0.17
bomb
0.17
Activations Density 0.016%