INDEX
Explanations
references to explosives
references to explosives and related terms
New Auto-Interp
Negative Logits
aird
-0.89
Seah
-0.76
)=(
-0.73
naire
-0.72
esan
-0.71
Goose
-0.69
ups
-0.69
heit
-0.69
gling
-0.67
SEA
-0.66
POSITIVE LOGITS
detonated
1.10
deton
0.98
barric
0.92
explosives
0.91
deterrent
0.89
disposal
0.86
detectors
0.83
detector
0.81
ilitary
0.79
kit
0.78
Activations Density 0.029%