INDEX
Explanations
references to military-related terms, such as specific country names, military ranks, and weapons
references to grenades and related military terms
New Auto-Interp
Negative Logits
stakes
-0.79
orporated
-0.71
terson
-0.69
cript
-0.68
Penn
-0.67
tale
-0.67
gres
-0.67
tle
-0.64
elf
-0.64
enance
-0.63
POSITIVE LOGITS
adier
0.98
Launcher
0.76
Grenade
0.75
grenades
0.74
adian
0.72
grenade
0.72
vernment
0.69
launchers
0.66
oses
0.65
inches
0.64
Activations Density 0.081%