INDEX
Explanations
mentions of weapons
references to weapons
NEGATIVE LOGITS
gres          -0.84
apest         -0.81
Strawberry    -0.77
lus           -0.75
coli          -0.71
noon          -0.71
dit           -0.69
alg           -0.69
▬             -0.67
Adams         -0.66
POSITIVE LOGITS
powder         0.96
caches         0.89
weapons        0.89
arsenal        0.88
manship        0.86
shipments      0.82
usable         0.81
smugglers      0.81
systems        0.81
manufacturer   0.80
Activations Density 0.036%