INDEX
Explanations
references to specific brand names related to firearms
New Auto-Interp
Negative Logits
catentry
-0.71
PLIED
-0.66
ionage
-0.65
ascript
-0.65
glers
-0.64
ruary
-0.62
MER
-0.60
Revenge
-0.59
士
-0.58
trope
-0.58
POSITIVE LOGITS
nikov
0.97
chuk
0.94
akra
0.86
ubi
0.85
din
0.83
gaard
0.83
nick
0.83
sky
0.82
odan
0.81
inski
0.79
Activations Density 0.020%