INDEX
Explanations
terms related to firearms and gun control
references to gun-related topics and issues
New Auto-Interp
Negative Logits
Borders
-0.75
Richards
-0.70
pillars
-0.69
intervals
-0.68
Shards
-0.66
amaz
-0.65
Jed
-0.65
pleasantly
-0.63
Cind
-0.63
observations
-0.60
POSITIVE LOGITS
carry
1.31
related
1.27
safety
1.23
themed
1.22
indust
1.18
loving
1.17
free
1.16
training
1.13
based
1.10
policy
1.10
Activations Density 0.081%