INDEX
Explanations
references to guns and gun control
New Auto-Interp
Negative Logits
Ede
-0.44
ctiveness
-0.41
Ej
-0.41
workflow
-0.40
ered
-0.40
fond
-0.39
ified
-0.39
structure
-0.39
erda
-0.39
forskj
-0.39
POSITIVE LOGITS
gun
0.92
firearm
0.81
guns
0.81
firearms
0.77
guns
0.77
Guns
0.76
GUN
0.75
Gun
0.75
gun
0.74
Firearms
0.70
Activations Density 0.008%