INDEX
Explanations
content related to national security issues and implications
New Auto-Interp
Negative Logits
quier
-0.16
esso
-0.14
zie
-0.14
letic
-0.13
Nom
-0.13
phia
-0.13
oped
-0.13
rl
-0.13
Savage
-0.13
windshield
-0.13
POSITIVE LOGITS
security
0.68
national
0.65
security
0.54
Security
0.52
-security
0.52
Security
0.50
national
0.50
_security
0.47
.security
0.46
SECURITY
0.45
Activations Density 0.184%