INDEX
Explanations
discussions related to geopolitical dynamics and international relations
Negative Logits
Token          Logit
cola           -0.15
nationwide     -0.14
idor           -0.14
Nationwide     -0.14
(strtolower    -0.14
assembly       -0.14
vsp            -0.13
šť             -0.13
prive          -0.13
egal           -0.13
Positive Logits
Token          Logit
security       0.23
-security      0.22
Cold           0.21
Cold           0.21
Security       0.21
dipl           0.20
/security      0.20
Security       0.20
geo            0.20
hawks          0.20
Activations Density: 0.465%