Explanations
phrases related to ongoing wars or conflicts
references to various "wars" on different issues or topics
Negative Logits
NBA        -0.72
BIP        -0.71
ittal      -0.71
Patch      -0.70
rete       -0.68
HF         -0.68
yip        -0.67
meet       -0.67
UF         -0.67
Trigger    -0.65
Positive Logits
drugs            1.23
Drugs            1.10
terror           1.07
whistleblowers   0.97
terrorism        0.95
terror           0.93
poverty          0.89
Terror           0.87
Poverty          0.82
Terrorism        0.79
Activations Density 0.086%