INDEX
Explanations
instances of the term "foreign policy."
references to foreign policy
Negative Logits
────────     -0.85
amaz         -0.82
LOAD         -0.80
DAY          -0.80
FORMATION    -0.77
upon         -0.76
oven         -0.76
RGB          -0.74
AUT          -0.72
sample       -0.72
Positive Logits
advisor      0.95
adviser      0.93
advisors     0.92
diplomacy    0.89
superpower   0.88
escalation   0.87
stance       0.86
advisers     0.84
olitics      0.83
theorist     0.83
Activations Density 0.036%
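
As a rough illustration of how a density figure like the one above could be computed, the sketch below assumes that "activation density" means the fraction of sampled token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample data are all assumptions for illustration; none of them come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumed definition: density = (# active positions) / (# positions).
    """
    if not activations:
        raise ValueError("need at least one activation")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 4 of which activate the feature.
sample = [0.0] * 9996 + [1.2, 0.4, 3.1, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.036% would mean the feature is active on roughly 36 out of every 100,000 sampled tokens.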