INDEX
    Explanations

    terms related to foreign policy

    mentions of foreign policy

    New Auto-Interp
    Negative Logits
    amaz
    -0.81
    FORMATION
    -0.81
    upon
    -0.79
    oven
    -0.79
    igans
    -0.79
    berry
    -0.74
    DAY
    -0.73
    amination
    -0.72
    username
    -0.72
    unn
    -0.71
    POSITIVE LOGITS
     superpower
    1.01
     advisor
    1.00
     adviser
    1.00
     advisors
    1.00
     advisers
    0.97
     theorist
    0.92
     crises
    0.91
     objectives
    0.90
     diplomacy
    0.88
     correctness
    0.85
    Act Density 0.058%

    No Known Activations