INDEX
    Explanations

    mentions of economic or political sanctions

    New Auto-Interp
    Negative Logits
    WORK
    -0.96
    ITNESS
    -0.83
    ership
    -0.82
    ergy
    -0.79
    edes
    -0.79
    rx
    -0.79
    OTE
    -0.76
    rote
    -0.72
    swick
    -0.72
    ways
    -0.71
    POSITIVE LOGITS
     sanctions
    1.46
     sanction
    1.16
     embargo
    1.07
     imposed
    0.93
     levied
    0.91
     deterrent
    0.90
     deterrence
    0.89
     coerc
    0.89
     relief
    0.86
    ategory
    0.86
    Act Density 0.012%

    No Known Activations