INDEX
    Explanations

    terms related to foreign policy and international relations

    New Auto-Interp
    Negative Logits
    alach
    -0.16
    lam
    -0.15
    ás
    -0.14
    leich
    -0.14
    itial
    -0.14
    ahir
    -0.14
    lah
    -0.14
    est
    -0.14
    asher
    -0.14
    lor
    -0.13
    POSITIVE LOGITS
    ibern
    0.17
     McGu
    0.15
    547
    0.15
    анÑĤаж
    0.14
    /MPL
    0.14
    845
    0.14
    jec
    0.14
    rzy
    0.14
    hiba
    0.14
    ife
    0.13
    Act Density 0.007%

    No Known Activations