INDEX
    Explanations

    mentions of Afghanistan and related terms

    Negative Logits
    Token         Logit
    "ignum"       -0.07
    "egg"         -0.07
    " Fortune"    -0.07
    "eum"         -0.07
    "inct"        -0.07
    "nox"         -0.07
    "eson"        -0.07
    "apan"        -0.07
    "portlet"     -0.07
    "اسب"         -0.06
    Positive Logits
    Token         Logit
    "(AF"         0.07
    "447"         0.07
    "rika"        0.06
    "ذا"          0.06
    "ektiv"       0.06
    "elon"        0.06
    "_INET"       0.06
    "elik"        0.06
    " Utt"        0.06
    "-*-"         0.06
    Activation Density: 0.009%

    No Known Activations