INDEX
    Explanations

    nationalities

    cultural figures and political activists from specific countries.

    New Auto-Interp
    Negative Logits
    !]
    -0.06
     recurring
    -0.06
    اعة
    -0.06
    ющихся
    -0.06
     polic
    -0.06
    entities
    -0.06
    .embedding
    -0.06
     keywords
    -0.06
    .scheduler
    -0.06
    _CONSOLE
    -0.05
    POSITIVE LOGITS
    цу
    0.08
     bron
    0.07
    vn
    0.07
     Melissa
    0.07
    _mgmt
    0.06
     Commun
    0.06
    ngle
    0.06
     urgency
    0.06
     follow
    0.06
    。而
    0.06
    Act Density 0.016%

    No Known Activations