INDEX
    Explanations

    mentions of people's names

    references to specific individuals or entities, particularly in a political context

    Negative Logits
    Token            Logit
    " constitu"      -0.67
    "geoning"        -0.60
    " sts"           -0.59
    " Emerald"       -0.56
    "upload"         -0.55
    "ioch"           -0.55
    "manship"        -0.55
    "ldom"           -0.55
    " tack"          -0.55
    "itational"      -0.55
    Positive Logits
    Token            Logit
    "ãĥ¯ãĥ³"         0.65
    "uese"           0.65
    "AAF"            0.64
    "ij士"           0.63
    "OIL"            0.63
    "oji"            0.62
    "ippi"           0.61
    "WithNo"         0.61
    "qua"            0.61
    "cium"           0.60
    Activation Density: 0.176%

    No Known Activations