INDEX
    Explanations

    references to political entities and conflicts

    proper nouns related to countries, political entities, and organizations

    New Auto-Interp
    Negative Logits
     partName
    -0.65
     âĢº
    -0.64
    Sym
    -0.64
    ãĥ£
    -0.61
    \":
    -0.59
     <<
    -0.58
    pmwiki
    -0.54
    rencies
    -0.53
    aj
    -0.52
     Morty
    -0.51
    POSITIVE LOGITS
     embassy
    0.64
    artney
    0.60
    ledged
    0.59
    erent
    0.57
    usalem
    0.57
     himself
    0.57
    agine
    0.56
     abroad
    0.56
    ensis
    0.56
    inges
    0.54
    Act Density 0.733%

    No Known Activations