INDEX
    Explanations

    historical references to colonialism and territorial changes

    New Auto-Interp
    Negative Logits
     str
    -0.06
    aille
    -0.06
    ley
    -0.06
     Ved
    -0.06
    ottage
    -0.06
    exampleInput
    -0.06
    byter
    -0.06
    ocha
    -0.05
     galleries
    -0.05
     bod
    -0.05
    POSITIVE LOGITS
     Hurt
    0.08
    IENT
    0.07
    961
    0.07
    .AUTO
    0.07
    ãģıãģł
    0.07
     nouvel
    0.06
     Slut
    0.06
    íķĺìĭł
    0.06
     Hub
    0.06
    ÏĥοÏħ
    0.06
    Act Density 0.082%

    No Known Activations