INDEX
    Explanations

    references to societal themes and dynamics

    New Auto-Interp
    Negative Logits
    609
    -0.15
    oup
    -0.15
    .Internal
    -0.14
    atch
    -0.14
    100
    -0.14
    iton
    -0.14
    votes
    -0.14
     Anch
    -0.14
    jure
    -0.14
    çģŃ
    -0.14
    POSITIVE LOGITS
     similarly
    0.19
     similar
    0.17
    alking
    0.17
    åIJĮ
    0.15
    acci
    0.15
    Similarly
    0.15
     same
    0.14
    utar
    0.14
     ware
    0.14
    similar
    0.14
    Act Density 0.211%

    No Known Activations