INDEX
    Explanations

    references to military divisions and their activities

    New Auto-Interp
    Negative Logits
    SM
    -0.15
     itself
    -0.15
    ades
    -0.15
     Otherwise
    -0.14
     Uncomment
    -0.14
    unders
    -0.14
     otherwise
    -0.14
    anou
    -0.14
    aka
    -0.14
     himself
    -0.13
    POSITIVE LOGITS
     besides
    0.22
     similarly
    0.18
     niż
    0.17
    -than
    0.17
     than
    0.17
    world
    0.16
     equally
    0.16
    than
    0.16
    ¦æĥħ
    0.15
    _than
    0.15
    Act Density 0.292%

    No Known Activations