INDEX
    Explanations

    references to specific locations or events within a text

    instances of the word "Back" in various contexts

    Negative Logits
    Token          Logit
    "©¶æ¥µ"        -0.64
    "ccess"        -0.63
    "utical"       -0.63
    "ihad"         -0.62
    " Osc"         -0.62
    " tyr"         -0.60
    "izo"          -0.59
    "isen"         -0.59
    "ellen"        -0.59
    " constitu"    -0.59
    Positive Logits
    Token          Logit
    "lash"         1.20
    "stab"         1.18
    "door"         1.11
    "tracking"     1.08
    "GROUND"       1.06
    "yard"         1.05
    "wards"        1.04
    "stage"        1.04
    "dated"        1.02
    "pack"         1.02
    Activation Density: 0.027%

    No Known Activations