INDEX
    Explanations

    words related to a direction or place, specifically referring to the "back."

    instances of the word "back" in various contexts

    New Auto-Interp
    Negative Logits
     fuss
    -0.65
     Aires
    -0.62
    tein
    -0.61
    mble
    -0.61
     BUS
    -0.60
     faint
    -0.59
     Osc
    -0.59
    urious
    -0.56
    IFIED
    -0.56
     LINE
    -0.54
    POSITIVE LOGITS
    door
    1.17
    back
    1.11
    wards
    1.06
    lash
    1.06
    dated
    1.01
    gam
    0.94
    tracking
    0.92
    )=(
    0.91
    doors
    0.91
    hoe
    0.90
    Act Density 0.015%

    No Known Activations