INDEX
    Explanations

    words related to movement or transportation, especially involving return or reversal of direction

    instances of the word "back"

    New Auto-Interp
    Negative Logits
    entric
    -0.69
    inational
    -0.66
    ities
    -0.64
     viz
    -0.62
     delegation
    -0.62
    enaries
    -0.61
    eria
    -0.61
    izo
    -0.61
    ision
    -0.61
    ifix
    -0.59
    POSITIVE LOGITS
    dated
    1.20
    packs
    1.10
    fires
    1.07
    packing
    1.04
    fired
    1.01
    haul
    1.01
    wards
    1.00
    GROUND
    0.99
    doors
    0.97
    rower
    0.96
    Act Density 0.054%

    No Known Activations