INDEX
    Explanations

    phrases related to positioning or location

    phrases indicating spatial positioning or location

    New Auto-Interp
    Negative Logits
    rug
    -0.71
    ories
    -0.70
    iple
    -0.69
    vari
    -0.69
    anship
    -0.68
    ivities
    -0.65
    ldom
    -0.65
    ague
    -0.63
    marine
    -0.63
    partial
    -0.63
    POSITIVE LOGITS
     cue
    1.11
     doorstep
    0.80
     front
    0.72
     heels
    0.70
     center
    0.64
     button
    0.63
    !:
    0.62
    oho
    0.61
    !?"
    0.61
     Bolt
    0.61
    Act Density 0.122%

    No Known Activations