INDEX
    Explanations

    references to location or position in relation to the word "front."

    New Auto-Interp
    Negative Logits
    dna
    -0.16
    cles
    -0.16
    bach
    -0.14
    its
    -0.14
     æ¼
    -0.14
    ÑĦÑĦ
    -0.14
    ewidth
    -0.14
    ampion
    -0.14
    htag
    -0.14
    hid
    -0.14
    POSITIVE LOGITS
    iers
    0.27
    isp
    0.25
    /back
    0.21
    tier
    0.20
    -row
    0.20
    -runner
    0.20
    ality
    0.19
    ally
    0.19
    eer
    0.18
    matter
    0.17
    Act Density 0.036%

    No Known Activations