INDEX
    Explanations

    references to emphasis or focus in various contexts

    New Auto-Interp
    Negative Logits
    enberg
    -0.18
    -thumb
    -0.16
    ango
    -0.14
    fts
    -0.14
    agg
    -0.14
    bag
    -0.14
    Fcn
    -0.14
    itto
    -0.14
    puter
    -0.14
    -Sah
    -0.14
    POSITIVE LOGITS
    pell
    0.17
    och
    0.16
    lassen
    0.15
     fishes
    0.15
    597
    0.15
    mpp
    0.14
     osc
    0.14
    forme
    0.14
    oen
    0.14
    allax
    0.14
    Act Density 0.005%

    No Known Activations