INDEX
    Explanations

    phrases indicating a comparison between two options

    comparative phrases that emphasize distinctions or alternatives

    Negative Logits
    Token      Logit
    "awar"     -0.82
    "ug"       -0.77
    "itiz"     -0.76
    "enium"    -0.75
    "ns"       -0.72
    "hr"       -0.72
    "eg"       -0.72
    "iatus"    -0.72
    "wm"       -0.68
    "tan"      -0.68
    Positive Logits
    Token               Logit
    " preferably"       0.82
    " assuming"         0.70
    " apologies"        0.70
    " alternatively"    0.70
    " optionally"       0.69
    " evidenced"        0.68
    "Ͻ"                 0.66
    " perhaps"          0.65
    " whichever"        0.65
    " allowances"       0.65
    Activation Density: 0.286%

    No Known Activations