INDEX
    Explanations

    phrases indicating comparison or contrast

    phrases that indicate contrast or opposition

    New Auto-Interp
    Negative Logits
    obyl
    -0.77
    liam
    -0.76
    breakers
    -0.76
    nce
    -0.74
    ells
    -0.73
    inho
    -0.71
    estern
    -0.70
    negie
    -0.67
    assies
    -0.67
    aja
    -0.66
    POSITIVE LOGITS
    itably
    0.83
     opposed
    0.79
     necessarily
    0.71
     preferring
    0.71
     allowing
    0.67
    isons
    0.67
     materially
    0.67
     favoring
    0.67
     chronological
    0.66
     letting
    0.64
    Act Density 0.013%

    No Known Activations