INDEX
    Explanations

    phrases indicating a comparison between different aspects or options

    phrases that include alternatives or comparisons

    New Auto-Interp
    Negative Logits
    successfully
    -0.78
    igned
    -0.68
    ially
    -0.64
    ETS
    -0.64
    umerous
    -0.63
    requently
    -0.62
    Duration
    -0.62
    egu
    -0.60
    ottesville
    -0.59
    erest
    -0.59
    POSITIVE LOGITS
    whatever
    1.74
     whatever
    1.73
     something
    1.42
    acle
    1.36
     anything
    1.28
     whoever
    1.27
     somet
    1.23
     whichever
    1.20
    chard
    1.20
     wherever
    1.13
    Act Density 0.156%

    No Known Activations