INDEX
    Explanations

    mentions of improvement or comparison for different scenarios, with a preference towards the concept of "better"

    the word "better" and its variations

    New Auto-Interp
    Negative Logits
     Pione
    -0.73
    ategory
    -0.66
    cha
    -0.65
    sup
    -0.64
    ette
    -0.63
    cano
    -0.62
    amine
    -0.61
    kaya
    -0.61
    umo
    -0.60
    ums
    -0.59
    POSITIVE LOGITS
     than
    1.31
    than
    1.09
     suited
    1.08
     behaved
    1.04
     Than
    1.03
    ment
    0.95
     acquainted
    0.94
     equipped
    0.86
    ments
    0.81
     luck
    0.80
    Act Density 0.067%

    No Known Activations