INDEX
    Explanations

    phrases indicating a comparison or contrast

    phrases that emphasize the word "so" as a modifier to convey varying degrees of emphasis or comparison

    New Auto-Interp
    Negative Logits
    Aren
    -0.64
    neum
    -0.63
    MAP
    -0.62
     Ethics
    -0.60
     (>
    -0.59
    osterone
    -0.58
     Encyclopedia
    -0.57
    witz
    -0.57
    IED
    -0.56
    ertodd
    -0.55
    POSITIVE LOGITS
     much
    1.11
     lucky
    0.97
     forgiving
    0.94
     subtly
    0.91
     easy
    0.90
     fortunate
    0.88
     simple
    0.88
    bered
    0.87
     easily
    0.86
     bad
    0.85
    Act Density 0.035%

    No Known Activations