INDEX
    Explanations

    words related to simplicity or straightforwardness

    the word "simply" and its variations

    Negative Logits
    Token       Logit
    "ieth"      -0.73
    "ez"        -0.61
    "gren"      -0.59
    "inal"      -0.58
    "ib"        -0.58
    "vice"      -0.58
    " Heard"    -0.58
    "eur"       -0.56
    "era"       -0.56
    "erest"     -0.55
    Positive Logits
    Token        Logit
    " simply"    3.32
    " merely"    2.38
    "Simply"     1.91
    " Simply"    1.90
    " just"      1.40
    " purely"    1.36
    " mere"      1.36
    " solely"    1.35
    " plainly"   1.33
    " bluntly"   1.27
    Activation Density 0.024%

    No Known Activations