INDEX
    Explanations

    words related to actions that involve changing or transforming something

    variations of the word "error"

    New Auto-Interp
    Negative Logits
    kefeller
    -0.70
    IGHTS
    -0.69
    ership
    -0.68
    Premium
    -0.65
    ELS
    -0.64
    hips
    -0.64
     Spears
    -0.64
     supper
    -0.63
     Painter
    -0.63
    esville
    -0.62
    POSITIVE LOGITS
    asure
    1.28
    asures
    1.05
    rant
    1.00
    ogenous
    0.97
    icit
    0.94
    aser
    0.92
    rors
    0.91
    asing
    0.91
    got
    0.91
    mine
    0.91
    Act Density 0.015%

    No Known Activations