INDEX
    Explanations

    actions of transforming or converting things into something else

    phrases about transformation or conversion processes

    New Auto-Interp
    Negative Logits
    cation
    -0.77
    inately
    -0.73
    ritz
    -0.72
    erity
    -0.70
    ran
    -0.69
    ername
    -0.68
    enance
    -0.66
    antage
    -0.65
    raint
    -0.64
    no
    -0.64
    POSITIVE LOGITS
     usable
    0.88
     something
    0.73
     profitable
    0.65
    ãĥ¼ãĥ
    0.64
    quished
    0.64
    AFTA
    0.62
     a
    0.61
    Obj
    0.61
     surrogate
    0.59
     productive
    0.59
    Act Density 0.071%

    No Known Activations