INDEX
    Explanations

    terms related to transformation and personal change

    New Auto-Interp
    Negative Logits
    roke
    -0.16
    bourg
    -0.15
    ermann
    -0.15
    raman
    -0.15
    份
    -0.14
    æĸĹ
    -0.14
    emm
    -0.14
    erman
    -0.14
    pch
    -0.14
    ITOR
    -0.14
    POSITIVE LOGITS
    ative
    0.18
    ively
    0.17
     Äijá»ķi
    0.16
    iert
    0.16
    atic
    0.15
    oment
    0.15
    AllWindows
    0.15
    Ļæ±Ł
    0.14
    /trans
    0.14
    nemonic
    0.14
    Act Density 0.015%

    No Known Activations