INDEX
    Explanations

    descriptive phrases introducing people, objects, or concepts in a positive manner

    New Auto-Interp
    Negative Logits
    alion
    -0.17
    alars
    -0.15
    okable
    -0.14
    AGR
    -0.14
    anean
    -0.13
    bot
    -0.13
    cob
    -0.13
    geist
    -0.13
     goodness
    -0.13
    ê·Ģ
    -0.13
    POSITIVE LOGITS
     ticking
    0.15
     Pist
    0.15
    καν
    0.14
     Äijôi
    0.14
     Benn
    0.13
    ipe
    0.13
     Powell
    0.13
    esson
    0.13
    xa
    0.13
    ONSE
    0.13
    Act Density 0.088%

    No Known Activations