INDEX
    Explanations

    terms related to constancy and related concepts in various contexts

    New Auto-Interp
    Negative Logits
    age
    -0.18
    oring
    -0.17
    aty
    -0.15
    Ñīина
    -0.15
    lo
    -0.14
    gage
    -0.14
    Ph
    -0.14
    onder
    -0.14
    pring
    -0.14
    spir
    -0.14
    POSITIVE LOGITS
    uctor
    0.22
    antly
    0.19
    ipation
    0.19
    ellation
    0.19
    ople
    0.18
    undra
    0.17
    ipated
    0.17
    ihu
    0.16
    igham
    0.16
    EXPR
    0.15
    Act Density 0.043%

    No Known Activations