INDEX
    Explanations

    concepts related to change and transition

    New Auto-Interp
    Negative Logits
    iet
    -0.14
    arde
    -0.14
    adder
    -0.14
     thunk
    -0.14
     rather
    -0.14
    illis
    -0.13
    -vs
    -0.13
     vs
    -0.13
    chet
    -0.13
     Vs
    -0.13
    POSITIVE LOGITS
     ones
    0.40
     Ones
    0.27
     counterpart
    0.22
    ones
    0.21
     counterparts
    0.20
    á»ĵm
    0.18
    ONES
    0.17
    .decor
    0.16
    (er
    0.15
    ãĤĤãģ®
    0.15
    Act Density 0.207%

    No Known Activations