INDEX
    Explanations

    specific numerical values and counts related to various items and categories

    New Auto-Interp
    Negative Logits
    unas
    -0.15
    133
    -0.15
    optera
    -0.15
    umer
    -0.15
    iaux
    -0.14
    raud
    -0.14
    ory
    -0.14
    ri
    -0.14
    bjerg
    -0.14
    ica
    -0.14
    POSITIVE LOGITS
    ième
    0.19
    fold
    0.17
    -sided
    0.17
    antry
    0.16
    -handed
    0.15
    onta
    0.15
    /qu
    0.15
    /if
    0.15
    orra
    0.15
    odoxy
    0.15
    Act Density 0.205%

    No Known Activations