INDEX
    Explanations

    numerical values relating to measurements or counts

    Negative Logits
    ritte
    -0.14
     jus
    -0.14
    ron
    -0.14
    lub
    -0.13
    fait
    -0.13
    å
    -0.13
    eten
    -0.13
    argo
    -0.13
    sel
    -0.13
    .into
    -0.13
    Positive Logits
    kea
    0.19
    herits
    0.15
    berman
    0.15
    st
    0.15
    \views
    0.15
    amarin
    0.14
    cho
    0.14
    quo
    0.14
    贫
    0.14
     seiz
    0.13
    Act Density 0.060%

    No Known Activations