INDEX
    Explanations

    the word "example" and nearby words like "test" or "case"

    Negative Logits
    Token          Logit
    "itten"        -0.07
    ".bz"          -0.06
    "eyh"          -0.06
    "Äįan"         -0.06
    "à¥įतर"        -0.06
    "bout"         -0.06
    "iyon"         -0.06
    "(Paint"       -0.06
    "ças"          -0.06
    " Bias"        -0.06
    Positive Logits
    Token          Logit
    "omap"         0.06
    "um"           0.06
    "ummies"       0.06
    "lings"        0.06
    "ublic"        0.06
    "umu"          0.06
    " involving"   0.06
    "umi"          0.06
    "jax"          0.06
    "ifax"         0.06
    Activation Density: 0.061%

    No Known Activations