INDEX
    Explanations

    references to numerical values, particularly focusing on counts and identifiers

    New Auto-Interp
    Negative Logits
    uzu
    -0.07
    ough
    -0.07
    ing
    -0.07
    inx
    -0.07
    è§
    -0.07
    ptal
    -0.06
    ạc
    -0.06
    comes
    -0.06
    gün
    -0.06
    ymi
    -0.06
    POSITIVE LOGITS
    ity
    0.08
    ucci
    0.08
    aken
    0.07
    jourd
    0.07
    alogy
    0.07
    ellan
    0.07
     discrepan
    0.07
    ulp
    0.07
     rig
    0.06
     latter
    0.06
    Act Density 0.036%

    No Known Activations