INDEX
    Explanations

    numerical references and citations within academic or formal texts

    Negative Logits
     sixth       -0.37
    6            -0.36
     Sixth       -0.34
     SIX         -0.34
     six         -0.33
     六          -0.32
    六           -0.32
    -six         -0.30
     Six         -0.30
     seventh     -0.29
    Positive Logits
    2            0.26
    第二         0.24
     second      0.23
     II          0.22
     Second      0.22
    ②            0.21
    二           0.20
     第二        0.20
    ۲            0.19
    02           0.19
    Activation Density: 0.096%

    No Known Activations