INDEX
    Explanations

    numerical data and formatting information within the text

    Negative Logits
     Fifth
    -0.31
     fifth
    -0.31
     five
    -0.30
     Five
    -0.30
     五
    -0.30
    五
    -0.29
    _five
    -0.28
    five
    -0.28
    5
    -0.26
    Five
    -0.26
    Positive Logits
    8
    0.36
    7
    0.30
     eighth
    0.26
    9
    0.25
    ８
    0.23
     eight
    0.23
     Eighth
    0.23
    ۸
    0.23
     seventh
    0.22
    ८
    0.22
    Act Density 0.062%

    No Known Activations