INDEX
    Explanations

    numeric and date-related information

    New Auto-Interp
    Negative Logits
     ä¸ĥ
    -0.15
     Fifth
    -0.15
     Five
    -0.15
    _Entry
    -0.14
     Seventh
    -0.14
     Fourth
    -0.14
     Seven
    -0.14
     ÙĨÙĪÙģ
    -0.14
     fourth
    -0.14
    ï¼Ļ
    -0.13
    POSITIVE LOGITS
    1
    0.25
    0
    0.24
    2
    0.22
    3
    0.21
    .
    0.19
    <|end_of_text|>
    0.19
    143
    0.17
    8
    0.17
    9
    0.17
    127
    0.17
    Act Density 0.041%

    No Known Activations