INDEX
    Explanations

    numerical values or parameters in various contexts

    New Auto-Interp
    Negative Logits
    06
    -0.18
    906
    -0.17
    306
    -0.17
     Five
    -0.17
    05
    -0.17
    _five
    -0.16
     äºĶ
    -0.16
    ives
    -0.16
     Fifth
    -0.16
    /******/
    -0.16
    POSITIVE LOGITS
    7
    0.29
    8
    0.26
     seventh
    0.26
     eighth
    0.22
     seven
    0.22
     Seventh
    0.20
     VII
    0.19
     eight
    0.19
    à¥Ń
    0.18
     July
    0.18
    Act Density 0.076%

    No Known Activations