    Explanations

    numerical sequences or lists within a document

    Negative Logits
    Token       Logit
    "ibri"      -0.15
    "ij¸"       -0.15
    "afc"       -0.14
    "ovice"     -0.14
    "ynet"      -0.14
    "imbus"     -0.14
    "nnen"      -0.14
    " fours"    -0.13
    "usra"      -0.13
    "fbe"       -0.13
    Positive Logits
    Token       Logit
    "4"         0.27
    "6"         0.27
    "7"         0.25
    "8"         0.25
    "5"         0.25
    "9"         0.24
    "3"         0.24
    " "         0.21
    "2"         0.19
    "10"        0.19
    Activation Density: 0.071%

    No Known Activations