    Explanations

    occurrences of mathematical expressions involving numeric values, particularly those with dollar signs, exponents, and equal signs

    Negative Logits
     itſelf
    -1.40
     myſelf
    -1.30
     Efq
    -1.21
     Jefus
    -1.20
     ―――――
    -1.20
     Houſe
    -1.20
    tvguidetime
    -1.19
     Monfieur
    -1.18
     Majefty
    -1.16
     $_"
    -1.16
    Positive Logits
    1.01
    $
    0.99
     $
    0.98
    .
    0.86
    <eos>
    0.82
     $\
    0.78
    '
    0.78
    $\
    0.73
    __))
    0.71
    ↵↵
    0.69
    Act Density 0.524%

    No Known Activations