INDEX
    Explanations

    the presence of zeros and numerical values in the text

    New Auto-Interp
    Negative Logits
     myſelf
    -0.95
    dafx
    -0.94
    DockStyle
    -0.93
     Monfieur
    -0.93
     Jefus
    -0.92
     reaſon
    -0.91
    -0.90
    ^(@)
    -0.89
     itſelf
    -0.88
     Anſ
    -0.83
    POSITIVE LOGITS
    </blockquote>
    0.67
    doi
    0.59
      
    0.53
    ni
    0.52
     tek
    0.51
     https
    0.51
     http
    0.51
    те
    0.51
    ?
    0.50
    acaktır
    0.50
    Act Density 0.232%

    No Known Activations