INDEX
    Explanations

    punctuation marks and questions in text

    New Auto-Interp
    Negative Logits
    !
    -0.58
     low
    -0.54
     ideal
    -0.53
     Off
    -0.52
     In
    -0.51
     yes
    -0.51
    <h5>
    -0.50
     !
    -0.50
     Yes
    -0.50
     pure
    -0.50
    POSITIVE LOGITS
     Majefty
    1.03
     Efq
    1.03
    SourceChecksum
    0.98
    ?!?
    0.98
     Houſe
    0.97
    ſelves
    0.95
    ?!?!
    0.94
    ?!"
    0.92
     Monfieur
    0.91
     myſelf
    0.89
    Act Density 0.196%

    No Known Activations