    Negative Logits
     Efq            -0.97
     myſelf         -0.91
     raiſ           -0.91
     poffible       -0.87
     pleaſure       -0.87
     Shakspeare     -0.86
    AnchorStyles    -0.84
     purpoſe        -0.84
    ^(@)            -0.82
     Conſ           -0.81
    Positive Logits
    :               0.42
    .               0.42
                    0.41
    Stream          0.41
    îtra            0.40
     dall           0.39
    ``              0.39
     •              0.39
    ‘‘              0.39
     IN             0.38
    Activation Density 0.044%

    No Known Activations