    Negative Logits
    Token          Logit
    " itſelf"      -1.12
    "ſelf"         -1.05
    " Jefus"       -1.03
    " purpoſe"     -1.02
    " Efq"         -1.02
    " myſelf"      -0.99
    " chofe"       -0.94
    " prevailed"   -0.93
    " pleaſure"    -0.93
    " raiſ"        -0.91
    Positive Logits
    Token          Logit
    " ["           0.57
    " a"           0.56
    " the"         0.55
    " an"          0.55
    " set"         0.54
    <eos>          0.52
    " setting"     0.52
    ","            0.52
    " both"        0.52
    " events"      0.51
    Activation Density: 0.017%

    No Known Activations