INDEX
    Explanations

    instances of the word "wonderful."

    New Auto-Interp
    Negative Logits
     houſe
    -1.09
     pleaſure
    -1.08
     Houſe
    -1.06
     greateſt
    -1.06
     faſt
    -1.06
     Reſ
    -1.05
     Efq
    -1.05
     Anſ
    -1.05
     ſever
    -1.04
     Majefty
    -1.02
    POSITIVE LOGITS
     L
    0.69
    0.64
     I
    0.61
     He
    0.60
     El
    0.60
     (
    0.59
     T
    0.57
     is
    0.56
    addComponent
    0.55
     did
    0.54
    Act Density 0.143%

    No Known Activations