INDEX
    Explanations

    repeated occurrences of the letter 's'

    New Auto-Interp
    Negative Logits
     betweenstory
    -0.84
    SequentialGroup
    -0.75
     defaultstate
    -0.74
     pleaſure
    -0.69
     Houſe
    -0.69
     ब्रेकडाउन
    -0.68
    Personensuche
    -0.68
     uſed
    -0.68
     myſelf
    -0.68
     houſe
    -0.67
    POSITIVE LOGITS
    s
    1.11
     own
    0.81
     His
    0.66
    0.66
     their
    0.64
     his
    0.64
     s
    0.64
     egne
    0.59
    ys
    0.55
    ds
    0.55
    Act Density 0.189%

    No Known Activations