INDEX
    Explanations

    instances of the word "you" in various contexts

    New Auto-Interp
    Negative Logits
    ^(@)
    -1.56
     myſelf
    -1.50
     Efq
    -1.49
     betweenstory
    -1.48
    BibitemShut
    -1.47
     Monfieur
    -1.43
    bibfield
    -1.39
     themſelves
    -1.34
     Houſe
    -1.33
     itſelf
    -1.31
    POSITIVE LOGITS
    '
    0.93
    )
    0.92
    ),
    0.92
    ,
    0.91
    -
    0.88
    ...
    0.85
    .
    0.83
    ↵↵
    0.82
    .,
    0.82
    ?
    0.82
    Act Density 0.346%

    No Known Activations