INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     of
    -0.88
     time
    -0.87
    -0.82
     the
    -0.77
    <eos>
    -0.73
    -
    -0.69
     &
    -0.68
     (
    -0.67
    -0.65
     Time
    -0.64
    POSITIVE LOGITS
     myſelf
    1.57
     itſelf
    1.53
    ſelves
    1.50
     Majefty
    1.43
     Efq
    1.43
    ſelf
    1.42
     Jefus
    1.37
     faſt
    1.35
     Monfieur
    1.35
     iſt
    1.31
    Act Density 0.323%

    No Known Activations