INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     “
    -0.67
     he
    -0.66
     sa
    -0.66
     Us
    -0.64
     us
    -0.63
     c
    -0.63
     a
    -0.62
    -0.62
     cer
    -0.61
     por
    -0.60
    POSITIVE LOGITS
     Monfieur
    1.13
     Reſ
    1.09
     pleaſure
    1.01
     Shakspeare
    0.98
     myſelf
    0.93
     Houſe
    0.92
     Moslem
    0.92
     Shaksp
    0.92
     Cæsar
    0.91
     Eſ
    0.91
    Act Density 2.720%

    No Known Activations