INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     E
    -0.62
     (
    -0.57
     A
    -0.56
     _
    -0.56
     X
    -0.55
     ten
    -0.54
     IV
    -0.54
    -0.54
     base
    -0.52
     D
    -0.52
    POSITIVE LOGITS
     Majefty
    0.99
     greateſt
    0.90
     pleaſure
    0.90
     itſelf
    0.89
     myſelf
    0.89
     Diſ
    0.88
     ſtate
    0.88
     preſent
    0.86
     Chriftian
    0.85
     Anſ
    0.84
    Act Density 1.052%

    No Known Activations