INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     the
    -0.59
     her
    -0.48
    LookAnd
    -0.48
    tagHelperRunner
    -0.46
     }_{\
    -0.45
    cho
    -0.45
    //
    -0.45
    bu
    -0.45
    ­
    -0.45
     van
    -0.44
    POSITIVE LOGITS
     myſelf
    0.77
     Majefty
    0.75
     auffi
    0.75
     Monfieur
    0.74
     pleaf
    0.73
     quæ
    0.71
    entibus
    0.70
    amaño
    0.70
     Efq
    0.70
     ſtate
    0.69
    Act Density 1.310%

    No Known Activations