INDEX
    Explanations

    instances of names, titles, and terms related to subjects or concepts

    New Auto-Interp
    Negative Logits
     myſelf
    -0.83
     itſelf
    -0.78
     Majefty
    -0.76
     juſ
    -0.76
     raiſ
    -0.75
     Reſ
    -0.74
     ſeveral
    -0.73
     whoſe
    -0.73
     poffible
    -0.71
     ſche
    -0.70
    POSITIVE LOGITS
     simply
    1.21
     ‘
    1.07
     '
    1.00
    simply
    0.98
     "
    0.96
     “
    0.96
     simplesmente
    0.92
     semplicemente
    0.92
     appunto
    0.91
     simplemente
    0.90
    Act Density 0.228%

    No Known Activations