INDEX
    Explanations

    repeated phrases or connectors, particularly variations of "in."

    Negative Logits
    Token            Logit
    " ―――――"         -1.50
    " ་་"            -1.48
    " Anſ"           -1.46
    " ――――――――"      -1.28
    " iſt"           -1.24
    " itſelf"        -1.24
    " Monfieur"      -1.23
    "ſelf"           -1.21
    " Theſe"         -1.13
    " myſelf"        -1.12
    Positive Logits
    Token            Logit
    " en"            2.46
    " EN"            1.19
    " em"            1.08
    " in"            1.04
    " En"            0.97
    "en"             0.96
    " σε"            0.92
    " on"            0.91
    " в"             0.88
    " at"            0.85
    Act Density 0.024%

    No Known Activations