INDEX
    Explanations

    occurrences of the language code "en," indicating English language content

    New Auto-Interp
    Negative Logits
     ―――――
    -1.11
     ་་
    -1.09
     ――――――――
    -1.03
     Anſ
    -0.99
     Theſe
    -0.97
     iſt
    -0.97
     Monfieur
    -0.94
     ―――
    -0.93
    ^(@)
    -0.92
     ſind
    -0.90
    POSITIVE LOGITS
    en
    2.06
    EN
    1.90
     EN
    1.86
     En
    1.72
     en
    1.70
    En
    1.68
     Coen
    0.90
    enn
    0.85
    enin
    0.82
     Eno
    0.82
    Act Density 0.037%

    No Known Activations