INDEX
    Explanations

    patterns indicative of LaTeX formatting or markup syntax

    New Auto-Interp
    Negative Logits
     LENG
    -0.17
    iffin
    -0.15
    olio
    -0.15
    дон
    -0.15
    rvé
    -0.15
    quip
    -0.15
    asmus
    -0.14
    eger
    -0.14
    eydi
    -0.14
    antine
    -0.13
    POSITIVE LOGITS
     te
    0.15
    uda
    0.14
    kc
    0.14
     Te
    0.14
     ar
    0.14
     Creek
    0.14
     pal
    0.13
    udy
    0.13
     â
    0.13
    â
    0.13
    Act Density 0.004%

    No Known Activations