INDEX
    Explanations

    punctuations or sentence-ending marks

    New Auto-Interp
    Negative Logits
    ernes
    -0.17
    499
    -0.16
    Interval
    -0.16
     Interval
    -0.15
    549
    -0.15
    737
    -0.15
    211
    -0.15
    anz
    -0.15
     Interrupt
    -0.15
     Hide
    -0.14
    POSITIVE LOGITS
     Gow
    0.16
    antas
    0.14
    olo
    0.14
    æŁı
    0.14
    .datab
    0.13
    MLE
    0.13
    anford
    0.13
     swinger
    0.13
    aż
    0.13
    ázev
    0.13
    Act Density 0.084%

    No Known Activations