INDEX
    Explanations

    the beginning of a new document or section

    Tokens often followed by punctuation or mathematical symbols

    calculating probability

    Negative Logits
     itſelf             -1.06
     NUKAT              -1.05
     Jefus              -1.03
    ſelves              -1.02
    RectangleBorder     -1.00
     kasarigan          -1.00
     '\\;'              -0.99
     tartalomajánló     -0.99
    AccessorTable       -0.98
     Roskov             -0.97
    Positive Logits
    ,                   0.48
     to                 0.47
     ne                 0.44
     o                  0.43
     in                 0.42
     by                 0.41
     from               0.38
     with               0.37
     for                0.37
    !                   0.36
    Activation Density: 0.012%

    No Known Activations