    Explanations

    LaTeX commands and syntax

    Negative Logits
        Token      Value
        " is"      1.09
        " "        1.02
        " was"     0.98
        " of"      0.97
        " to"      0.97
        " were"    0.97
        " had"     0.91
        " has"     0.87
        " I"       0.86
        " for"     0.85
    Positive Logits
        Token             Value
        "lined"           0.86
        "sthe"            0.85
        (not rendered)    0.82
        "ка"              0.82
        "لون"             0.82
        "that"            0.80
        "່ວນ"             0.76
        " посмотреть"     0.76
        "ζει"             0.75
        "を行います"      0.75
    Act Density 0.002%

    No Known Activations