INDEX
    Explanations

    references to figures and tables in a document

    New Auto-Interp
    Negative Logits
    /Layout
    -0.15
    APPED
    -0.14
    ukes
    -0.14
    ycop
    -0.14
    arg
    -0.14
    icals
    -0.14
    pii
    -0.14
    lama
    -0.13
    SSIP
    -0.13
    dge
    -0.13
    POSITIVE LOGITS
    arella
    0.17
    kiem
    0.14
     Bilim
    0.14
     æ¾
    0.14
    à¸ŀà¸Ļ
    0.14
    osph
    0.14
     wasted
    0.13
    念
    0.13
     below
    0.13
    reek
    0.13
    Act Density 0.085%

    No Known Activations