INDEX
    Explanations

    references to figures in a document

    NEGATIVE LOGITS
    Token            Logit
    " ་་"            -0.71
    " derer"         -0.71
    "Дереккөздер"    -0.69
    "[...]"          -0.69
    "parsedMessage"  -0.69
    "Okay"           -0.68
    " rime"          -0.68
    " fume"          -0.68
    "#>"             -0.68
    "venu"           -0.67
    POSITIVE LOGITS
    Token            Logit
    " Fig"           3.35
    "Fig"            3.18
    " Figs"          2.39
    "Figs"           2.22
    " fig"           2.04
    "fig"            1.83
    " FIG"           1.69
    " figs"          1.52
    "FIG"            1.46
    " Aug"           1.32
    Act Density 0.162%
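
    A minimal sketch of how a density figure like this could be computed, assuming "Act Density" means the fraction of sampled token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, the threshold, and the sample data are all assumptions for illustration; none of them come from this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumed definition: the page does not state how its density value is derived.
    """
    if len(activations) == 0:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 162 of them active.
# These numbers are made up so the result lines up with the figure shown above.
sample = [0.0] * 99_838 + [1.0] * 162
density = activation_density(sample)
print(f"Act Density {density:.3%}")
```

    Under this reading, a density of 0.162% means the feature fires on roughly 1 in 600 sampled tokens, which would fit a feature tied to a narrow pattern such as figure references.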

    No Known Activations