INDEX
    Explanations

    references to visual elements, such as captions and figures, in the text

    Negative Logits
    "acock"      -0.16
    "enso"       -0.15
    "ahoma"      -0.14
    "dfs"        -0.14
    "AFE"        -0.14
    "edith"      -0.14
    "fds"        -0.13
    "ê°ij"       -0.13
    " IDX"       -0.13
    " Ðĵолов"    -0.13
    Positive Logits
    " caption"   0.21
    "-caption"   0.20
    "ì§"         0.19
    " figure"    0.17
    "caption"    0.16
    " Caption"   0.16
    "Caption"    0.15
    "kening"     0.15
    " legend"    0.15
    " sizing"    0.15
    Act Density 0.090%

    No Known Activations