INDEX
    Explanations

    instances of the word "reference" and its variations, indicating a focus on citations or references in the text

    Negative Logits
    Token         Logit
    "iron"        -0.16
    "ForeignKey"  -0.15
    " frank"      -0.15
    "ern"         -0.15
    "ards"        -0.15
    "arr"         -0.15
    "arily"       -0.14
    "eenth"       -0.14
    " chịu"       -0.14
    "ивши"        -0.14
    Positive Logits
    Token         Logit
    "resher"      0.18
    "izes"        0.18
    "luž"         0.16
    "coni"        0.15
    "oldem"       0.15
    "exual"       0.15
    "ential"      0.15
    "εια"         0.15
    "attles"      0.14
    "utable"      0.14
    Activation Density: 0.058%

    No Known Activations