    Explanations

    references and citations in academic writing

    Negative Logits
    "VD"        -0.16
    "_mex"      -0.15
    "народ"     -0.15
    "@nate"     -0.14
    ".ce"       -0.14
    "bam"       -0.13
    "킹"        -0.13
    "ô"         -0.13
    "-Token"    -0.13
    "晴"        -0.12
    Positive Logits
    "Нас"       0.17
    "26"        0.16
    " vi"       0.16
    "34"        0.15
    " Kindle"   0.15
    " facing"   0.15
    "oggler"    0.15
    "22"        0.15
    "zcze"      0.15
    "15"        0.14
    Activation Density: 0.054%

    No Known Activations