INDEX
    Explanations

    references and citations in academic texts

    New Auto-Interp
    Negative Logits
    .*")]
    -0.74
    ranean
    -0.74
    Britannique
    -0.63
    ROIT
    -0.60
     Slate
    -0.59
    "]]
    -0.59
    ''');
    -0.58
     censi
    -0.58
    AndEndTag
    -0.57
    )':
    -0.57
    POSITIVE LOGITS
    ref
    1.94
    REF
    1.26
    Ref
    0.93
     REF
    0.92
     Ref
    0.85
     ref
    0.84
    refs
    0.78
    eqref
    0.68
    reg
    0.66
    AddHtmlAttribute
    0.65
    Act Density 0.132%

    No Known Activations