INDEX
    Explanations

    components of references and citations in texts

    New Auto-Interp
    Negative Logits
    anki
    -0.17
    rahim
    -0.15
    änger
    -0.15
    abra
    -0.15
    weit
    -0.15
    ngrx
    -0.14
    roje
    -0.14
     Lam
    -0.14
    ati
    -0.14
    ayette
    -0.14
    POSITIVE LOGITS
    ULATE
    0.16
    iles
    0.14
     xPos
    0.14
    bsd
    0.14
    ullen
    0.14
    elsif
    0.14
    HB
    0.14
    tb
    0.14
    hausen
    0.13
    _Tis
    0.13
    Act Density 0.003%

    No Known Activations