INDEX
    Explanations

    references and citations in text

    references or citations within the document

    New Auto-Interp
    Negative Logits
     heights
    -0.72
    daq
    -0.71
    ################
    -0.68
    uliffe
    -0.66
     whiff
    -0.66
     deed
    -0.64
    hawk
    -0.64
    CHO
    -0.62
    ;;;;;;;;;;;;
    -0.62
    adena
    -0.62
    POSITIVE LOGITS
    eree
    1.33
    erences
    1.20
    lection
    1.19
    inement
    1.15
    lections
    1.14
    actor
    1.14
    erred
    1.14
    erential
    1.14
    riger
    1.12
    eren
    1.11
    Act Density 0.009%

    No Known Activations