INDEX
    Explanations

    mentions of specific symbols or formatting elements within text documents

    structural or statistical patterns in research data

    New Auto-Interp
    Negative Logits
     neighb
    -0.78
     swoop
    -0.76
     occas
    -0.72
     hopping
    -0.69
     replacements
    -0.67
     whine
    -0.67
     wiser
    -0.66
     whining
    -0.64
     whistle
    -0.63
     snap
    -0.63
    POSITIVE LOGITS
    ³³³³³³³³
    1.20
    ³³³³
    1.12
    Figure
    1.09
    ³³³
    1.07
    posted
    1.03
    ³³³³³³³³³³³³³³³³
    1.03
    METHOD
    1.02
    Nonetheless
    1.00
    Methods
    1.00
    âĵĺ
    0.98
    Act Density 0.308%

    No Known Activations