INDEX
    Explanations

    punctuation and formatting elements in numerical references or citations

    New Auto-Interp
    Negative Logits
    utherford
    -0.18
    chod
    -0.16
    575
    -0.16
    ACHI
    -0.15
    ize
    -0.14
     barring
    -0.14
    hum
    -0.14
    .twitch
    -0.14
    ened
    -0.14
     Wi
    -0.14
    POSITIVE LOGITS
    ansson
    0.16
    bbb
    0.16
    ripple
    0.15
    inou
    0.15
    intel
    0.14
    itel
    0.14
    brero
    0.14
    loth
    0.14
     MOUSE
    0.14
    eger
    0.14
    Act Density 0.002%

    No Known Activations