INDEX
    Explanations

    references to quotes or sources marked with 'W'

    instances of the letter "W" in uppercase

    New Auto-Interp
    Negative Logits
     unpre
    -0.75
     gratification
    -0.75
    arial
    -0.73
     apprehension
    -0.67
     afore
    -0.64
    uate
    -0.63
     sucker
    -0.62
     locality
    -0.62
    İĭ
    -0.60
     tion
    -0.60
    POSITIVE LOGITS
    atts
    1.30
    restling
    1.25
    nesday
    1.18
    OW
    1.15
    orst
    1.09
    atson
    1.09
    anted
    1.07
    esley
    1.06
    atcher
    1.06
    izard
    1.05
    Act Density 0.075%

    No Known Activations