INDEX
    Explanations

    article followed by descriptive word

    New Auto-Interp
    Negative Logits
     SUCCESSFULLY
    0.17
    #!/
    0.16
    0.16
     an
    0.16
     having
    0.16
    the
    0.15
    ately
    0.15
     preferentially
    0.15
    ably
    0.15
     the
    0.15
    POSITIVE LOGITS
     few
    0.30
     slight
    0.28
     bit
    0.27
     flurry
    0.27
    很好的
    0.26
     lot
    0.25
     fairly
    0.25
     nice
    0.25
     tremendous
    0.25
     little
    0.25
    Act Density 0.251%

    No Known Activations