INDEX
    Explanations

    occurrences of the word "this."

    New Auto-Interp
    Negative Logits
    the
    -0.15
     equivalents
    -0.15
     fortunes
    -0.14
    theless
    -0.14
    that
    -0.14
     the
    -0.14
    askell
    -0.14
     synonyms
    -0.14
     strides
    -0.14
     repercussions
    -0.13
    POSITIVE LOGITS
     particular
    0.51
     type
    0.36
     kind
    0.35
     same
    0.32
    /th
    0.32
     latest
    0.32
     sort
    0.30
     entire
    0.29
     week
    0.29
     year
    0.28
    Act Density 0.433%

    No Known Activations