INDEX
    Explanations

    frequent occurrences of the word "the."

    New Auto-Interp
    Negative Logits
    AA
    -0.17
    VV
    -0.15
     guard
    -0.15
     ter
    -0.15
    am
    -0.15
     (
    -0.15
    au
    -0.15
    ura
    -0.14
    mo
    -0.14
    ree
    -0.14
    POSITIVE LOGITS
     sake
    0.28
     purposes
    0.26
    geries
    0.19
    zar
    0.17
    OfDay
    0.17
    aylight
    0.17
    OfString
    0.16
    unately
    0.16
     Mitar
    0.15
    beeld
    0.15
    Act Density 0.111%

    No Known Activations