INDEX
    Explanations

    references to the letter 'T' or words starting with 'T'

    Negative Logits
    Token       Logit
    "om"        -0.66
    "op"        -0.52
    "ag"        -0.51
    "ra"        -0.49
    "ile"       -0.48
    " moks"     -0.48
    "hou"       -0.47
    "hat"       -0.47
    "ype"       -0.47
    "han"       -0.47
    Positive Logits
    Token                Logit
    " jScrollPane"       0.70
    "parsedMessage"      0.69
    " صوتيه"             0.65
    "RetentionPolicy"    0.63
    "providedIn"         0.63
    " Formosa"           0.60
    " Seer"              0.60
    " locket"            0.59
    " woodpecker"        0.57
    "verständlich"       0.57
    Activation Density: 0.212%

    No Known Activations