INDEX
    Explanations

    phrases related to actions or events involving protection, assistance, support, or intention

    occurrences of the word "the."

    New Auto-Interp
    Negative Logits
    ãĥĺ
    -0.74
     with
    -0.73
    bet
    -0.72
    herer
    -0.69
    here
    -0.68
    SHIP
    -0.66
    Reviewed
    -0.65
    lessly
    -0.64
    raf
    -0.64
    ord
    -0.64
    POSITIVE LOGITS
     exception
    1.32
     utmost
    1.28
     intention
    1.12
     slightest
    1.09
     caveat
    1.03
     same
    1.02
     highest
    0.99
     greatest
    0.97
     afore
    0.96
     expectation
    0.95
    Act Density 0.177%

    No Known Activations