INDEX
    Explanations

    instances of emphasis on the word 'the'

    New Auto-Interp
    Negative Logits
    èĥ
    -0.14
    à¥Ģय
    -0.14
    esser
    -0.14
    째
    -0.14
    amber
    -0.13
     concern
    -0.13
    urgeon
    -0.13
     Concern
    -0.13
    ìŀij
    -0.13
     ###
    -0.13
    POSITIVE LOGITS
    ibar
    0.17
     manner
    0.17
    ãĥŃãĥ¼
    0.17
    ailable
    0.16
    ontent
    0.15
    åºĶ
    0.15
    ãĥĭãĥ¼
    0.15
     amount
    0.14
    _Cmd
    0.14
    oot
    0.14
    Act Density 0.392%

    No Known Activations