INDEX
    Explanations

    instances of the word "thinking" followed by a number

    instances of the word "thinking."

    Negative Logits
    çĦ              -0.78
    feeding         -0.64
     Wrestling      -0.64
    CBC             -0.63
    Ann             -0.63
    any             -0.61
    videos          -0.60
    clad            -0.60
     Videos         -0.59
    owship          -0.59

    Positive Logits
     aloud          0.84
     provoking      0.81
    sonian          0.80
    cient           0.78
     about          0.74
    eteen           0.72
    ortment         0.70
    lass            0.70
    inery           0.69
     strategically  0.68
    Activation Density: 0.033%

    No Known Activations