INDEX
    Explanations

    instances of the word "thought" followed by a positive evaluation or thoughtful consideration

    New Auto-Interp
    Negative Logits
    ãĤ´ãĥ³
    -0.81
     Nanto
    -0.74
    ãĤ¬
    -0.74
    è¦ļéĨĴ
    -0.73
    ãĤº
    -0.72
    OURCE
    -0.72
    adra
    -0.68
    anwhile
    -0.64
     guiName
    -0.63
    */(
    -0.62
    POSITIVE LOGITS
     joking
    0.97
     cute
    0.93
     funny
    0.88
     invincible
    0.84
     hilarious
    0.82
     kidding
    0.82
     quaint
    0.80
     cool
    0.79
     silly
    0.75
     innocuous
    0.75
    Act Density 0.183%

    No Known Activations