INDEX
    Explanations

    expressions of thought or contemplation

    "think" or "thinking"

    thinking about or asking about

    New Auto-Interp
    Negative Logits
     Guill
    -0.47
     »
    -0.46
     Kob
    -0.44
     «
    -0.44
    <eos>
    -0.42
    я
    -0.41
    »
    -0.41
    «
    -0.40
     Jacobsen
    -0.40
     Quig
    -0.40
    POSITIVE LOGITS
     THINK
    1.74
     think
    1.74
    Think
    1.73
     Think
    1.72
    think
    1.68
    THINK
    1.59
     thinks
    1.54
     thinking
    1.41
     Thinking
    1.36
    thinking
    1.34
    Act Density 0.179%

    No Known Activations