INDEX
    Explanations

    the word "think" in various contexts

    New Auto-Interp
    Negative Logits
    API
    -0.65
    irement
    -0.65
    worldly
    -0.65
    ilaterally
    -0.65
    Submit
    -0.64
    thinkable
    -0.64
     Yourself
    -0.63
    eatured
    -0.62
    irrel
    -0.61
    ulent
    -0.61
    POSITIVE LOGITS
     it
    0.76
     whoever
    0.75
     everybody
    0.74
     anybody
    0.71
     thats
    0.71
    76561
    0.69
     there
    0.69
     Cantor
    0.68
     anecd
    0.67
     Seb
    0.66
    Act Density 0.599%

    No Known Activations