INDEX
    Explanations

    phrases related to advice or steps to take

    advice related to purchasing decisions or precautions to take when buying something

    New Auto-Interp
    Negative Logits
     Democr
    -0.73
    hawks
    -0.73
     )]
    -0.73
    "},
    -0.70
    Streamer
    -0.69
    ĸļ
    -0.66
    twitter
    -0.64
    hai
    -0.63
    apo
    -0.62
    Translation
    -0.60
    POSITIVE LOGITS
     yourself
    1.37
     yourselves
    1.08
     your
    1.04
    your
    1.02
     Yourself
    1.00
     YOUR
    0.93
    Your
    0.91
     confidently
    0.89
     yours
    0.87
     Your
    0.87
    Act Density 0.737%

    No Known Activations