INDEX
    Explanations

    instances of the word "talking" and its variations, indicating a focus on dialogue or discussions

    New Auto-Interp
    Negative Logits
    emale
    -0.76
    iverpool
    -0.75
    uilt
    -0.73
    feeding
    -0.70
    boa
    -0.70
    peria
    -0.68
    metic
    -0.67
    eele
    -0.65
    cffff
    -0.64
    proc
    -0.63
    POSITIVE LOGITS
     Heads
    0.87
     louder
    0.86
     about
    0.82
     aloud
    0.81
     Points
    0.78
     loudly
    0.78
    heads
    0.74
     voices
    0.72
    Points
    0.72
     filibuster
    0.71
    Act Density 0.020%

    No Known Activations