INDEX
    Explanations

    mentions of a particular individual named "Kid" with varying activation strengths

    references to the term "Kid" as it relates to specific individuals or contexts

    New Auto-Interp
    Negative Logits
    ãĥ¼ãĥĨãĤ£
    -0.78
     commission
    -0.72
    aukee
    -0.70
     weekday
    -0.68
     confir
    -0.66
     merge
    -0.65
     coalition
    -0.64
     ministers
    -0.63
     fuse
    -0.63
     timestamp
    -0.63
    POSITIVE LOGITS
     Icar
    1.12
    neys
    1.08
    ney
    0.98
    bean
    0.93
     Doodle
    0.90
    amac
    0.88
    Kid
    0.87
    sie
    0.87
    stones
    0.85
    pad
    0.83
    Act Density 0.028%

    No Known Activations