INDEX
    Explanations

    the word "noticed."

    instances of the word “notice” and its variations, indicating observations or awareness

    New Auto-Interp
    Negative Logits
    quer
    -0.73
    prep
    -0.73
     negotiator
    -0.69
    export
    -0.67
    cop
    -0.67
    wives
    -0.66
    cise
    -0.66
    ccording
    -0.65
    ãĥ³ãĤ¸
    -0.65
    venge
    -0.65
    POSITIVE LOGITS
     how
    0.78
    cules
    0.76
    ury
    0.73
     noticed
    0.71
    adow
    0.68
    lessly
    0.66
     notices
    0.64
    flies
    0.64
    enance
    0.63
     spikes
    0.62
    Act Density 0.023%

    No Known Activations