    Explanations

    the word "boys", at varying activation strengths

    occurrences of the word "boys."

    Negative Logits
    Token                  Logit
    "mediated"             -0.73
    "uncture"              -0.71
    "Accessory"            -0.69
    "Sharp"                -0.69
    "aeda"                 -0.68
    "ãĥķãĤ©"               -0.67
    "osi"                  -0.66
    "ascular"              -0.66
    "inventoryQuantity"    -0.66
    "itures"               -0.66
    Positive Logits
    Token                  Logit
    " boys"                0.97
    " Scouts"              0.97
    "friend"               0.85
    "hood"                 0.83
    "boys"                 0.81
    "hift"                 0.79
    " puberty"             0.78
    " ages"                0.77
    " Boys"                0.77
    " girls"               0.76
    Activation Density: 0.013%

    No Known Activations