INDEX
    Explanations

    motivation and goals

    This neuron activates on words related to perseverance and disciplined effort, such as persistence, resistance, sacrifice, and staying focused.

    New Auto-Interp
    Negative Logits
    füg
    -0.07
     IPP
    -0.07
     '-
    -0.07
    ImageData
    -0.07
     книж
    -0.07
    -0.07
     kvinn
    -0.07
     memorial
    -0.06
    ayı
    -0.06
     ETA
    -0.06
    POSITIVE LOGITS
    -square
    0.07
    086
    0.07
    ức
    0.06
     directional
    0.06
    oggled
    0.06
     proficiency
    0.06
    0.06
     analyzing
    0.05
    Figure
    0.05
    0.05
    Act Density 0.031%

    No Known Activations