INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    76
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    terms related to collaboration and engagement in discussions or forums

    New Auto-Interp
    Negative Logits
    GEBURTS
    -0.50
     Paglinawan
    -0.44
    zheimer
    -0.43
    GEBURTSDATUM
    -0.43
    doria
    -0.42
    ]',
    -0.41
    Kariera
    -0.40
    ),),
    -0.40
    arkov
    -0.40
     unlucky
    -0.39
    POSITIVE LOGITS
     engagement
    0.57
     audience
    0.57
    Engagement
    0.50
    fromnode
    0.49
    engagement
    0.49
     outreach
    0.49
    Engage
    0.48
     engage
    0.48
    audience
    0.48
     audiences
    0.47
    Act Density 0.000%

    No Known Activations