INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    28.5
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    quotation marks and formatting syntax in text

    New Auto-Interp
    Negative Logits
     bubuk
    -0.31
     někdo
    -0.31
     réputation
    -0.31
     économies
    -0.30
     maleta
    -0.29
    addWidget
    -0.28
     manteiga
    -0.28
     leña
    -0.28
     reputación
    -0.28
     miteinander
    -0.28
    POSITIVE LOGITS
    rrggbb
    0.76
    0.70
     propOrder
    0.69
     متعلقه
    0.67
    <unused1>
    0.64
    <unused3>
    0.64
    <unused15>
    0.64
    <unused51>
    0.64
    <unused52>
    0.64
    <unused55>
    0.64
    Act Density 2.905%

    No Known Activations