INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    58.75
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    punctuation and its usage in text

    New Auto-Interp
    Negative Logits
    :✨
    -0.45
     chemic
    -0.44
    lgica
    -0.43
    uxxxx
    -0.41
    省市镇
    -0.39
    astéroïdes
    -0.39
     sinner
    -0.38
    ]',
    -0.38
     diagnose
    -0.38
    aarrggbb
    -0.38
    POSITIVE LOGITS
     betweenstory
    0.50
    ########.
    0.47
    ContentAlignment
    0.47
     coinciden
    0.45
    คลิ
    0.44
     שוליים
    0.44
    NOPQRST
    0.42
    +#+#
    0.41
     agree
    0.41
    agree
    0.41
    Act Density 0.064%

    No Known Activations