    Model: gemma-2-9b-it
    Layer: 20
    Steering Hook: blocks.20.hook_resid_pre
    Steering Strength: 47.75
    Uploader: bot-neuronpedia
    Created At: 2/15/2025 1:06:43 AM
    Explanations

    punctuation and emotional expressions

    Negative Logits
    " ste"              -0.37
    " hin"              -0.36
    " Иванович"         -0.36
    "wood"              -0.35
    " hyp"              -0.35
    "goog"              -0.35
    " vers"             -0.34
    "綿"                -0.34
    " documentation"    -0.34
    "ктория"            -0.34
    Positive Logits
    " bezeichneter"     0.76
    "fromnode"          0.57
    " nakalista"        0.57
    "InstrumentedTest"  0.57
    "TagMode"           0.56
    "évaluateur"        0.55
    " propOrder"        0.55
    "InjectAttribute"   0.54
    " snippetHide"      0.54
    "cerpt"             0.53
    Activation Density: 0.386%

    No Known Activations