INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    57.25
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    terms related to health metrics and outcomes

    New Auto-Interp
    Negative Logits
     ModelExpression
    -0.69
    Personendaten
    -0.69
    WriteTagHelper
    -0.64
    Бахар
    -0.61
    :✨
    -0.61
     lenker
    -0.59
     وتسجيلات
    -0.59
    DockStyle
    -0.57
     ***!
    -0.57
     AssemblyCulture
    -0.54
    POSITIVE LOGITS
     health
    0.35
     durs
    0.34
     ſur
    0.32
     duros
    0.32
     desvi
    0.32
     lucha
    0.31
     þat
    0.30
     ſtate
    0.30
     individuals
    0.29
     purpoſe
    0.29
    Act Density 0.021%

    No Known Activations