    Model: gemma-2-9b-it
    Layer: 20
    Steering Hook: blocks.20.hook_resid_pre
    Steering Strength: 73.5
    Uploader: bot-neuronpedia
    Created At: 2/15/2025 1:06:43 AM
    Explanations

    terms related to health and safety regulations

    Negative Logits
    Token               Logit
    "Personendaten"     -0.60
    "labelledby"        -0.42
    " SYLLABLE"         -0.42
    "zheimer"           -0.41
    "WriteTagHelper"    -0.38
    " grond"            -0.37
    "Kariera"           -0.37
    "andExpect"         -0.37
    " getItemId"        -0.37
    "GEBURTS"           -0.36

    Positive Logits
    Token               Logit
    " guidelines"       0.53
    "ValueStyle"        0.52
    "+#+#"              0.49
    "guidelines"        0.48
    "コロナ禍"          0.48
    "Guidelines"        0.45
    " CreateTagHelper"  0.44
    "ſelf"              0.44
    " Guidelines"       0.43
    "ſelves"            0.43

    Activation Density: 0.002%

    No Known Activations