INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    45.75
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    technical terminology and uncertainty expressions in policy or regulatory contexts

    New Auto-Interp
    Negative Logits
     Schild
    -0.36
     koop
    -0.32
    Weblinks
    -0.31
     asum
    -0.30
    platte
    -0.30
    janja
    -0.30
     esternos
    -0.30
     fundido
    -0.30
     pukul
    -0.29
     Wunsch
    -0.29
    POSITIVE LOGITS
    LookAnd
    0.69
    fromnode
    0.69
    帖最后由
    0.62
     queſta
    0.54
    :+:
    0.51
     Administrativna
    0.51
    setVerticalGroup
    0.50
    PerformLayout
    0.49
    InstrumentedTest
    0.49
    ioneer
    0.47
    Act Density 4.208%

    No Known Activations