INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    0
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     immoral
    -0.43
    󠁢
    -0.41
    omitempty
    -0.39
     deleteById
    -0.39
     oppress
    -0.39
     nerfed
    -0.39
     ब्रेकडाउन
    -0.38
     Wich
    -0.38
    unków
    -0.37
     kém
    -0.36
    POSITIVE LOGITS
    AutoScaleMode
    0.66
     Normdatei
    0.64
    LookAnd
    0.62
     typelib
    0.62
    esModule
    0.61
    principalTable
    0.61
    تقاوى
    0.54
    InstrumentedTest
    0.54
     unknownFields
    0.54
     الحره
    0.53
    Act Density 0.000%

    No Known Activations