INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    64.5
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    terms related to denotation or definition

    New Auto-Interp
    Negative Logits
     ब्रेकडाउन
    -0.44
    avir
    -0.42
     وتسجيلات
    -0.42
     oppress
    -0.42
    󠁢
    -0.41
     immoral
    -0.40
     letz
    -0.39
    unconfirmed
    -0.38
    くれない
    -0.38
    󠁣
    -0.38
    POSITIVE LOGITS
    LookAnd
    0.58
    parsedMessage
    0.57
    principalTable
    0.57
     '\\;'
    0.53
    setVerticalGroup
    0.52
    InstrumentedTest
    0.49
    Hentet
    0.49
     unknownFields
    0.48
     typelib
    0.47
     abstrato
    0.45
    Act Density 0.004%

    No Known Activations