INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    49.5
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    Python function definitions

    New Auto-Interp
    Negative Logits
     AssemblyCulture
    -0.62
    WriteTagHelper
    -0.55
    }{*}{
    -0.50
    Diweddarwch
    -0.49
    それとも
    -0.47
    lewati
    -0.46
    -------------</
    -0.45
     enfans
    -0.45
    ChildScrollView
    -0.45
     HasFactory
    -0.44
    POSITIVE LOGITS
    XmlAccessorType
    0.46
    IBOutlet
    0.46
    InstrumentedTest
    0.45
     propOrder
    0.44
    Count
    0.43
    Override
    0.43
    count
    0.42
    fines
    0.41
    WithIOException
    0.41
    def
    0.41
    Act Density 0.065%

    No Known Activations