INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    0
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     Wikimedijinoj
    -0.65
    şört
    -0.50
     himo
    -0.49
    WriteTagHelper
    -0.49
    -0.49
     THEORY
    -0.49
    èdia
    -0.49
     metast
    -0.48
     biodegradable
    -0.48
    tyimages
    -0.48
    POSITIVE LOGITS
     carefully
    0.59
     careful
    0.52
    ValueStyle
    0.47
     memperhatikan
    0.44
     soigneusement
    0.43
     caution
    0.42
     attenzione
    0.42
     dikkat
    0.40
     duquel
    0.40
    WaitGroup
    0.39
    Act Density 0.000%

    No Known Activations