INDEX
    Model: gemma-2-9b-it
    Layer #: 20
    Steering Hook: blocks.20.hook_resid_pre
    Steering Strength: 74.5
    Uploader: bot-neuronpedia
    Created At: 2/15/2025 1:06:43 AM
    Explanations

    terms related to metrics and evaluations in various analyses

    Negative Logits
    WriteTagHelper    -0.50
    Smarty            -0.49
     SYLLABLE         -0.47
    Personendaten     -0.44
    :✨                -0.43
    tovers            -0.41
    tanleria          -0.41
    abestanden        -0.40
     himo             -0.40
    ècie              -0.40
    Positive Logits
     evaluation       0.65
     measure          0.57
     evaluate         0.57
     evalu            0.56
     evaluar          0.54
    FormState         0.54
    ValueStyle        0.52
     Evaluation       0.52
    InstrumentedTest  0.52
     evaluating       0.51
    Act Density: 0.011%

    No Known Activations
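
The steering settings listed on this page (hook `blocks.20.hook_resid_pre`, strength 74.5) suggest that a vector is added to the residual stream entering layer 20. The following is only a minimal sketch of that idea in plain Python, not Neuronpedia's implementation. It assumes the strength multiplies a unit-normalised direction, which this page does not confirm. The function name `steer_residual`, the toy residual values, and the toy direction are all hypothetical.

```python
import math

def steer_residual(resid, direction, strength):
    """Add `strength` times the unit-normalised `direction` to every token position.

    resid:     list of per-token residual vectors (list of lists of floats)
    direction: steering vector with the same width as each residual vector
    strength:  scalar multiplier (74.5 on this page)

    Assumption: the direction is normalised before scaling. The page does not
    say whether the raw vector or a unit vector is scaled.
    """
    norm = math.sqrt(sum(x * x for x in direction))
    if norm == 0:
        raise ValueError("steering direction must be non-zero")
    unit = [x / norm for x in direction]
    return [[r + strength * u for r, u in zip(row, unit)] for row in resid]

# Toy data: 2 token positions, width 4. A real residual stream is far wider.
resid = [[0.0, 1.0, 0.0, 0.0],
         [1.0, 0.0, 0.0, 0.0]]
direction = [0.0, 0.0, 3.0, 4.0]  # made-up vector with norm 5

steered = steer_residual(resid, direction, strength=74.5)
print(steered[0])
```

In a real setup the same addition would run inside a forward hook registered at the named hook point, so that every later layer sees the shifted residual stream.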