INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    68.5
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    references to fictional characters and their attributes

    New Auto-Interp
    Negative Logits
     تضيفلها
    -0.68
    httphttps
    -0.68
     OMITBAD
    -0.67
     Audiodateien
    -0.63
    出版年
    -0.61
    andExpect
    -0.59
     nahilalakip
    -0.59
     lizenzfreies
    -0.58
    лтемелер
    -0.56
    Jeografia
    -0.56
    POSITIVE LOGITS
     characters
    0.45
     actors
    0.37
     heroes
    0.34
    like
    0.33
    role
    0.33
     personnages
    0.33
    évaluateur
    0.33
     like
    0.33
     portrayed
    0.32
     personaggi
    0.31
    Act Density 0.063%

    No Known Activations