INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    49.25
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    symbols and formatting elements commonly used in text

    New Auto-Interp
    Negative Logits
     referenties
    -0.41
    Hauptartikel
    -0.40
    Litteratur
    -0.37
     nawr
    -0.37
    Smarty
    -0.36
    Literatur
    -0.36
    loaf
    -0.34
    erobic
    -0.34
    request
    -0.33
    出版年
    -0.32
    POSITIVE LOGITS
     kasarigan
    0.53
    LookAnd
    0.53
     betweenstory
    0.53
    fromnode
    0.51
     queſta
    0.49
    LEGGI
    0.48
    انتهای
    0.47
     BoxFit
    0.47
     laſſen
    0.47
    árbol
    0.47
    Act Density 0.698%

    No Known Activations