INDEX
    Model
    gemma-2-9b-it
    Layer #
    20
    Steering Hook
    blocks.20.hook_resid_pre
    Steering Strength
    44.5
    Uploader
    bot-neuronpedia
    Created At
    2/15/2025 1:06:43 AM
    Raw Vector
    Actions
    Explanations

    occurrences of the word "string" in programming-related contexts

    New Auto-Interp
    Negative Logits
     ویکی‌پدی
    -0.87
     незавершена
    -0.78
    ]]:
    -0.65
     kaarangay
    -0.63
     noDo
    -0.63
    "]="
    -0.61
     дописавши
    -0.57
    }{*}{
    -0.56
    ]]=
    -0.56
    tonode
    -0.54
    POSITIVE LOGITS
     chaud
    0.39
    soort
    0.38
     suspendu
    0.38
    Either
    0.37
     sufrimiento
    0.36
     BeautifulSoup
    0.35
     Pantai
    0.35
     världen
    0.35
    htiö
    0.35
     malheureux
    0.35
    Act Density 0.029%

    No Known Activations