INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
59
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
instances of high-impact words or phrases that suggest importance or classification in contexts like rules, procedures, or significant events
New Auto-Interp
Negative Logits
météo
-0.34
ėte
-0.32
traditionally
-0.31
Historically
-0.31
nawr
-0.31
ekor
-0.31
)
-0.30
terjun
-0.30
historically
-0.29
ząd
-0.29
POSITIVE LOGITS
noDo
0.66
webElementXpaths
0.61
Manbalar
0.59
ſelf
0.57
AssemblyCompany
0.55
purpoſe
0.55
Хьажоргаш
0.54
ContentAlignment
0.54
CanadaChoose
0.54
fromnode
0.53
Activations Density 1.069%