INDEX
Model: gemma-2-9b-it
Layer: 20
Steering Hook: blocks.20.hook_resid_pre
Steering Strength: 81
Uploader: bot-neuronpedia
Created At: 2/15/2025 1:06:43 AM
Explanations
terms related to labor relations and disputes
Negative Logits
Token            Logit
فريبيس           -0.62
zheimer          -0.50
Captor           -0.50
andExpect        -0.49
realy            -0.49
instancetype     -0.47
Audiodateien     -0.47
styleType        -0.47
ویکیپدیای        -0.47
]")]             -0.46
Positive Logits
Token            Logit
workers          0.46
workers          0.42
unions           0.40
union            0.40
worker           0.39
trabajo          0.38
lucha            0.38
worker           0.37
disputes         0.36
employees        0.35
Activations Density: 0.000%
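
The Steering Hook and Steering Strength fields describe where the vector is applied and how strongly. As a rough illustration only, the sketch below assumes steering means adding `strength * vector` to the residual-stream activation at every token position. The page does not confirm this formula or any normalization, and the function name `steer_residual`, the toy width of 3, and the example direction are all hypothetical.

```python
from typing import List


def steer_residual(
    resid: List[List[float]], vector: List[float], strength: float
) -> List[List[float]]:
    """Return a copy of resid with strength * vector added at every position.

    Assumption: additive steering with no normalization of the vector.
    """
    if any(len(row) != len(vector) for row in resid):
        raise ValueError("vector width must match residual width")
    return [[x + strength * v for x, v in zip(row, vector)] for row in resid]


# Toy data: 2 token positions, residual width 3 (a real model is far wider).
resid = [[0.0, 1.0, 2.0], [1.0, 1.0, 1.0]]
vector = [1.0, 0.0, -1.0]  # hypothetical steering direction

# 81 is the Steering Strength listed on this page.
steered = steer_residual(resid, vector, strength=81)
print(steered)
```

In a real setup the same addition would be done inside a forward hook registered at the hook point named above (blocks.20.hook_resid_pre), so that layer 20 receives the shifted activations as its input.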