INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
66
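As a rough illustration of what the hook and strength above mean in practice, the sketch below adds a scaled vector to every position of a residual-stream tensor. It assumes plain additive steering (`resid + strength * vector`); whether the vector is normalized or rescaled before being applied is not stated on this page. The function name `steer_residual` and the toy values are hypothetical, and nested lists stand in for real tensors.

```python
from typing import List

HOOK_NAME = "blocks.20.hook_resid_pre"  # steering hook listed on this page
STRENGTH = 66.0                         # steering strength listed on this page


def steer_residual(
    resid: List[List[float]],
    vector: List[float],
    strength: float = STRENGTH,
) -> List[List[float]]:
    """Return a copy of `resid` with `strength * vector` added at every position.

    `resid` has shape [positions][d_model]; `vector` has length d_model.
    Assumption: steering is a plain additive shift of the residual stream.
    """
    steered = []
    for position in resid:
        if len(position) != len(vector):
            raise ValueError("vector length must match the residual width")
        steered.append([x + strength * v for x, v in zip(position, vector)])
    return steered


# Toy example: 2 token positions, d_model = 3 (the real model is far wider).
toy_resid = [[0.0, 1.0, 2.0], [1.0, 1.0, 1.0]]
toy_vector = [1.0, 0.0, -0.5]  # hypothetical values, not the real vector
steered = steer_residual(toy_resid, toy_vector)
```

In a real setup this function body would run inside a forward hook registered at `HOOK_NAME`, so that layer 20 sees the shifted residual stream.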
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Explanations
concepts related to responsibility and compliance in organizational contexts
Negative Logits
Personendaten
-0.56
lito
-0.45
SequentialGroup
-0.45
zheimer
-0.44
Bellow
-0.43
meisten
-0.43
énéral
-0.41
sistors
-0.41
Larg
-0.40
diagnose
-0.40
Positive Logits
Manbalar
0.48
ValueStyle
0.44
+#+#
0.44
betweenstory
0.42
Administrativna
0.41
herence
0.40
utilice
0.39
FormState
0.38
attenzione
0.38
cumplimiento
0.37
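Token lists like the two above are commonly produced by projecting a direction through the model's unembedding matrix and reporting the tokens with the largest and smallest resulting scores. The sketch below shows that idea on a toy vocabulary; it is an assumption about how such lists are typically computed, not a description of this page's exact pipeline. The helper names, the toy tokens, and the weights are all hypothetical.

```python
from typing import Dict, List, Tuple


def logit_effects(
    vector: List[float], unembed: Dict[str, List[float]]
) -> Dict[str, float]:
    """Dot the direction with each token's unembedding column."""
    return {
        token: sum(v * w for v, w in zip(vector, column))
        for token, column in unembed.items()
    }


def top_and_bottom(
    effects: Dict[str, float], k: int
) -> Tuple[List[Tuple[str, float]], List[Tuple[str, float]]]:
    """Return the k most positive and the k most negative (token, score) pairs."""
    ranked = sorted(effects.items(), key=lambda item: item[1], reverse=True)
    positive = ranked[:k]
    negative = ranked[-k:][::-1]  # most negative first
    return positive, negative


# Toy vocabulary with d_model = 2 (hypothetical tokens and weights).
toy_unembed = {
    "a": [0.5, 0.0],
    "b": [0.0, 0.5],
    "c": [0.25, 0.25],
    "d": [-0.25, 0.0],
}
toy_direction = [1.0, -1.0]

effects = logit_effects(toy_direction, toy_unembed)
positive, negative = top_and_bottom(effects, k=2)
```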
Activations Density 0.008%
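To make the density figure concrete, the sketch below treats activation density as the fraction of sampled tokens on which the activation exceeds a threshold. That definition, the function name `activation_density`, and the toy sample are assumptions for illustration; the page itself does not say how the figure is computed.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Fraction of tokens whose activation is strictly above `threshold`."""
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy sample: 1 active token out of 12,500, which matches a 0.008% density.
toy_activations = [0.0] * 12499 + [3.2]
density = activation_density(toy_activations)
density_label = f"{density:.3%}"
```

Under this reading, a density of 0.008% means roughly one token in every 12,500 activates the feature.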