INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
73.5
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms related to health and safety regulations
New Auto-Interp
Negative Logits
Personendaten
-0.60
labelledby
-0.42
SYLLABLE
-0.42
zheimer
-0.41
WriteTagHelper
-0.38
grond
-0.37
Kariera
-0.37
andExpect
-0.37
getItemId
-0.37
GEBURTS
-0.36
POSITIVE LOGITS
guidelines
0.53
ValueStyle
0.52
+#+#
0.49
guidelines
0.48
コロナ禍
0.48
Guidelines
0.45
CreateTagHelper
0.44
ſelf
0.44
Guidelines
0.43
ſelves
0.43
Activations Density 0.002%