INDEX
Model: gemma-2-9b-it
Layer #: 20
Steering Hook: blocks.20.hook_resid_pre
Steering Strength: 74.5
Uploader: bot-neuronpedia
Created At: 2/15/2025 1:06:43 AM
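The steering settings above can be read as "add a scaled copy of the vector to the residual stream entering block 20". The following is a minimal sketch of that idea in plain Python, not Neuronpedia's implementation. The function name `steer`, the toy 3-dimensional residual stream, and the example `vector` are all hypothetical, and the page does not say whether the vector is normalized before it is scaled.

```python
def steer(resid, vector, strength):
    """Add strength * vector to the residual vector at every token position."""
    return [[x + strength * v for x, v in zip(row, vector)] for row in resid]


# Toy residual stream: 2 token positions, hidden size 3.
# (The real model's hidden size is much larger; this is for illustration only.)
resid = [[0.0, 1.0, 2.0], [1.0, 1.0, 1.0]]

# Hypothetical steering direction; the actual raw vector is not reproduced here.
vector = [1.0, 0.0, -1.0]

# 74.5 is the steering strength listed in the metadata.
steered = steer(resid, vector, strength=74.5)
print(steered)
```

In a hooked model, the same addition would be applied to the activation captured at `blocks.20.hook_resid_pre` before the forward pass continues.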
Explanations
terms related to metrics and evaluations in various analyses
Negative Logits
WriteTagHelper    -0.50
Smarty            -0.49
SYLLABLE          -0.47
Personendaten     -0.44
:✨                -0.43
tovers            -0.41
tanleria          -0.41
abestanden        -0.40
himo              -0.40
ècie              -0.40
Positive Logits
evaluation        0.65
measure           0.57
evaluate          0.57
evalu             0.56
evaluar           0.54
FormState         0.54
ValueStyle        0.52
Evaluation        0.52
InstrumentedTest  0.52
evaluating        0.51
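Lists like the negative and positive logits above are commonly produced by projecting a direction through the model's unembedding matrix and ranking tokens by the resulting score. The sketch below illustrates that under stated assumptions: the page does not confirm this is how its numbers were computed, and the function `token_logit_effects`, the 2-dimensional `direction`, and the three-token `unembed` table are hypothetical toy values.

```python
def token_logit_effects(direction, unembed):
    """Dot the direction with each token's unembedding column."""
    return {
        token: sum(d * w for d, w in zip(direction, column))
        for token, column in unembed.items()
    }


# Hypothetical toy unembedding: token -> column of weights (hidden size 2).
unembed = {
    "evaluation": [0.5, 0.25],
    "measure": [0.5, 0.0],
    "Smarty": [-0.5, 0.0],
}
direction = [1.0, 0.5]  # hypothetical direction, not the real vector

effects = token_logit_effects(direction, unembed)

# Highest scores are the tokens the direction promotes; lowest are suppressed.
ranked = sorted(effects, key=effects.get, reverse=True)
print(ranked)
```

Tokens at the top of `ranked` correspond to a "positive logits" list and tokens at the bottom to a "negative logits" list.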
Activations Density 0.011%
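For a sense of scale, a density of 0.011% corresponds to roughly 11 active tokens per 100,000. The sketch below assumes density means the fraction of tokens whose activation exceeds a threshold (zero here); that definition, the function name `activation_density`, and the synthetic activation list are assumptions for illustration, not data from this page.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation is strictly above the threshold."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic example: 11 active tokens out of 100,000.
acts = [0.0] * 99_989 + [3.2] * 11

density = activation_density(acts)
print(f"{density:.3%}")
```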