INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
64.5
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms related to denotation or definition
New Auto-Interp
Negative Logits
ब्रेकडाउन
-0.44
avir
-0.42
وتسجيلات
-0.42
oppress
-0.42
-0.41
immoral
-0.40
letz
-0.39
unconfirmed
-0.38
くれない
-0.38
-0.38
POSITIVE LOGITS
LookAnd
0.58
parsedMessage
0.57
principalTable
0.57
'\\;'
0.53
setVerticalGroup
0.52
InstrumentedTest
0.49
Hentet
0.49
unknownFields
0.48
typelib
0.47
abstrato
0.45
Activations Density 0.004%