INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
52
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
significant numbers, statistical references, and structural indicators in text
New Auto-Interp
Negative Logits
Seznam
-0.40
compressor
-0.36
nawr
-0.35
supersonic
-0.34
nitrous
-0.34
switchTo
-0.33
)
-0.33
фик
-0.32
off
-0.31
vapour
-0.31
POSITIVE LOGITS
InstrumentedTest
0.68
betweenstory
0.63
LookAnd
0.62
fromnode
0.60
useAppContext
0.60
rrggbb
0.58
transQ
0.58
majánló
0.57
desmotivaciones
0.56
queſta
0.55
Activations Density 4.872%