INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
59.5
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
words that indicate significant findings or highlights in research and analysis
New Auto-Interp
Negative Logits
:✨
-0.67
$_['
-0.63
WriteTagHelper
-0.60
himo
-0.59
AssemblyCulture
-0.58
GEBURTSDATUM
-0.58
Audiodateien
-0.55
estacks
-0.55
bluetooth
-0.54
BTU
-0.54
POSITIVE LOGITS
débats
0.45
discussing
0.37
membahas
0.36
charla
0.35
discusión
0.34
sesión
0.34
discourse
0.34
observar
0.34
discussions
0.34
discussion
0.33
Activations Density 0.000%