INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
74
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms related to architectural design and structures
New Auto-Interp
Negative Logits
Personendaten
-0.72
commonest
-0.46
MessageOf
-0.46
⤒
-0.45
OFDb
-0.44
witcher
-0.42
awaiter
-0.41
IDENTIFIED
-0.41
indefinite
-0.40
+:+
-0.40
POSITIVE LOGITS
architecture
0.63
architectural
0.60
building
0.59
construction
0.55
construcción
0.53
buildings
0.52
arquite
0.51
bangunan
0.51
IntoConstraints
0.51
arquitectura
0.51
Activations Density 0.000%